Hw18 by SuleimanovShakir · Pull Request #4 · SuleimanovShakir/SequenceMaster

SuleimanovShakir · 2024-04-30T20:28:52Z

No description provided.

iam28th

Shakir, good afternoon!

Refactoring
The code contains commented-out fragments, type annotations are missing in places, and the import order does not always follow the accepted convention. To deal with the import order I strongly recommend a tool such as isort or ruff (I use ruff myself).
That costs 3 points here.
A superb README that clearly deserves 3 bonus points (had this been the first assignment to include one, I would have given more :)).
(12+3)/15
Random forest
The fit implementation looks plausible, but predict is not parallelized (-10 points) and there are no examples (-5 points).
10/25
Tests
Everything is fine here, 10/10

In total you get (32+3)/50 for this homework.

The repository and the code are a pleasure to look at :)
Good luck with your defense at IB and in your future career!
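
To make the remark about predict concrete, here is a minimal sketch of how per-tree predictions could be spread over a worker pool. It is not the code from this PR: the function names, the `executor_cls` parameter, and the choice to pass the full `X` to every tree are assumptions, and it presumes every fitted tree exposes `predict_proba` and saw the same set of classes.

```python
from concurrent.futures import Executor, ProcessPoolExecutor
from typing import Any, Sequence, Type

import numpy as np


def _tree_proba(tree: Any, X: np.ndarray) -> np.ndarray:
    """Return class probabilities from one fitted tree (runs in a worker)."""
    return tree.predict_proba(X)


def parallel_predict_proba(
    trees: Sequence[Any],
    X: np.ndarray,
    n_jobs: int = 2,
    executor_cls: Type[Executor] = ProcessPoolExecutor,  # hypothetical knob
) -> np.ndarray:
    """Average per-tree probabilities, evaluating the trees in a pool."""
    with executor_cls(max_workers=n_jobs) as pool:
        # map() keeps the order of `trees`; each task receives (tree, X).
        per_tree = list(pool.map(_tree_proba, trees, [X] * len(trees)))
    # Assumes all trees return arrays of the same shape (same classes seen).
    return np.mean(per_tree, axis=0)


def parallel_predict(
    trees: Sequence[Any],
    X: np.ndarray,
    n_jobs: int = 2,
    executor_cls: Type[Executor] = ProcessPoolExecutor,
) -> np.ndarray:
    """Pick the class with the highest averaged probability."""
    probas = parallel_predict_proba(trees, X, n_jobs, executor_cls)
    return np.argmax(probas, axis=1)
```

With a process pool the trees and `X` are pickled for each task, so on platforms that spawn worker processes the call should sit under `if __name__ == "__main__":`.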

iam28th · 2024-05-06T11:25:54Z

+            #dump(current_tree, f'{os.getcwd()}/my_randomforest_model_{pid}.joblib') 
+
+        #processes = [multiprocessing.Process(target=fit_tree, args=(data, pid)) for pid, data in enumerate(self.bootstrapped_data)]
+        #for proc in processes:
+        #    proc.start()
+        #for proc in processes:
+        #    proc.join()
+        #return results
+        return self


commented-out chunks of code should of course be removed...

iam28th · 2024-05-06T11:26:44Z

+        records = []
+        while not self.StopIter:
+            records.append(self.__next__())
+        return records


this file is missing type hints
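
As an illustration of the requested annotations, here is a minimal sketch of an annotated FASTA iterator. The class name `FastaReader`, its attributes, and the parsing logic are hypothetical and are not the PR's code; only the `FastaRecord` name is taken from the commit list.

```python
from dataclasses import dataclass
from typing import Iterator, List, Optional, TextIO


@dataclass
class FastaRecord:
    seq_id: str
    seq: str


class FastaReader:
    """Hypothetical stand-in for the reviewed class, with type hints."""

    def __init__(self, handle: TextIO) -> None:
        self._lines: Iterator[str] = (line.strip() for line in handle)
        # Header of the record that the next __next__ call will return.
        self._pending_header: Optional[str] = next(self._lines, None)

    def __iter__(self) -> "FastaReader":
        return self

    def __next__(self) -> FastaRecord:
        if self._pending_header is None:
            raise StopIteration
        header = self._pending_header
        self._pending_header = None
        chunks: List[str] = []
        for line in self._lines:
            if line.startswith(">"):
                # Start of the following record: remember it and stop.
                self._pending_header = line
                break
            chunks.append(line)
        return FastaRecord(seq_id=header[1:], seq="".join(chunks))

    def read_records(self) -> List[FastaRecord]:
        """Collect all remaining records; list() consumes the iterator."""
        return list(self)
```

Relying on `StopIteration` through `list(self)` also removes the need for a manual stop flag in the collecting loop.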

iam28th · 2024-05-06T11:26:52Z

+        '''
+        probas = self.predict_proba(X)
+        predictions = np.argmax(probas, axis=1)
+        return predictions


this file is missing type hints

iam28th · 2024-05-06T11:28:34Z

+        return (float): The mass of the protein.
+        '''
        mass = sum(self.dictionary.get(aa) for aa in self.sequence)
        return mass


the types need to be annotated
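
One possible annotated version of such a mass method is sketched here. The class name `ProteinSequence`, the method name `molecular_weight`, and the two residue masses (approximate values, for illustration only) are assumptions, not the PR's code.

```python
from typing import Dict


class ProteinSequence:
    """Hypothetical minimal class used only to show the annotations."""

    def __init__(self, sequence: str, dictionary: Dict[str, float]) -> None:
        self.sequence = sequence
        self.dictionary = dictionary

    def molecular_weight(self) -> float:
        """Return the summed residue mass of the sequence."""
        # Indexing (instead of .get) raises KeyError on an unknown residue
        # rather than failing later with a TypeError on None.
        # The 0.0 start value keeps the result a float for empty sequences.
        return sum((self.dictionary[aa] for aa in self.sequence), 0.0)


# Approximate residue masses in Da, illustrative values only.
masses = {"G": 57.05, "A": 71.08}
protein = ProteinSequence("GA", masses)
total = protein.molecular_weight()
```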

iam28th · 2024-05-06T11:31:39Z

+import numpy as np
+from concurrent.futures import ProcessPoolExecutor
+from sklearn.base import BaseEstimator
+from sklearn.tree import DecisionTreeClassifier
+from joblib import dump, load
+import os
+import multiprocessing


the import order is wrong

Suggested change

-import numpy as np
-from concurrent.futures import ProcessPoolExecutor
-from sklearn.base import BaseEstimator
-from sklearn.tree import DecisionTreeClassifier
-from joblib import dump, load
-import os
-import multiprocessing
+import multiprocessing
+import os
+from concurrent.futures import ProcessPoolExecutor
+import numpy as np
+from joblib import dump, load
+from sklearn.base import BaseEstimator
+from sklearn.tree import DecisionTreeClassifier

iam28th · 2024-05-06T11:31:59Z

 # Import modules
-from source.folder_parser import folder_parser
+from dataclasses import dataclass
+import os


the import order is wrong

iam28th · 2024-05-06T11:33:44Z

+    if output is None:
+        output_path = f'{input_folder}/{folder_name}/{input_name}'
+    else:
+        output_path = f'{input_folder}/{folder_name}/{output}.{format}'


Suggested change

-    if output is None:
-        output_path = f'{input_folder}/{folder_name}/{input_name}'
-    else:
-        output_path = f'{input_folder}/{folder_name}/{output}.{format}'
+    output_path = f"{input_folder}/{folder_name}/" + (
+        input_name if output is None else f"{output}.{format}"
+    )

although maybe this did not turn out very well...
but I would like to get rid of the repeated `f'{input_folder}/{folder_name}/` somehow
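
Another way to avoid the repeated prefix, sketched here with `pathlib`. The helper name `build_output_path` and the parameter name `fmt` (used instead of `format` so the builtin is not shadowed) are assumptions, not names from the PR.

```python
from pathlib import Path
from typing import Optional


def build_output_path(
    input_folder: str,
    folder_name: str,
    input_name: str,
    output: Optional[str],
    fmt: str,
) -> Path:
    """Build the output path; the shared directory part is written once."""
    # Only the file name depends on whether `output` was given.
    file_name = input_name if output is None else f"{output}.{fmt}"
    return Path(input_folder) / folder_name / file_name
```

Returning a `Path` keeps the separator handling out of the f-strings; `str(path)` gives the plain string if the rest of the code expects one.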

SuleimanovShakir added 20 commits April 28, 2024 16:07

Add TG_logger and genscan_parser functions. Write docstrings for all …

8168d8d

…classes

Stop tracking vscode settings

0537558

Fix return of slice method to return type(self) object

f225128

Add new file to example folder

ede2638

Create file with tests for sequence_tools.py

f3ffe5e

Add venv to .gitignore

e1b8c68

Create notebook for examples

3075ac9

Add .env to .gitignore

43a25d6

Add OpenFasta and FastaRecord. Fix module import issues

b1abd68

Delete test.py

5b08870

Add custom_random_forest.py and manage imports

c818a81

Fix output message and folder names in fastq_filter

3e89c96

Add 4 showcases

8f1615b

Add 9th test for fastq_filter file writing

ce574eb

Start working on multiprocessing the random forest

c928077

Rewrite the README

ec85365

Update README

a32931b

Add new logos

5ed3bc2

Final fix in README

adcddde

Add requirements.txt

c973bfa

iam28th reviewed May 6, 2024

SuleimanovShakir wants to merge 20 commits into main from hw18
