Skip to content

Error in rule simFeatures #11

@MarkKJF

Description

@MarkKJF

Thanks for you kindly help on the last question.
Restarting the code with --cores = 20 ,

$ snakemake -s Snakefile pretrained_predict -p --cores 20
--config mutations="/home/kjf/2020plus-1.2.2/data/bladder.txt" output_dir="output" trained_classifier="/home/kjf/2020plus-1.2.2/data/2020plus_10k.Rdata"

I get a much greater speed.

However, when it run to
25 of 48 steps (52%) done
A error occurs (I pick some or the warning, removing the duplicated parts)

############################
**[Tue Apr 2 13:50:01 2019]
rule simFeatures:
input: output/simulated_summary/chasm_sim_summary10.txt, output/simulated_summary/oncogene_sim10.txt, output/simulated_summary/tsg_sim10.txt
output: output/simulated_summary/simulated_features10.txt
jobid: 16
wildcards: iter=10

python which 2020plus.py features -s output/simulated_summary/chasm_sim_summary10.txt --tsg-test output/simulated_summary/tsg_sim10.txt -og-test output/simulated_summary/oncogene_sim10.txt -o output/simulated_summary/simulated_features10.txt

Version: 1.2.2
Command: /home/kjf/2020plus-1.2.2/2020plus.py features -s output/simulated_summary/chasm_sim_summary6.txt --tsg-test output/simulated_summary/tsg_sim6.txt -og-test output/simulated_summary/oncogene_sim6.txt -o output/simulated_summary/simulated_features6.txt


AN ERROR HAS OCCURRED: check the log file

Type: <class 'ModuleNotFoundError'>

Exception: No module named 'sklearn'
Traceback:
File "/home/kjf/2020plus-1.2.2/2020plus.py", line 263, in
import src.classify.python.classifier
File "/home/kjf/2020plus-1.2.2/src/classify/python/classifier.py", line 2, in
from src.classify.python.dummy_clf import DummyClf
File "/home/kjf/2020plus-1.2.2/src/classify/python/dummy_clf.py", line 1, in
from sklearn.dummy import DummyClassifier

[Tue Apr 2 13:50:02 2019]
Error in rule simFeatures:
Error in rule simFeatures:
jobid: 8
jobid: 16
output: output/simulated_summary/simulated_features2.txt
output: output/simulated_summary/simulated_features10.txt

RuleException:
CalledProcessError in line 282 of /home/kjf/2020plus-1.2.2/Snakefile:
Command 'set -euo pipefail; python which 2020plus.py features -s output/summary.txt --tsg-test output/tsg.txt -og-test output/oncogene.txt -o output/features.txt' returned non-zero exit status 1.
File "/home/kjf/2020plus-1.2.2/Snakefile", line 282, in __rule_features
File "/home/kjf/anaconda3/envs/2020plus/lib/python3.6/concurrent/futures/thread.py", line 56, in run

Finished working on chromosome: chr13.
[Tue Apr 2 13:52:26 2019]
Finished job 30.
26 of 48 steps (54%) done
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
Complete log: /home/kjf/2020plus-1.2.2/.snakemake/log/2019-04-02T123210.444145.snakemake.log**

Could you please help with the error? Thank you.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions