Skip to content

Commit 367cd84

Browse files
author
Xiao Wang
committed
add descriptions for domain-based DiffModeler
1 parent e68740b commit 367cd84

1 file changed

Lines changed: 13 additions & 4 deletions

File tree

README.md

Lines changed: 13 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -126,8 +126,7 @@ Chimera (for map visualization): https://www.cgl.ucsf.edu/chimera/download.html
126126
<summary>Command Parameters</summary>
127127

128128
```commandline
129-
usage: main.py [-h] --mode MODE [-F F] [-M M] [--config CONFIG] [--gpu GPU] [--output OUTPUT]
130-
[--contour CONTOUR]
129+
usage: main.py [-h] --mode MODE [-F F] [-M M] [--config CONFIG] [--gpu GPU] [--output OUTPUT] [--contour CONTOUR]
131130
132131
options:
133132
-h, --help show this help message and exit
@@ -141,6 +140,10 @@ options:
141140
--gpu GPU specify the gpu we will use
142141
--output OUTPUT Output directory
143142
--contour CONTOUR Contour level for input map, suggested 0.5*[author_contour]. (Float), Default value: 0.0
143+
--fast Specify where to use fast version or not (used in server to save computations of fitting with different parameters)
144+
--seq_search only search sequence against db, that will get templates but not do structure modeling
145+
--af_only only search sequence against AlphaFold DB, for benchmark usage. Default: search RSCB first and then search AFDB
146+
--domain use domain based structure for modeling, split one single-chain to multiple possible domains via SWORD2
144147
```
145148
</details>
146149

@@ -153,7 +156,9 @@ It is fine to run if you only know some of the template structures..
153156
```commandline
154157
python3 main.py --mode=0 -F=[Map_Path] -P=[Single_chain_structure_dir] -M=[protin_config_path] --config=[pipeline_config_file] --contour=[Contour_Level] --gpu=[GPU_ID] --resolution=[resolution]
155158
```
156-
[Map_Path] is the path of the input experimental cryo-EM map, [Single_chain_structure_dir] specifis the directory of all single-chain PDB files. You can also zip them into a zip/.tar/.tar.gz file here to pass the zip file path here. [protin_config_path] is the text file path that records the protein single chain PDB name and corressponding chains, [pipeline_config_file] is the pipeline's parameter configuration file, saved in ``config`` directory; [Contour_Level] is the map density threshold to remove outside regions to save processing time (suggested to use half author recommended contour level), [GPU_ID] specifies the gpu used for inference. [resolution] specified the map resolution, where 0-2A will skip the diffusion model. Therefore, you can use an approximate resolution value here.
159+
[Map_Path] is the path of the input experimental cryo-EM map, [Single_chain_structure_dir] specifis the directory of all single-chain PDB files. You can also zip them into a zip/.tar/.tar.gz file here to pass the zip file path here. [protin_config_path] is the text file path that records the protein single chain PDB name and corressponding chains, [pipeline_config_file] is the pipeline's parameter configuration file, saved in ``config`` directory; [Contour_Level] is the map density threshold to remove outside regions to save processing time (suggested to use half author recommended contour level), [GPU_ID] specifies the gpu used for inference. [resolution] specified the map resolution, where 0-2A will skip the diffusion model. Therefore, you can use an approximate resolution value here. <br>
160+
161+
If you want to use domain-based structure modeling, you can simply add the ``--domain`` option. The program will first call SWORD2 to split each single chain into different domains and then model structures using these domain structures. Alternatively, you can use [SWORD2_server](https://www.dsimb.inserm.fr/SWORD2/index.html) to explore different domain splitting choices and provide the domain structureS as single-chain structures to DiffModeler Here DiffModeler can model protein complexes based on the domain templates you provide.
157162

158163
Example of PDB config file
159164
```commandline
@@ -192,6 +197,8 @@ python3 main.py --mode=1 -F=[Map_Path] -P=[fasta_path] --config=[pipeline_config
192197
[Map_Path] is the path of the input experimental cryo-EM map, [fasta_path] specifis the path of sequence file with .fasta format. [pipeline_config_file] is the pipeline's parameter configuration file, saved in ``config`` directory; [Contour_Level] is the map density threshold to remove outside regions to save processing time (suggested to use half author recommended contour level), [GPU_ID] specifies the gpu used for inference. [resolution] specified the map resolution, where 0-2A will skip the diffusion model. Therefore, you can use an approximate resolution value here.
193198
<b>Please update ``email`` field in config/diffmodeler.json to your email.</b>
194199

200+
<br>If you want to use domain-based structure modeling, you can simply add the ``--domain`` option. The program will first call SWORD2 to split each single chain into different domains and then model structures using these domain structures. Alternatively, you can use [SWORD2_server](https://www.dsimb.inserm.fr/SWORD2/index.html) to explore different domain splitting choices and provide the domain structureS as single-chain structures to DiffModeler Here DiffModeler can model protein complexes based on the domain templates you provide.
201+
195202
This is based on <a href='https://www.ebi.ac.uk/Tools/sss/fasta/'>EBI Search Tool</a> aginst structure database to find most similar structures as templates for us to model protein complex.
196203
<br><b>Please use our [server](https://em.kiharalab.org/algorithm/DiffModeler(seq)) if with more than 4 non-identical chains. </b> EBI's API is too slow to respond when you have many non-identical sequences.
197204

@@ -234,7 +241,9 @@ After configuring the environment, please run
234241
```commandline
235242
python3 main.py --mode=2 -F=[Map_Path] -P=[fasta_path] --config=[pipeline_config_file] --contour=[Contour_Level] --gpu=[GPU_ID] --resolution=[resolution]
236243
```
237-
[Map_Path] is the path of the input experimental cryo-EM map, [fasta_path] specifis the path of sequence file with .fasta format. [pipeline_config_file] is the pipeline's parameter configuration file, saved in ``config`` directory; [Contour_Level] is the map density threshold to remove outside regions to save processing time (suggested to use half author recommended contour level), [GPU_ID] specifies the gpu used for inference. [resolution] specified the map resolution, where 0-2A will skip the diffusion model. Therefore, you can use an approximate resolution value here.
244+
[Map_Path] is the path of the input experimental cryo-EM map, [fasta_path] specifis the path of sequence file with .fasta format. [pipeline_config_file] is the pipeline's parameter configuration file, saved in ``config`` directory; [Contour_Level] is the map density threshold to remove outside regions to save processing time (suggested to use half author recommended contour level), [GPU_ID] specifies the gpu used for inference. [resolution] specified the map resolution, where 0-2A will skip the diffusion model. Therefore, you can use an approximate resolution value here. <br>
245+
246+
If you want to use domain-based structure modeling, you can simply add the ``--domain`` option. The program will first call SWORD2 to split each single chain into different domains and then model structures using these domain structures. Alternatively, you can use [SWORD2_server](https://www.dsimb.inserm.fr/SWORD2/index.html) to explore different domain splitting choices and provide the domain structureS as single-chain structures to DiffModeler Here DiffModeler can model protein complexes based on the domain templates you provide.
238247

239248
Example of fasta file
240249
```

0 commit comments

Comments
 (0)