CDS-miner

CDS-miner aligns a set of query coding sequences (CDS) in FASTA format to a genome assembly and retrieves the similar CDS that pass a specified threshold. Results are written in FASTA format. It is intended for eukaryotic genomes with intron-exon structure.

Disclaimer

The code is not optimized for speed or efficiency, and it may not be stable. I uploaded it here in the hope that it will be useful to others.

Dependencies

CDS-miner runs on Linux and requires the following dependencies:

python
exonerate
bedtools
convert_exonerate_gff_to_gff3.py: https://github.com/jorvis/biocode/blob/master/sandbox/jorvis/convert_exonerate_gff_to_gff3.py
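
A quick way to confirm that the command-line tools listed above are installed is to look them up on the PATH. The snippet below is a minimal sketch, not part of CDS-miner: the helper name `missing_tools` is hypothetical, and it assumes the tools are invoked as `python`, `exonerate`, and `bedtools` (on some systems the interpreter is called `python3`). It does not check for `convert_exonerate_gff_to_gff3.py`, which has to be downloaded separately from the URL above.

```shell
#!/usr/bin/env bash
# missing_tools NAME...  (hypothetical helper, not part of CDS-miner)
# Prints each command that cannot be found on PATH, one per line.
# Returns non-zero if at least one command is missing.
missing_tools() {
    rc=0
    for tool in "$@"; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "$tool"
            rc=1
        fi
    done
    return "$rc"
}

# Assumed command names for the dependencies listed in this README.
missing_tools python exonerate bedtools \
    || echo "Install the tools printed above before running CDS-miner.sh" >&2
```

If nothing is printed, all three commands were found.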

Usage

Name the query file 'query.fa', change the extension of each assembly file to '.fna', and put all files in the same directory as the source code. Then run:

bash CDS-miner.sh
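
The file preparation described above can be scripted. The following is a minimal sketch under stated assumptions: the helper name `stage_inputs` and all file names in the usage note are hypothetical, and the inputs are copied rather than renamed so the originals stay untouched.

```shell
#!/usr/bin/env bash
# stage_inputs QUERY WORKDIR ASSEMBLY...  (hypothetical helper)
# Copies QUERY to WORKDIR/query.fa and every ASSEMBLY to WORKDIR with its
# last extension replaced by .fna (e.g. genome_A.fasta -> genome_A.fna).
stage_inputs() {
    query=$1
    workdir=$2
    shift 2
    cp -- "$query" "$workdir/query.fa"
    for assembly in "$@"; do
        name=${assembly##*/}                      # strip the directory part
        cp -- "$assembly" "$workdir/${name%.*}.fna"   # swap the extension
    done
}
```

For example, `stage_inputs my_queries.fasta /path/to/CDS-miner genome_A.fasta genome_B.fa` would leave `query.fa`, `genome_A.fna`, and `genome_B.fna` next to `CDS-miner.sh`, where `/path/to/CDS-miner` stands for the directory holding the source code.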

Retrieved sequences can be found in the final_results folder.
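
To get a quick overview of what was retrieved, the FASTA records in each result file can be counted by their header lines. This is a minimal sketch: the helper name `count_fasta_records` is hypothetical, and the loop assumes only that the result files sit directly inside final_results, since their naming is not described here.

```shell
#!/usr/bin/env bash
# count_fasta_records FILE  (hypothetical helper)
# Prints the number of FASTA records, i.e. lines starting with '>'.
count_fasta_records() {
    # grep -c exits non-zero when the count is 0; keep the exit status clean.
    grep -c '^>' "$1" || true
}

# Print "<file><TAB><record count>" for every regular file in final_results.
for f in final_results/*; do
    [ -f "$f" ] || continue
    printf '%s\t%s\n' "$f" "$(count_fasta_records "$f")"
done
```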
