CDS-miner

CDS-miner aligns a set of query coding sequences (CDS) in FASTA format to a genome assembly and retrieves similar CDS that score above a specified threshold. Results are written in FASTA format. It is intended for eukaryotic genomes with intron-exon structure.

Disclaimer

The code is not optimized for speed or efficiency, and it may not be stable. I uploaded it here in the hope that it will be useful to others.

Dependencies

CDS-miner runs on Linux with the following dependencies:

Usage

Name the query file 'query.fa', change the extension of each assembly file to '.fna', and place all files in the same directory as the source code.

bash CDS-miner.sh

Retrieved sequences can be found in the final_results folder.
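
The file preparation described under Usage could be scripted roughly as follows. This is only a sketch and not part of CDS-miner: the function name prepare_inputs is hypothetical, and it assumes the assemblies currently end in '.fasta' or '.fa' and already sit in the directory that holds CDS-miner.sh.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not shipped with CDS-miner): arrange input files
# the way the Usage section asks for.
#   $1 = path to the query CDS file (any name)
#   $2 = directory containing CDS-miner.sh and the assemblies
# Assumption: assemblies currently end in .fasta or .fa.
prepare_inputs() {
    local query="$1" workdir="$2" f

    # Copy the query under the required name, unless it is already that file.
    if [ ! "$query" -ef "$workdir/query.fa" ]; then
        cp -- "$query" "$workdir/query.fa"
    fi

    # Rename every assembly to the '.fna' extension, leaving query.fa alone.
    for f in "$workdir"/*.fasta "$workdir"/*.fa; do
        if [ ! -e "$f" ]; then
            continue    # the glob matched nothing
        fi
        if [ "$(basename "$f")" = "query.fa" ]; then
            continue
        fi
        mv -- "$f" "${f%.*}.fna"
    done
}
```

After sourcing the function, calling prepare_inputs my_queries.fasta . from the source directory (my_queries.fasta being a placeholder name) leaves the directory ready for bash CDS-miner.sh. Two assemblies that share a base name, such as genome.fa and genome.fasta, would overwrite each other, so rename them beforehand.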