hydrogenase_motif_finder.py filters NiFe hydrogenase hits, retaining only sequences longer than 200 amino acids that contain two CxxC motifs at least 200 residues apart.
python3 hydrogenase_motif_finder.py -i test_hydrogenase.faa -o test_hydrogenase_wCxxCmotif.txt
Motif found in the following sequences:
Scaffold: Hydrogenase1, Motif Position: 73-371
Scaffold: Hydrogenase2, Motif Position: 62-419
Of the three candidate hydrogenases in the test file, only two pass the filter. Hydrogenase3 does not, so it is excluded from further analysis.
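The filtering rule can be illustrated with a minimal sketch. This is not the code of hydrogenase_motif_finder.py; the function names `find_cxxc_pair` and `read_fasta` are hypothetical, and the sketch assumes that the spacing rule means a gap of at least 200 residues between motif start positions, that the first and last CxxC motifs in a sequence are the pair of interest, and that positions are reported 1-based. The actual script may define these differently.

```python
import re

# Lookahead pattern so that overlapping CxxC motifs are all reported.
CXXC = re.compile(r"(?=C..C)")


def find_cxxc_pair(seq, min_len=200, min_gap=200):
    """Return 1-based starts of the first and last CxxC motif, or None.

    Assumptions (not taken from the original script): the sequence must be
    strictly longer than min_len, and the two motif starts must be at least
    min_gap residues apart.
    """
    seq = seq.upper()
    if len(seq) <= min_len:
        return None
    starts = [m.start() + 1 for m in CXXC.finditer(seq)]
    if len(starts) < 2 or starts[-1] - starts[0] < min_gap:
        return None
    return starts[0], starts[-1]


def read_fasta(lines):
    """Yield (name, sequence) pairs from an iterable of FASTA lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            # Keep only the first word of the header as the sequence name.
            name, chunks = (line[1:].split() or [""])[0], []
        elif line and name is not None:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


if __name__ == "__main__":
    import sys

    # Usage (hypothetical): python3 sketch.py test_hydrogenase.faa
    with open(sys.argv[1]) as handle:
        for name, seq in read_fasta(handle):
            hit = find_cxxc_pair(seq)
            if hit is not None:
                print(f"Scaffold: {name}, Motif Position: {hit[0]}-{hit[1]}")
```

The output line mimics the format shown above, but the reported positions will only match the real script if it uses the same position convention.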