Describe your problem
I am trying to run MacsyFinder to classify/annotate conjugation machinery from plasmids.
When using MacsyFinder with the CONJScan model (v. 2.0.1) on multiple plasmids (~2000 using parallel_macsyfinder.nf) it resulted in no identified system, for any plasmid.
I then examined an example plasmid (attached).
Using blastp I found IMGPR_plasmid_2923296884_000005_2923296884_2923296888_4 to be a MOBQ relaxase.
I then tried to run MacsyFinder on the specific plasmid, to see if the problem was with parallel_macsyfinder.nf specifically, but still no MOB system was identified.
IMGPR_plasmid_2923296884_000005_2923296884_2923296888.faa.txt
Finally I inspected the hmmsearch outputs, that correctly identifies MOBQ as the best match for the IMGPR_plasmid_2923296884_000005_2923296884_2923296888_4 sequence.
T4SS_MOBQ.search_hmm.out.txt
Second best match (MOBP1 and MOBV)
T4SS_MOBP1.search_hmm.out.txt
T4SS_MOBV.search_hmm.out.txt
I do not know If I am missing something or there is a true problem at play. I focused on a MOB system, as it is simple and only requires a relaxase as a mandatory component for the system to be "called" (as I understand it).
To Reproduce
Steps to reproduce the behavior:
The exact command line used:
macsyfinder -o test_macsyfinder --db-type ordered_replicon --models CONJScan/Plasmids all --sequence-db IMGPR_plasmid_2923296884_000005_2923296884_2923296888.faa.txt
OS:
MacSyFinder Version:
MacSyFinder 2.1.5
using:
- Python 3.13.7 | packaged by conda-forge | (main, Sep 3 2025, 14:24:46) [Clang 19.1.7 ]
- MacSyLib 1.0.3
- NetworkX 3.5
- Pandas 2.3.3
Describe your problem
I am trying to run MacsyFinder to classify/annotate conjugation machinery from plasmids.
When using MacsyFinder with the CONJScan model (v. 2.0.1) on multiple plasmids (~2000 using
parallel_macsyfinder.nf) it resulted in no identified system, for any plasmid.I then examined an example plasmid (attached).
Using blastp I found
IMGPR_plasmid_2923296884_000005_2923296884_2923296888_4to be aMOBQrelaxase.I then tried to run MacsyFinder on the specific plasmid, to see if the problem was with
parallel_macsyfinder.nfspecifically, but still no MOB system was identified.IMGPR_plasmid_2923296884_000005_2923296884_2923296888.faa.txt
Finally I inspected the hmmsearch outputs, that correctly identifies MOBQ as the best match for the
IMGPR_plasmid_2923296884_000005_2923296884_2923296888_4sequence.T4SS_MOBQ.search_hmm.out.txt
Second best match (MOBP1 and MOBV)
T4SS_MOBP1.search_hmm.out.txt
T4SS_MOBV.search_hmm.out.txt
I do not know If I am missing something or there is a true problem at play. I focused on a MOB system, as it is simple and only requires a relaxase as a mandatory component for the system to be "called" (as I understand it).
To Reproduce
Steps to reproduce the behavior:
The exact command line used:
OS:
MacSyFinder Version: