Fix spectronaut reader mapping #396

vbrennsteiner · 2026-01-14T16:34:58Z

Reasons for the PR:

Reading Spectronaut reports from versions 20.0.250515.50606 and 20.0.250515.50606 failed with the spectronaut_report reader config due to missing columns (intensity, precursor_intensity).
_pre_process() tried to parse charge out of non-existent columns (ModifiedSequence)

The issues

Some columns that are present in these report files were not covered by the reader config (FG.PrecMz for charge, PG.ProteinNames for genes).
The _preprocess() method tried to parse charge from the mod_seq_column "ModifiedSequence", which does not exist in these reports.

The fixes

Extend spectronaut_report configuration to parse additional columns correctly
Add EG.PrecursorId to mod_seq_columns to get charge information in case FG.PrecMz is missing
Modify _pre_process() to first check whether the charge column is present and only then parse it from the precursor_id column

…ctronaut_report' reader configuration; refactor 'spectronaut' and 'spectronaut_report' reader configurations and remove separate key 'precursor_id_columns' in favor of 'precursor_id' inside 'column_mapping'

Copilot

Pull request overview

This PR fixes issues reading Spectronaut report files from versions 20.0.250515.50606 where certain columns were missing or named differently, causing failures in the spectronaut_report reader configuration.

Changes:

Extended the spectronaut_report configuration to support additional column names for charge, genes, and other fields
Added EG.PrecursorId as a fallback mod_seq_column for extracting charge information
Modified _pre_process() to conditionally parse charge from precursor_id only when the charge column is missing
Changed msfragger_psm_tsv modification_mapping_type from 'msfragger' to 'maxquant'
Removed unused msfragger-specific modification mappings (SATA, SATP, mTRAQ variants, TMTpro)

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
alphabase/psm_reader/dia_psm_reader.py	Added conditional check in `_pre_process()` to only parse charge from precursor_id when charge column is missing
alphabase/constants/const_files/psm_reader.yaml	Extended spectronaut_report column mappings, added EG.PrecursorId to mod_seq_columns, removed unused msfragger modification mappings, changed msfragger_psm_tsv to use 'maxquant' mapping type

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-01-14T16:37:47Z

alphabase/constants/const_files/psm_reader.yaml


 pfind:


The deletion of the msfragger-specific modification mappings section (lines 165-201 in the original file) also removes mappings that are still referenced by msfragger_psm_tsv's mass_mapped_mods list (lines 211-212), specifically 'TMTpro@K' and 'TMTpro@Any_N-term'. While changing modification_mapping_type to 'maxquant' may provide these mappings, this should be verified. The maxquant modification_mappings section does not appear to contain TMTpro mappings based on the visible code (lines 100-157), which could cause translation failures for TMTpro-labeled peptides in MSFragger data.

alphabase/constants/const_files/psm_reader.yaml

mschwoer · 2026-01-20T16:45:31Z

alphabase/constants/const_files/psm_reader.yaml

-    'genes': 'PG.Genes'
+    'charge': ['charge', 'FG.Charge']
+    'mobility': 'FG.ApexIonMobility'
+    'proteins': ['PG.ProteinNames','PG.ProteinGroups', 'PG.ProteinAccessions']


please enumerate lists items with - to be consistent with the rest of this

mschwoer · 2026-01-20T16:46:34Z

alphabase/psm_reader/dia_psm_reader.py

-        ].str.split(".", expand=True, n=2)
+        # In case charge state column is missing, we splice it out of the precursor id column
+        if PsmDfCols.CHARGE not in df.columns:
+            df[[self.mod_seq_column, PsmDfCols.CHARGE]] = df[


now df[self.mod_seq_column] is not set when PsmDfCols.CHARGE is present .. is this intended?

vbrennsteiner added 3 commits January 14, 2026 16:33

add missing columns for spectronaut version 20.0.250515.50606 in 'spe…

db118b5

…ctronaut_report' reader configuration; refactor 'spectronaut' and 'spectronaut_report' reader configurations and remove separate key 'precursor_id_columns' in favor of 'precursor_id' inside 'column_mapping'

add missing columns for spectronaut version 20.0.250515.50606 in 'spe…

d50e199

…ctronaut_report' reader configuration; refactor 'spectronaut' and 'spectronaut_report' reader configurations and remove separate key 'precursor_id_columns' in favor of 'precursor_id' inside 'column_mapping'

fix column order for tests

0f8fab5

vbrennsteiner requested review from GeorgWa, Copilot, lucas-diedrich and mschwoer January 14, 2026 16:34

vbrennsteiner self-assigned this Jan 14, 2026

Copilot started reviewing on behalf of vbrennsteiner January 14, 2026 16:35 View session

Copilot AI reviewed Jan 14, 2026

View reviewed changes

vbrennsteiner marked this pull request as draft January 15, 2026 09:18

fix psm_reader.yaml

e42a92b

vbrennsteiner marked this pull request as ready for review January 15, 2026 09:30

mschwoer reviewed Jan 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix spectronaut reader mapping #396

Fix spectronaut reader mapping #396

vbrennsteiner commented Jan 14, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Jan 14, 2026

Uh oh!

Uh oh!

Uh oh!

mschwoer Jan 20, 2026

Uh oh!

mschwoer Jan 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix spectronaut reader mapping #396

Are you sure you want to change the base?

Fix spectronaut reader mapping #396

Conversation

vbrennsteiner commented Jan 14, 2026

Reasons for the PR:

The issues

The fixes

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Jan 14, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

mschwoer Jan 20, 2026

Choose a reason for hiding this comment

Uh oh!

mschwoer Jan 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants