Skip to content

Standardizer introduces a valence error #59

@Kohulan

Description

@Kohulan

We’ve noticed that the ChEBI entry CHEBI:141373
appears to have been modified by the Structure pipeline (although it is unclear whether ChEBI utilises this pipeline). In the current record, the sulfate is rendered as OSH([O-])[O-], i.e., sulfur shown with three single bonds and an explicit hydrogen. This state differs from the structure reported in the original paper.

Image

SMILES: O=C1CC(c2ccc(OSH([O-])[O-])cc2)Oc2cccc(O)c21

The Original structure from the paper looks like this:

Image

SMILES: C1=C(C=CC(=C1)OS(=O)(=O)[O-])C2CC(=O)C3=C(C=CC=C3O)O2

Could you clarify why the pipeline introduces this hydrogen?

We notice this happens when we try to standardise a molecule using the structure pipeline.

Example: C=CC=S[SH]/C=C\CSHC[C]=C
Standarized: C=[C]CSH+C/C=C[SH]/[SH]=C/C=C

Here you could see that S with 3 single bonds, hydrogen and a positive charge, this is a valence error.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions