
Fix formatting and inconsistencies between CSV and SMI #5

Merged
fbaensch-beilstein merged 6 commits into Beilstein-Institut:main from cthoyt:fixes on Nov 5, 2025


@cthoyt
Contributor

cthoyt commented Oct 23, 2025

This PR does the following:

  1. Fixes inconsistencies in the CSV format. These affected a record with a comma in its label (2,6-(CH3)2C6H3) and one where a space in the label caused misencoding (Py 2).
  2. Adds 14 abbreviations that appeared in the SMI file to the CSV file.
  3. Uses tabs consistently for spacing in the SMI file.
  4. Adds a script that checks that the CSV and SMI files are parsable and in sync, and runs it in the GitHub Actions workflow.
  5. Checks that the SMI file is a proper TSV.
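
For readers unfamiliar with this kind of check, here is a minimal sketch of what a parse-and-sync lint could look like. It is not the script from this PR. The function names, the two-column SMI layout (SMILES, then label), the CSV column order, and the sample rows are all assumptions made for illustration.

```python
import csv
import io


def check_tsv(text: str, n_columns: int = 2) -> list[str]:
    """Return a list of problems found in tab-separated text.

    Hypothetical helper: assumes every non-blank line should have
    exactly `n_columns` tab-separated, non-empty, unpadded fields.
    """
    problems = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue  # ignore blank lines
        fields = line.split("\t")
        if len(fields) != n_columns:
            problems.append(
                f"line {line_number}: expected {n_columns} fields, got {len(fields)}"
            )
        elif any(not field or field != field.strip() for field in fields):
            problems.append(f"line {line_number}: empty or whitespace-padded field")
    return problems


def labels_only_in_one(
    csv_text: str,
    smi_text: str,
    csv_label_column: int = 0,  # assumed position of the label in the CSV
    smi_label_column: int = 1,  # assumed position of the label in the SMI file
) -> tuple[set[str], set[str]]:
    """Return (labels only in the CSV, labels only in the SMI file)."""
    # csv.reader handles quoted fields, so a label containing a comma
    # survives as long as it is quoted in the file.
    csv_labels = {
        row[csv_label_column]
        for row in csv.reader(io.StringIO(csv_text))
        if len(row) > csv_label_column
    }
    smi_labels = set()
    for line in smi_text.splitlines():
        fields = line.split("\t")
        if line.strip() and len(fields) > smi_label_column:
            smi_labels.add(fields[smi_label_column])
    return csv_labels - smi_labels, smi_labels - csv_labels


if __name__ == "__main__":
    # Illustrative sample data, not taken from the repository.
    sample_csv = 'Me,C\n"2,6-(CH3)2C6H3",Cc1cccc(C)c1*\n'
    sample_smi = "C\tMe\nCC Et\n"  # second line uses a space instead of a tab
    print(check_tsv(sample_smi))
    print(labels_only_in_one(sample_csv, sample_smi))
```

Quoting the label in the CSV is what keeps `2,6-(CH3)2C6H3` in a single field. Splitting the SMI lines on a literal tab, rather than on any whitespace, is what lets a check like this catch a stray space used as a separator.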

Unfortunately, this PR became stale because the CSV file was deleted from the main branch before the PR was reviewed, so I updated it accordingly.

@cthoyt
Contributor Author

cthoyt commented Nov 5, 2025

@fbaensch-beilstein did you see this?

@fbaensch-beilstein
Collaborator

@cthoyt Oh sorry, I have been very busy the last few weeks and hadn't seen your PR. I will have a look at it ASAP.
Thanks a lot for your contribution.

@fbaensch-beilstein
Collaborator

@cthoyt Sorry for putting you to the trouble of doing the (automated) comparison and then just throwing out the CSV.
The lint script is great, thank you very much!

@fbaensch-beilstein fbaensch-beilstein merged commit a4444bc into Beilstein-Institut:main Nov 5, 2025
1 check passed
@cthoyt cthoyt deleted the fixes branch November 5, 2025 15:13

2 participants