Skip to content

fix: use IUPAC 'X' instead of 'N' for unknown amino acids#388

Open
neha222222 wants to merge 1 commit intogc-os-ai:mainfrom
neha222222:fix-clean-protein-seq-placeholder
Open

fix: use IUPAC 'X' instead of 'N' for unknown amino acids#388
neha222222 wants to merge 1 commit intogc-os-ai:mainfrom
neha222222:fix-clean-protein-seq-placeholder

Conversation

@neha222222
Copy link
Copy Markdown

N (Asparagine) is a valid standard amino acid. Using it as a placeholder for unknown residues is biologically incorrect. IUPAC convention uses X for unknown amino acids.

Closes #328

Reference Issues/PRs

What does this implement/fix? Explain your changes.

What should a reviewer concentrate their feedback on?

Did you add any tests for the change?

Any other comments?

PR checklist

  • The PR title starts with either [ENH], [MNT], [DOC], or [BUG]. [BUG] - bugfix, [MNT] - CI, test framework, [ENH] - adding or improving code, [DOC] - writing or improving documentation or docstrings.
  • Added/modified tests
  • Used pre-commit hooks when committing to ensure that code is compliant with hooks. Install hooks with pre-commit install.
    To run hooks independent of commit, execute pre-commit run --all-files

N (Asparagine) is a valid standard amino acid. Using it as a
placeholder for unknown residues is biologically incorrect.
IUPAC convention uses X for unknown amino acids.

Closes gc-os-ai#328
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[ENH] Comprehensive refactor and stabilization of the utils module

1 participant