Skip to content

Extension for Maple format: including insertions #8

Description

@corneliusroemer

In order to allow lossless compression, insertions need to be included in the Maple format.

I could find this edge case specified in the preprint.

It's easy to do, one just needs to agree to a convention, e.g.

2134 ins ACGTT

for an insertion of ACGTT after (or before) nucleotide 2134.

Alternative: no need for magic word, one simply includes multiple letters instead of one (I think this would be akin to VCF). If 2134 is usually C, one would write:

2134 CACGTT

for an insertion of ACGTT after nucleotide 2134.

Would be good of you could include treatment of insertions in the preprint.

I think both proposals would work in principle. Both have advantages.

The first is a bit more explicit, the second doesn't require a magic word.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions