Skip to content

feature: text cleaning / normalization #31

@MadBomber

Description

@MadBomber

After spiting the text the typical next step is sometimes to clean (aka normalize) if before before indexing or vectorization processes. Here is the class that I wrote. It might be an interesting new feature to add to baran.

https://github.com/MadBomber/lib_ruby/blob/master/text_cleaner.rb

I think the TextCleaner class is too small to be a stand-alone gem. It might fit in well with your project.

Dewayne
o-*

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions