v1.0.3 #45
chonknick announced in Announcements
✨ Highlights
- `Visualizer` is here! You can now view your chunks, understand chunk quality, and debug your chunker with visual feedback~ Use the `print` method to print rich text on your terminal, or use the `save` method to save a highlighted `html` file on your device! It's very simple to use: just pass in your chunks~
- `Recipes`, which allow you to use multilingual chunking out of the box, as well as document-specific chunking methods. Initial support starts with `en`, `hi`, `zh`, `jp` and `ko`, while the document type `markdown` is supported too. Use it via the `from_recipe` class method with any chunker that takes delimiters or `RecursiveRules`.
- `RecursiveChunker`, `SentenceChunker`, and `WordTokenizer`

What's Changed
- Fix: Refactor `SentenceChunker` to remove estimate + feedback by @chonknick in #23
- Fix: `RecursiveRules.from_dict()` showing `Key error: 'levels' does not exist` because of `.pop()` by @chonknick in #24
- Fix: Remove `.find` for indexing in the `RecursiveChunker` for better efficiency by @Pratik960 in #26
- Feat: Add Initial support for `Recipes` and multilingual CHONKs! by @chonknick in #27
- Fix: `RecursiveChunker` whitespace splitting is not reconstructable (missing spaces) + (#26, #27) by @chonknick in #28
- Fix: `WordTokenizer.count_tokens` should use `.tokenize` instead of `.encode` by @chonknick in #36
- Add Chonkie Vizard - easy chunk visualization with `Visualizer` by @chonknick in #39
- Feat: Add Chonkie Vizard to main! `Visualizer` class for easy chunk visualizations by @chonknick in #40
- Fix: Add `chonkie.utils` module to package list in `pyproject.toml` by @chonknick in #41
- Fix: `chonkie.cloud` does not contain `chunkers` error + bump up version to `v1.0.3` by @chonknick in #43

New Contributors
- @Pratik960 made their first contribution in #26

Full Changelog: v1.0.2...v1.0.3
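Two of the fixes above (#26 and #28) touch on properties that are easy to check in isolation: chunks from whitespace splitting should join back into the original text, and chunk indices can be tracked with a running offset rather than a `.find` lookup. The sketch below is a minimal, standard-library illustration of those two ideas only. It is not Chonkie's implementation, and the helper names `split_keep_whitespace` and `with_offsets` are hypothetical.

```python
import re


def split_keep_whitespace(text: str) -> list[str]:
    """Split on whitespace while keeping the whitespace in the pieces.

    Each piece is either a word plus its trailing whitespace, or a run of
    leading whitespace, so joining the pieces gives back the original text.
    """
    return re.findall(r"\S+\s*|\s+", text)


def with_offsets(pieces: list[str]) -> list[tuple[int, int, str]]:
    """Attach (start, end) indices using a running offset.

    No search over the text is needed, so repeated pieces do not
    get confused with their earlier occurrences.
    """
    spans = []
    offset = 0
    for piece in pieces:
        spans.append((offset, offset + len(piece), piece))
        offset += len(piece)
    return spans


text = "the cat saw the cat"
pieces = split_keep_whitespace(text)
spans = with_offsets(pieces)

# Reconstructable: nothing is lost when the pieces are joined again.
print("".join(pieces) == text)

# Every span points at exactly the text of its piece.
print(all(text[start:end] == piece for start, end, piece in spans))
```

A search-based lookup such as `text.find("the ")` would report the first occurrence for both copies of `"the "` in this input, which is one more reason, besides the repeated scanning, to prefer a running offset when the pieces already arrive in order.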
This discussion was created from the release v1.0.3.
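For readers curious about the `.pop()` issue named in #24, the snippet below sketches the general class of bug: a `from_dict` constructor that pops a key from the caller's dictionary will raise `KeyError` if the same dictionary is passed again. This is an assumption about the failure mode based only on the PR title. The `Rules` class and the `delimiters` key are hypothetical stand-ins, not Chonkie's `RecursiveRules`.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class Rules:
    """Hypothetical stand-in for a rules container built from a dict."""

    levels: list = field(default_factory=list)

    @classmethod
    def from_dict_popping(cls, data: dict[str, Any]) -> "Rules":
        # Problematic pattern: pop() removes 'levels' from the caller's dict.
        return cls(levels=data.pop("levels"))

    @classmethod
    def from_dict(cls, data: dict[str, Any]) -> "Rules":
        # Safer pattern: read the key and copy the list, leaving `data` intact.
        return cls(levels=list(data["levels"]))


config = {"levels": [{"delimiters": ["\n\n"]}]}

# The non-mutating version can be called repeatedly with the same dict.
Rules.from_dict(config)
Rules.from_dict(config)

# The popping version succeeds once, then fails on reuse.
Rules.from_dict_popping(config)
try:
    Rules.from_dict_popping(config)
except KeyError as err:
    print(f"KeyError on second call: {err}")
```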