v0.9.1 #136
benbrandt announced in Announcements
What's Changed
Python
`TextSplitter` and `MarkdownSplitter` now both provide a new `chunk_indices` method that returns a list of not only the chunks but also their corresponding character offsets relative to the original text. This should allow for different string comparison and matching operations on the chunks.
A similar method already existed on the Rust side. The key difference is that these offsets are character offsets, not byte offsets. For Rust strings, it is usually helpful to have the byte offset, but in Python, most string methods and operations deal with character indices.
by @benbrandt in #135
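To illustrate why character offsets are the more convenient choice in Python, here is a minimal sketch that does not call the library itself. The `pairs` list is a hand-written stand-in for the kind of result `chunk_indices` is described as returning; its `(offset, chunk)` tuple shape, the sample text, and the helper `char_to_byte_offset` are assumptions made for this example, not part of the released API.

```python
# Sample text with two non-ASCII characters ("ï" and "é"),
# each of which takes 2 bytes in UTF-8.
text = "naïve café. Second sentence."

# Hypothetical stand-in for chunk_indices output: (character offset, chunk).
# The tuple shape is an assumption for illustration only.
pairs = [(0, "naïve café."), (12, "Second sentence.")]


def char_to_byte_offset(source: str, char_offset: int) -> int:
    """Convert a character offset into the matching UTF-8 byte offset."""
    return len(source[:char_offset].encode("utf-8"))


for offset, chunk in pairs:
    # Character offsets can be used directly with Python string slicing.
    assert text[offset:offset + len(chunk)] == chunk

    # The byte offset differs as soon as multi-byte characters precede the chunk.
    byte_offset = char_to_byte_offset(text, offset)
    print(f"char offset {offset:>2} -> byte offset {byte_offset:>2}: {chunk!r}")
```

The second chunk starts 12 characters into the text but 14 bytes into its UTF-8 encoding, so a byte offset would point at the wrong place if passed to `str` slicing or `str.find`.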
Full Changelog: v0.9.0...v0.9.1
This discussion was created from the release v0.9.1.