Support multiple new-line characters in regex APIs #15961
Merged
rapids-bot merged 65 commits into rapidsai:branch-24.10 from davidwendt:regex-new-lines on Sep 17, 2024
793c7dc
Support multiple new-line characters in regex APIs
davidwendt 4f98848
Merge branch 'branch-24.08' into regex-new-lines
davidwendt 1db46f7
Merge branch 'branch-24.08' into regex-new-lines
davidwendt 10efa85
Merge branch 'branch-24.08' into regex-new-lines
davidwendt 833cfaa
Merge branch 'branch-24.08' into regex-new-lines
davidwendt 896e3e8
Merge branch 'branch-24.08' into regex-new-lines
davidwendt 421a276
Merge branch 'branch-24.08' into regex-new-lines
davidwendt 4717059
Merge branch 'branch-24.08' into regex-new-lines
davidwendt 6517432
Merge branch 'branch-24.08' into regex-new-lines
davidwendt f429f83
Merge branch 'branch-24.08' into regex-new-lines
davidwendt cb36e73
Merge branch 'branch-24.08' into regex-new-lines
davidwendt aec936d
Merge branch 'branch-24.08' into regex-new-lines
davidwendt a2a79a3
Merge branch 'branch-24.08' into regex-new-lines
davidwendt bd08443
Merge branch 'branch-24.08' into regex-new-lines
davidwendt 1f6da03
Merge branch 'branch-24.08' into regex-new-lines
davidwendt f543f30
Merge branch 'branch-24.08' into regex-new-lines
davidwendt 22d5f66
Merge branch 'branch-24.08' into regex-new-lines
davidwendt d93a4b6
Merge branch 'branch-24.08' into regex-new-lines
davidwendt c701004
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 8d472ba
Merge branch 'branch-24.10' into regex-new-lines
davidwendt d693d43
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 71c063a
add new flag
davidwendt 14e1c78
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 72e222a
update state engine for ext newlines
davidwendt 57f3567
Merge branch 'branch-24.10' into regex-new-lines
davidwendt e3425a6
add support for ANY inst
davidwendt 9ebf087
Merge branch 'branch-24.10' into regex-new-lines
davidwendt b789fc1
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 6a0eae3
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 920ed87
Merge branch 'branch-24.10' into regex-new-lines
davidwendt d82fe08
add gtest for extract
davidwendt f23c8d8
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 8e83b99
Merge branch 'branch-24.10' into regex-new-lines
davidwendt b396e75
adds more gtests: extract, findall
davidwendt 58e0f95
Merge branch 'branch-24.10' into regex-new-lines
davidwendt d7c4dec
Merge branch 'regex-new-lines' of github.com:davidwendt/cudf into reg…
davidwendt e974cb6
Merge branch 'branch-24.10' into regex-new-lines
davidwendt b41989f
add special_chars.h and more gtests
davidwendt b5befac
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 2c144f9
fix BOL/CHAR logic
davidwendt 7c41318
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 7c10de4
add dotall test for completeness
davidwendt feaae6d
Merge branch 'branch-24.10' into regex-new-lines
davidwendt d9b5481
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 85ebbe5
update tests; update regex.md doc
davidwendt 7eec095
Merge branch 'regex-new-lines' of github.com:davidwendt/cudf into reg…
davidwendt f4b28ab
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 797004e
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 4da7ef5
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 143e396
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 75c3643
Merge branch 'branch-24.10' into regex-new-lines
davidwendt bbf28c3
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 0477392
fix merge conflict
davidwendt 58003b5
Merge branch 'regex-new-lines' of github.com:davidwendt/cudf into reg…
davidwendt 6f9a55b
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 858dfd9
Merge branch 'branch-24.10' into regex-new-lines
davidwendt b71fbb8
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 364ed09
Merge branch 'branch-24.10' into regex-new-lines
davidwendt ac1e5cd
Merge branch 'regex-new-lines' of github.com:davidwendt/cudf into reg…
davidwendt c05c3ac
Merge branch 'branch-24.10' into regex-new-lines
davidwendt b230e0f
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 612061f
fix wording in .md file
davidwendt fd9fcaf
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 5a78495
Merge branch 'branch-24.10' into regex-new-lines
davidwendt 53c925c
Merge branch 'branch-24.10' into regex-new-lines
davidwendt
vuule marked this conversation as resolved.
This optimization speeds up this operation for ASCII-only strings, that is, strings with no multi-byte characters.
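
The code under review is not visible here, so the following is only a minimal sketch of the general idea behind an ASCII fast path, not the PR's actual implementation. It assumes UTF-8 input; the helper names `count_chars` and `byte_offset` are hypothetical. When a string's character count equals its byte count, every character is a single byte, so a character index can be used directly as a byte offset without walking the string.

```cpp
#include <cassert>
#include <cstddef>
#include <string_view>

// Count UTF-8 characters by skipping continuation bytes (bit pattern 10xxxxxx).
inline std::size_t count_chars(std::string_view s)
{
  std::size_t n = 0;
  for (unsigned char c : s) {
    if ((c & 0xC0) != 0x80) { ++n; }
  }
  return n;
}

// Hypothetical helper: byte offset of the character at index `pos`.
// `num_chars` is the precomputed character count of `s`.
inline std::size_t byte_offset(std::string_view s, std::size_t num_chars, std::size_t pos)
{
  assert(pos <= num_chars);
  // Fast path: character count equals byte count, so the string is ASCII-only
  // and each character occupies exactly one byte.
  if (num_chars == s.size()) { return pos; }
  // Slow path: step over one character at a time, skipping continuation bytes.
  std::size_t offset = 0;
  while (offset < s.size() && pos > 0) {
    ++offset;
    while (offset < s.size() && (static_cast<unsigned char>(s[offset]) & 0xC0) == 0x80) {
      ++offset;
    }
    --pos;
  }
  return offset;
}
```

For an ASCII string the lookup is constant time; only strings that contain multi-byte characters pay for the byte-by-byte walk.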