Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Supporting very large texts and arbitrary backtracking on modern .NET #95

Open
stephan-tolksdorf opened this issue Nov 6, 2022 · 1 comment

Comments

@stephan-tolksdorf
Copy link
Owner

Since .NET Core dropped support for the serialization of System.Text.Decoder instances, the current serialization-based approach for supporting any Encoding on large input streams seems no longer viable.
A programatic way forward would be to only support UTF-(8|16|32) encodings and then do the decoding state management ourselves. This would also allow us to get rid of Cloning.cs. (Side note: the time I put ~15 years ago into making the serialization-based approach work and performant was completely insane.)

@stephan-tolksdorf
Copy link
Owner Author

I've started working on this, but don't yet know when I'll have time to bring it into a publishable state.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant