Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bodyfile parser: handle unpaired surrogates #4705

Open
joachimmetz opened this issue Jul 2, 2023 · 0 comments
Open

bodyfile parser: handle unpaired surrogates #4705

joachimmetz opened this issue Jul 2, 2023 · 0 comments
Labels
enhancement New or improved functionality parsers Issues related to parsers and parser plug-ins

Comments

@joachimmetz
Copy link
Member

NTFS file names can contain unpaired surrogates. It is currently unclear how these should be represented in the format sleuthkit/sleuthkit#2837, however the Python Unicode implementation rejects unpaired surrogates as invalid Unicode.

Maybe the best way is to escape them, for example as "\ud800"

@joachimmetz joachimmetz added enhancement New or improved functionality parsers Issues related to parsers and parser plug-ins labels Jul 2, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New or improved functionality parsers Issues related to parsers and parser plug-ins
Projects
None yet
Development

No branches or pull requests

1 participant