nquads silently drops lines containing quotes #72
Comments
This appears to be merely an issue with the 30kmoviedata.nt dataset (see below) in conjunction with swallowing error conditions. This should be fixed in an upcoming PR from me (this will add error return for cases like this and fix the incorrect
kortschak added a commit to kortschak/cayley that referenced this issue Jul 22, 2014
Fixes issue cayleygraph#72

This change simplifies interactions with parsing N-Quads and makes reading datasets more robust.

Changes made while here also improve performance:

benchmark          old ns/op    new ns/op    delta
BenchmarkParser    1058         667          -36.96%

We still use string concatenation, which I'm not wildly happy about, but I think this can be left for a later change.

Initial changes towards idiomatic error handling have been made. More significant changes are needed, but these have subtle design implications and need to be thought about more.

30kmoviesdata.nt.gz has been altered to properly escape double quotes. This was done mechanically and with manual curation to pick up stragglers.
If you apply the patch below or equivalent, you see 137 lines output as having been dropped when you execute

    cayley http --dbpath=30kmoviedata.nt

All have at least one quote mark (single or double).
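To illustrate the general idea of reporting bad lines instead of silently skipping them, here is a rough standalone sketch in Go. It does not use cayley's parser or reflect the actual fix: `checkLine`, `LineError`, and `checkAll` are hypothetical names, and the check is only a heuristic that assumes each statement holds at most one literal, so a line should contain either 0 or 2 unescaped double quotes.

```go
package main

import (
	"bufio"
	"fmt"
	"strings"
)

// checkLine is a hypothetical stand-in for a real N-Quads line parser.
// Heuristic (an assumption, not the N-Quads grammar): a statement has at
// most one literal, so it should contain 0 or 2 unescaped double quotes.
func checkLine(line string) error {
	quotes := 0
	escaped := false
	for _, r := range line {
		switch {
		case escaped:
			// The previous rune was a backslash; this rune is escaped.
			escaped = false
		case r == '\\':
			escaped = true
		case r == '"':
			quotes++
		}
	}
	if quotes != 0 && quotes != 2 {
		return fmt.Errorf("expected 0 or 2 unescaped double quotes, found %d", quotes)
	}
	return nil
}

// LineError records which input line failed the check and why.
type LineError struct {
	Line int
	Err  error
}

// checkAll scans every line and collects failures rather than dropping them.
func checkAll(input string) []LineError {
	var errs []LineError
	sc := bufio.NewScanner(strings.NewReader(input))
	n := 0
	for sc.Scan() {
		n++
		if err := checkLine(sc.Text()); err != nil {
			errs = append(errs, LineError{Line: n, Err: err})
		}
	}
	return errs
}

func main() {
	// Made-up sample data, not taken from 30kmoviedata.nt.
	sample := `<a> <b> "plain" .
<a> <b> "say "hi" now" .
<a> <b> "say \"hi\" now" .
<a> <b> <c> .`
	for _, e := range checkAll(sample) {
		fmt.Printf("line %d: %v\n", e.Line, e.Err)
	}
}
```

Collecting the failures with their line numbers makes the count of rejected lines visible to the caller, which is the behaviour this issue asks for; a real loader would return the parser's own error rather than this quote-counting heuristic.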