-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for parsing timestamps from CSV files #958
Comments
Arrow contains the code to parse a string --> timestamp correctly here: https://github.com/apache/arrow-rs/blob/master/arrow/src/compute/kernels/cast_utils.rs#L69 This ticket would likely be a matter of hooking that code up into the CSV parser: https://github.com/apache/arrow-rs/blob/master/arrow/src/csv/reader.rs So most of the code in this PR might best belong in arrow-rs rather than datafusion |
I would like to pick this up. Please assign to me as appropriate // @alamb |
I have an example of this now on novemberkilo@d9f096a To reproduce, follow the directions in
|
novemberkilo@d9f096a <-- looks very cool 👍 |
@alamb looks like datafusion is pinned to version |
@novemberkilo since apache/arrow-rs#832 doesn't break any public api, it will be released as part of arrrow 6.x. @alamb already have a PR ready to merge for arrow-rs 6.x integration: #984. Process wise, we need to get arrow-rs 6.0.0 released first. I will let @alamb decide whether your arrow-rs PR should be merged and released as part of the 6.0.0 release or the release after that. |
arrow 6.0.0 is released. When apache/arrow-rs#832 is merged I'll backport that( will be included in 6.1.0, due to be released around Nov 1 2021) |
🤔 I wonder if this issue is now done? Or does it need more work? |
iirc we just wanted to wait until we could confirm that the version of |
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
I updated the nyc benchmark schema to use timestamps:
I tried running a query and got this error.
Describe the solution you'd like
I would like to be able to query CSV files containing timestamps.
Describe alternatives you've considered
None.
Additional context
None.
The text was updated successfully, but these errors were encountered: