TableTools.readCsv should be able to read the timestamp string representation Pandas uses (seems to be RFC3339). #1571
Replies: 7 comments 3 replies
-
@devinrsmith did add support for ISO 8601 timestamps. I think we're looking for 'T' as a separator. Wikipedia claims space is not an acceptable separator:
|
Beta Was this translation helpful? Give feedback.
-
We should consider whether to be more permissive than the ISO standard. |
Beta Was this translation helpful? Give feedback.
-
Right now, the code is effectively using:
This looks like it's quite authoritatively parsing strict ISO 8601. We might be able to use @jcferretti Does Pandas publish a discussion for what kind of permissive formats they allow outside of the ISO standard? |
Beta Was this translation helpful? Give feedback.
-
This seems relevant:
|
Beta Was this translation helpful? Give feedback.
-
More context: This mentions Postgres does the same on output (space). |
Beta Was this translation helpful? Give feedback.
-
So, maybe we turn this discussion into a feature request to support RFC-3339 timestamps. What I don't want is an open-ended "support whatever Pandas does" ticket without more research. |
Beta Was this translation helpful? Give feedback.
-
Happy to support the common date-time formats that python/pandas produces by default. |
Beta Was this translation helpful? Give feedback.
-
A file generated in python pandas using
.to_csv(...)
for a dataframe that was loaded from a parquet file with a timestamp(nanos) type column:The timestamp column formatted as above is not recognized by DHC when using
TableTools.readCsv
; the column is read as String.More generally, we should accept valid ISO 8601 strings as timestamps, any of them.As Ryan points out below, this is not valid ISO 8601; it seems to be valid RFC3339 but is not necessarily what Pandas is even trying to do here; read the sequence of comments that follows.
Beta Was this translation helpful? Give feedback.
All reactions