You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Cloud provider: Corporate S3 implementation based on Ceph
OS: Ubuntu
Bug
What happened:
Schema error for timestamp column when trying to create a delta table from a pyarrow dataframe, read from a csv file: deltalake.PyDeltaTableError: Schema error: Invalid data type for Delta Lake: Timestamp(Millisecond, Some("UTC"))
How to reproduce it:
Have a csv file with a timestamp column with the format: 2023-05-11 18:00:00+03:00 (Timezone information explicitly mentioned)
Read it using pyarrow: pyarrow.csv.read_csv(<file>)
Try to write to a delta table: deltalake.write_deltalake(<path>, storage_options=<storage_options>)
More details:
Repeating the same with Pandas works without any issues.
Pyarrow parses it as timestamp[s, tz=UTC]
Pandas parses it as an Object
The text was updated successfully, but these errors were encountered:
Environment
Delta-rs version: 0.9.0
Binding: Python
Environment:
Bug
What happened:
Schema error for timestamp column when trying to create a delta table from a pyarrow dataframe, read from a csv file:
deltalake.PyDeltaTableError: Schema error: Invalid data type for Delta Lake: Timestamp(Millisecond, Some("UTC"))
How to reproduce it:
pyarrow.csv.read_csv(<file>)
deltalake.write_deltalake(<path>, storage_options=<storage_options>)
More details:
timestamp[s, tz=UTC]
The text was updated successfully, but these errors were encountered: