-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Python] Table / RecordBatch repr displays the wrong timezone for non-UTC timestamps #38629
Comments
Thank you for reporting the issue @nph! Yes, this is a known issue with timezone aware timestamps. The problem is the dependency on finding a timezone database which is not yet supported on Windows at the moment for example. I will be adding an alternative solution which would keep the printed value in UTC but adding an indication at the end of the string to make it a bit clearer that UTC times are printed. See #30117. Will close this issue as a duplicate, please follow the linked one for future fix. |
Duplicate of #30117 |
(I was by coincidence replying on exactly the same moment ;), keeping a small part of my answer)
Just one additional note on this: it doesn't show the "incorrect" timezone in the schema. Arrow does support keeping track of a timezone parameter on the timestamp type, and therefore when creating the table from tz-aware values, we preserve that information in the schema. |
Thanks @jorisvandenbossche, @AlenkaF. |
Describe the bug, including details regarding any error messages, version, and platform.
Printing a
Table
or aRecordBatch
containing timezone-aware timestamps displays the time values in UTC but shows the original (possibly non-UTC) timezone in the schema header.See also this related DuckDB issue
Pyarrow Version:
13.0.0
Platform:
macOS 12.7
Component(s)
Python
The text was updated successfully, but these errors were encountered: