Replies: 1 comment 10 replies
-
@adriangb both do quiete different things .files() returns the filepaths of the table in a list, while to_pyarrow_dataset also has to construct dataset fragments of each file while parsing the table stats into pyarrow.compute expressions |
Beta Was this translation helpful? Give feedback.
10 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm seeing ~500ms for the former and ~1ms for the later (running locally, absolute numbers shouldn't matter)
@ion-elgreco is it possible the table state is not being properly re-used or similar?
Beta Was this translation helpful? Give feedback.
All reactions