Add page iterator to ReadRowsStream #7680
]
avro_schema = _bq_to_avro_schema(bq_columns)
read_session = _generate_read_session(avro_schema)
bq_blocks_1 = [
Minor nit: the naming of the "blocks" variables reads oddly to me. You're testing a single block with multiple rows, not multiple blocks. It's consistent with existing tests, just seems odd.
Actually, this does contain 2 blocks. `black` just formatted it in such a way that it's not clear.
I've separated each block into its own variable and added a comment for why there are two groups of blocks. Hopefully this makes it clearer.
This allows readers to read blocks (called pages for compatibility with the BigQuery client library) one at a time from a stream. This enables use cases such as progress bar support and streaming workers that expect pandas DataFrames.
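To illustrate the idea of reading one block at a time, here is a minimal, self-contained sketch. It is not the code from this PR: `Page`, `iter_pages`, and `read_with_progress` are hypothetical names, and blocks are modeled as plain lists of dicts rather than Avro-encoded messages. It only shows how a lazy page iterator lets a caller report progress after each block.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator, List

Row = Dict[str, object]


@dataclass
class Page:
    """One block of rows exposed as a "page" (hypothetical stand-in)."""

    rows: List[Row]

    @property
    def num_items(self) -> int:
        # Number of rows carried by this block.
        return len(self.rows)

    def __iter__(self) -> Iterator[Row]:
        return iter(self.rows)


def iter_pages(blocks: Iterable[List[Row]]) -> Iterator[Page]:
    """Lazily wrap each block from the stream in a Page, one at a time."""
    for block in blocks:
        yield Page(rows=list(block))


def read_with_progress(blocks: Iterable[List[Row]], total_rows: int) -> List[float]:
    """Consume the stream page by page, recording the fraction read so far."""
    rows_seen = 0
    progress = []
    for page in iter_pages(blocks):
        rows_seen += page.num_items
        progress.append(rows_seen / total_rows)
    return progress


# Three blocks holding four rows in total (made-up sample data).
sample_blocks = [[{"n": 1}, {"n": 2}], [{"n": 3}], [{"n": 4}]]
fractions = read_with_progress(sample_blocks, total_rows=4)
print(fractions)  # [0.5, 0.75, 1.0]
```

Because `iter_pages` is a generator, no block is touched until the consumer asks for it. A streaming worker could, under the same assumptions, build a DataFrame per page with `pandas.DataFrame(page.rows)` instead of accumulating progress fractions.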
Towards #7654