fix: Fix metadata reflection without a default dataset #1089

JacobHayes · 2024-06-27T06:09:04Z

Fixes #838 and fixes #1088. 🦕

The get_table_names and get_view_names methods are supposed to return the bare names (no {schema}. prefix) of the resources for a single schema/dataset (where schema=None is the "default schema", not "all schemas"). However, the BigQueryDialect implementation returns:

bare names for a specific schema: if the connection has a default dataset
{schema}. prefixed names for one schema: if the connection doesn't have a default dataset and the schema arg is a string
{schema}. prefixed names for all schemas: if the connection doesn't have a default dataset and the schema arg is None

The bolded parts cause issues outlined in #1088. This PR fixes the get_table_names and get_view_names implementations to return bare names as SQLAlchemy expects.

This is a breaking change for a subset of users:

(no impact) connections with a default dataset (likely a majority of users?) should behave the same
(impact) connections without a default dataset using Metadata.reflect() without a schema argument were probably expecting to reflect all datasets, but they would now not reflect anything.
- BigQuery doesn't have the same concept of a default schema the way eg: PostgreSQL does (public) - so without a default set on the connection string, we don't have anything to search and return an empty list.
(no impact) connections without a default dataset using Alembic were probably broken anyway

This has been working for me locally with alembic + no default dataset (the models specify the schema) + include_schemas=True (required for cross-schema stuff).

Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
Ensure the tests and linter pass
- unit tests + linter pass locally
Code coverage does not decrease (if any source code was changed)
Appropriate docs were updated (if necessary)

conventional-commit-lint-gcf · 2024-06-27T06:12:04Z

🤖 I detect that the PR title and the commit message differ and there's only one commit. To use the PR title for the commit history, you can use Github's automerge feature with squashing, or use automerge label. Good luck human!

-- conventional-commit-lint bot
https://conventionalcommits.org/

JacobHayes · 2024-07-19T01:21:14Z

Hi, any chance I could get the tests approved to run on the PR? I've been trying to setup the system tests locally, but they're a bit onerous (understandably).

Linchin · 2024-07-22T23:21:22Z

Sorry for the delay, just started running the tests!

JacobHayes · 2024-07-23T01:24:35Z

Sorry for the delay, just started running the tests!

no worries, thank you!

chalmerlowe · 2024-11-13T12:19:54Z

@lingchin if this is a breaking change as Jacob indicates, then we need to have some sort of deprecation warning that the break is coming and get this into the schedule.

This is a breaking change for a subset of users:

(no impact) connections with a default dataset (likely a majority of users?) should behave the same

(impact) connections without a default dataset using Metadata.reflect() without a schema argument were probably expecting to reflect all datasets, but they would now not reflect anything. BigQuery doesn't have the same concept of a default schema the way eg: PostgreSQL does (public) - so without a default set on the connection string, we don't have anything to search and return an empty list.

(no impact) connections without a default dataset using Alembic were probably broken anyway

product-auto-label bot added size: m Pull request size is medium. api: bigquery Issues related to the googleapis/python-bigquery-sqlalchemy API. labels Jun 27, 2024

JacobHayes force-pushed the fix-table-reflection branch from c35e20e to 7d72652 Compare June 27, 2024 06:12

JacobHayes force-pushed the fix-table-reflection branch 2 times, most recently from 969685f to 4a006bf Compare June 27, 2024 07:06

JacobHayes marked this pull request as ready for review June 27, 2024 07:12

JacobHayes requested review from a team as code owners June 27, 2024 07:12

JacobHayes requested a review from shollyman June 27, 2024 07:12

blunderbuss-gcf bot assigned chalmerlowe Jun 27, 2024

JacobHayes mentioned this pull request Jun 27, 2024

Metadata.reflect() fails if user does not have access to all datasets/tables in project. #838

Closed

JacobHayes changed the title ~~Fix get_table_names and get_view_names without a default dataset~~ Fix metadata reflection without a default dataset Jul 2, 2024

JacobHayes changed the title ~~Fix metadata reflection without a default dataset~~ fix: Fix metadata reflection without a default dataset Jul 18, 2024

Linchin added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Jul 22, 2024

yoshi-kokoro removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Jul 22, 2024

fix!: Fix get_table_names and get_view_names without a default dataset

75a933a

JacobHayes force-pushed the fix-table-reflection branch from 64ab97f to 75a933a Compare July 23, 2024 01:30

Linchin added kokoro:force-run Add this label to force Kokoro to re-run the tests. owlbot:run Add this label to trigger the Owlbot post processor. labels Jul 23, 2024

gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label Jul 23, 2024

yoshi-kokoro removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Jul 23, 2024

Linchin and others added 2 commits September 20, 2024 14:31

Merge branch 'main' into fix-table-reflection

f77547b

Merge branch 'main' into fix-table-reflection

a1490a5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Fix metadata reflection without a default dataset #1089

fix: Fix metadata reflection without a default dataset #1089

JacobHayes commented Jun 27, 2024 •

edited

Loading

conventional-commit-lint-gcf bot commented Jun 27, 2024

JacobHayes commented Jul 19, 2024

Linchin commented Jul 22, 2024

JacobHayes commented Jul 23, 2024

chalmerlowe commented Nov 13, 2024

fix: Fix metadata reflection without a default dataset #1089

Are you sure you want to change the base?

fix: Fix metadata reflection without a default dataset #1089

Conversation

JacobHayes commented Jun 27, 2024 • edited Loading

conventional-commit-lint-gcf bot commented Jun 27, 2024

JacobHayes commented Jul 19, 2024

Linchin commented Jul 22, 2024

JacobHayes commented Jul 23, 2024

chalmerlowe commented Nov 13, 2024

JacobHayes commented Jun 27, 2024 •

edited

Loading