BigQuery: add `--max_results` option to magic #9169

shubha-rajan · 2019-09-04T01:41:32Z

Second of 3 PRs towards resolving #9105 as described in review for #9147

tswast · 2019-09-04T19:40:57Z

I just merged #9167. We might have to rebase.

googlebot · 2019-09-04T19:58:23Z

All (the pull request submitter and all commit authors) CLAs are signed, but one or more commits were authored or co-authored by someone other than the pull request submitter.

We need to confirm that all authors are ok with their commits being contributed to this project. Please have them confirm that by leaving a comment that contains only @googlebot I consent. in this pull request.

Note to project maintainer: There may be cases where the author cannot leave a comment, or the comment is not properly detected as consent. In those cases, you can manually confirm consent of the commit author(s), and set the cla label to yes (if enabled on your project).

ℹ️ Googlers: Go here for more info.

googlebot · 2019-09-04T20:17:10Z

CLAs look good, thanks!

ℹ️ Googlers: Go here for more info.

tswast

I'd actually love to see a system / integration test for this feature, but I know we don't have any examples for you to work off of for that. (I'll see what I can do about that, since I know this will come up again in the future.)

Since we don't have a system for notebook integration tests in this repo, I'd like to see the results of manually testing this feature in a notebook.

tswast · 2019-09-04T20:19:48Z

bigquery/google/cloud/bigquery/magics.py

@@ -300,7 +300,7 @@ def _run_query(client, query, job_config=None):
    while True:
        print("\rQuery executing: {:0.2f}s".format(time.time() - start_time), end="")
        try:
-            query_job.result(timeout=0.5)
+            query_job.result(timeout=0.5, max_results=max_results)


Since we aren't actually returning the results in this line (just waiting for the query to finish), we don't need to pass max_results here. I believe that means we can remove the max_results parameter from _run_query as well.

Removed in f88ffe8

tswast · 2019-09-04T20:21:07Z

bigquery/google/cloud/bigquery/magics.py

+        "Defaults to returning all rows."
+    ),
+)
+@magic_arguments.argument(


Looks like these arguments got doubled up. (Maybe the two commits thing, we noticed when rebasing?)

That's probably what happened (moral of the story, always run tests before pushing). Removed the duplicate in 147e43d

tswast · 2019-09-04T20:22:41Z

bigquery/tests/unit/test_magics.py

@@ -414,7 +414,7 @@ def test_bigquery_magic_with_legacy_sql():
    with run_query_patch as run_query_mock:
        ip.run_cell_magic("bigquery", "--use_legacy_sql", "SELECT 17 AS num")

-        job_config_used = run_query_mock.call_args_list[0][0][-1]
+        job_config_used = run_query_mock.call_args_list[0][1]["job_config"]


Was this change intentional? Maybe a bad rebase?

This was intentional. It was to fix some test failures I was getting because the argument that was being retrieved was the wrong type (a result of changing the method signature for _run_query). This way the arg being accessed will always be job_config, even if other named parameters are added. Since I reverted the changes to _run_query, I can probably change this back too. I think the tests will pass either way

tswast · 2019-09-04T20:29:57Z

bigquery/google/cloud/bigquery/magics.py

@@ -433,7 +455,9 @@ def _cell_magic(line, query):

    error = None
    try:
-        query_job = _run_query(client, query, job_config)
+        query_job = _run_query(
+            client, query, job_config=job_config, max_results=max_results


max_results is not actually needed here, it's not until the call to to_dataframe that we need max_results.

Note: to_dataframe probably doesn't have a max_results argument, and I'm actually not certain that we'd want to add one. Instead, we can call query_job.results with a max_results argument and then call to_dataframe on the resulting RowIterator.

I updated to call to_dataframe on query_job.result, passing max_results as an argument if max_results is present in f88ffe8

…ha-rajan/google-cloud-python into bq-add-max-results-to-magic

shubha-rajan · 2019-09-04T21:46:23Z

Screenshot of running a query in a notebook with max_results set:

* added max_results magic option and fixed broken tests * added tests for --max_results magic option * added max_results magic option and fixed broken tests * added tests for --max_results magic option * Removed duplicate `--max_results` magic argument * removed max_results param from run_query, updated tests

googlebot added the cla: yes This human has signed the Contributor License Agreement. label Sep 4, 2019

shubha-rajan marked this pull request as ready for review September 4, 2019 18:32

shubha-rajan requested a review from a team September 4, 2019 18:32

shubha-rajan added 2 commits September 4, 2019 12:55

added max_results magic option and fixed broken tests

81b0ca2

added tests for --max_results magic option

b04cc37

shubha-rajan requested a review from busunkim96 as a code owner September 4, 2019 19:58

googlebot added cla: no This human has *not* signed the Contributor License Agreement. and removed cla: yes This human has signed the Contributor License Agreement. labels Sep 4, 2019

shubha-rajan added 2 commits September 4, 2019 13:10

added max_results magic option and fixed broken tests

7176592

added tests for --max_results magic option

37b4d39

shubha-rajan force-pushed the bq-add-max-results-to-magic branch from 23fb140 to 37b4d39 Compare September 4, 2019 20:17

googlebot added cla: yes This human has signed the Contributor License Agreement. and removed cla: no This human has *not* signed the Contributor License Agreement. labels Sep 4, 2019

tswast requested changes Sep 4, 2019

View reviewed changes

shubha-rajan added 3 commits September 4, 2019 13:43

Removed duplicate --max_results magic argument

147e43d

removed max_results param from run_query, updated tests

f88ffe8

Merge branch 'bq-add-max-results-to-magic' of https://github.com/shub…

a01f1fc

…ha-rajan/google-cloud-python into bq-add-max-results-to-magic

shubha-rajan requested a review from tswast September 5, 2019 22:25

tswast approved these changes Sep 6, 2019

View reviewed changes

tswast merged commit f00b60b into googleapis:master Sep 6, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BigQuery: add `--max_results` option to magic #9169

BigQuery: add `--max_results` option to magic #9169

shubha-rajan commented Sep 4, 2019

tswast commented Sep 4, 2019

googlebot commented Sep 4, 2019

googlebot commented Sep 4, 2019

tswast left a comment

tswast Sep 4, 2019

shubha-rajan Sep 4, 2019

tswast Sep 4, 2019

shubha-rajan Sep 4, 2019

tswast Sep 4, 2019

shubha-rajan Sep 4, 2019 •

edited

Loading

tswast Sep 4, 2019

shubha-rajan Sep 4, 2019

shubha-rajan commented Sep 4, 2019

BigQuery: add --max_results option to magic #9169

BigQuery: add --max_results option to magic #9169

Conversation

shubha-rajan commented Sep 4, 2019

tswast commented Sep 4, 2019

googlebot commented Sep 4, 2019

googlebot commented Sep 4, 2019

tswast left a comment

Choose a reason for hiding this comment

tswast Sep 4, 2019

Choose a reason for hiding this comment

shubha-rajan Sep 4, 2019

Choose a reason for hiding this comment

tswast Sep 4, 2019

Choose a reason for hiding this comment

shubha-rajan Sep 4, 2019

Choose a reason for hiding this comment

tswast Sep 4, 2019

Choose a reason for hiding this comment

shubha-rajan Sep 4, 2019 • edited Loading

Choose a reason for hiding this comment

tswast Sep 4, 2019

Choose a reason for hiding this comment

shubha-rajan Sep 4, 2019

Choose a reason for hiding this comment

shubha-rajan commented Sep 4, 2019

BigQuery: add `--max_results` option to magic #9169

BigQuery: add `--max_results` option to magic #9169

shubha-rajan Sep 4, 2019 •

edited

Loading