docs: add predict sample to samples/snippets/bqml_getting_started_test.py #388

DevStephanie · 2024-02-22T19:04:47Z

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
Ensure the tests and linter pass
Code coverage does not decrease (if any source code was changed)
Appropriate docs were updated (if necessary)

Fixes #<issue_number_goes_here> 🦕

snippet-bot · 2024-02-22T19:04:54Z

Here is the summary of changes.

You are about to add 2 region tags.

samples/snippets/bqml_getting_started_test.py:158, tag bigquery_dataframes_bqml_getting_started_tutorial_predict_by_country
samples/snippets/bqml_getting_started_test.py:228, tag bigquery_dataframes_bqml_getting_started_tutorial_predict_by_visitor

You are about to delete 1 region tag.

samples/snippets/bqml_getting_started_test.py:166, tag bigquery_dataframes_bqml_getting_started_tutorial_predict

This comment is generated by snippet-bot.
If you find problems with this result, please file an issue at:
https://github.com/googleapis/repo-automation-bots/issues.
To update this comment, add snippet-bot:force-run label or use the checkbox below:

Refresh this comment

…t.py

shobsi · 2024-02-26T19:08:58Z

samples/snippets/bqml_getting_started_test.py

+        }
+    )
+    # Use Logistic Regression predict method to, find more information here in
+    # [BigFrames](/bigframes/latest/bigframes.ml.linear_model.LogisticRegression#bigframes_ml_linear_model_LogisticRegression_predict)


Would it result in a clickable link leading to docs.google.com documentation? Asking because in the other place (line 157) we are using absolute https://... path

It will not. We should use absolute path here in comments.

yes, corrected.

FYI: If this has been corrected, your change hasn't been pushed to GitHub yet.

shobsi · 2024-02-26T22:42:34Z

samples/snippets/bqml_getting_started_test.py

+
+    predictions = model.predict(features)
+
+    visitor_id = predictions.groupby(["country"])[["predicted_transactions"]].sum()


Not sure why we call it visitor_id here, looks same as countries few lines above

There is another query that asks for predictions for visitors, so the query looks almost identical except for that one change.

It's call SUM(predicted_label) as total_predicted_purchases in SQL. https://cloud.google.com/bigquery/docs/create-machine-learning-model#run_the_mlpredict_query Let's use the same name as SQL.

yes, will do that.

Yes, will correct that.

tswast · 2024-02-27T18:58:43Z

samples/snippets/bqml_getting_started_test.py

+
+    operatingSystem = df["device"].struct.field("operatingSystem")
+    operatingSystem = operatingSystem.fillna("")
+    isMobile = df["device"].struct.field("isMobile")


Let's use snake_case for this variable.

yes, will correct that.

Do you want me to change that variable name for the earlier part of the code?

Yes, please.

…dataframes into bqml_predict1

…dict1

samples/snippets/bqml_getting_started_test.py

tswast · 2024-02-29T16:27:28Z

samples/snippets/bqml_getting_started_test.py

-            "os": operatingSystem,
-            "is_mobile": isMobile,
+            "os": operating_system,
+            "isMobile": is_mobile,


The "is_mobile" string didn't need to be changed, but if you do you must change it everywhere.

…est.py

tswast · 2024-03-06T19:51:54Z

samples/snippets/bqml_getting_started_test.py

@@ -151,7 +143,7 @@ def test_bqml_getting_started(random_model_id):
    # - log_loss — The loss function used in a logistic regression. This is the measure of how far the
    # model's predictions are from the correct labels.

-    # - roc_auc — The area under the ROC curve. This is the probability that a classifier is more confident that
+    # - roc_auc — The area under the ROC curve. This is the probability that a classifier is morepy confident that


Typo: morepy

Suggested change

# - roc_auc — The area under the ROC curve. This is the probability that a classifier is morepy confident that

# - roc_auc — The area under the ROC curve. This is the probability that a classifier is more confident that

tswast · 2024-03-06T20:13:26Z

samples/snippets/bqml_getting_started_test.py

+            "pageviews": pageviews,
+        }
+    )
+    # Use Logistic Regression predict method to, find more information here in


Incomplete sentence.

gcf-merge-on-green · 2024-03-07T02:28:16Z

Merge-on-green attempted to merge your PR for 6 hours, but it was not mergeable because either one of your required status checks failed, one of your required reviews was not approved, or there is a do not merge label. Learn more about your required status checks here: https://help.github.com/en/github/administering-a-repository/enabling-required-status-checks. You can remove and reapply the label to re-run the bot.

gcf-merge-on-green · 2024-03-07T22:50:16Z

Merge-on-green attempted to merge your PR for 6 hours, but it was not mergeable because either one of your required status checks failed, one of your required reviews was not approved, or there is a do not merge label. Learn more about your required status checks here: https://help.github.com/en/github/administering-a-repository/enabling-required-status-checks. You can remove and reapply the label to re-run the bot.

gcf-merge-on-green · 2024-03-08T05:02:15Z

Merge-on-green attempted to merge your PR for 6 hours, but it was not mergeable because either one of your required status checks failed, one of your required reviews was not approved, or there is a do not merge label. Learn more about your required status checks here: https://help.github.com/en/github/administering-a-repository/enabling-required-status-checks. You can remove and reapply the label to re-run the bot.

DevStephanie added 3 commits January 31, 2024 15:25

docs: Add a sample to demonstrate the evaluation results

4cf9a0e

Adding comments explaining logistic regression results

ffcf185

editing read_gbd explanation

8e5ba68

DevStephanie requested review from a team as code owners February 22, 2024 19:04

DevStephanie requested review from vchudnov-g and shobsi February 22, 2024 19:04

product-auto-label bot added size: m Pull request size is medium. api: bigquery Issues related to the googleapis/python-bigquery-dataframes API. labels Feb 22, 2024

docs: add predict sample to samples/snippets/bqml_getting_started_tes…

202bf76

…t.py

DevStephanie changed the title ~~Bqml predict1~~ docs: add predict sample to samples/snippets/bqml_getting_started_test.py Feb 23, 2024

Merge remote-tracking branch 'origin/main' into bqml_predict1

ca3783f

product-auto-label bot added size: s Pull request size is small. samples Issues that are directly related to samples. and removed size: m Pull request size is medium. labels Feb 23, 2024

Merge branch 'main' into bqml_predict1

d3a8d8d

shobsi reviewed Feb 26, 2024

View reviewed changes

tswast reviewed Feb 27, 2024

View reviewed changes

DevStephanie and others added 7 commits February 27, 2024 13:41

Merge branch 'main' into bqml_predict1

7198e7f

Merge branch 'main' into bqml_predict1

4984cfc

Merge branch 'main' of https://github.com/googleapis/python-bigquery-…

b89f30b

…dataframes into bqml_predict1

Merge branch 'main' into bqml_predict1

0aba4d2

correcting variable names

fb79526

Merge remote-tracking branch 'refs/remotes/origin/main' into bqml_pre…

b6d6430

…dict1

Merge remote-tracking branch 'origin/bqml_predict1' into bqml_predict1

262661c

product-auto-label bot added size: m Pull request size is medium. and removed size: s Pull request size is small. labels Feb 28, 2024

tswast reviewed Feb 28, 2024

View reviewed changes

samples/snippets/bqml_getting_started_test.py Show resolved Hide resolved

tswast reviewed Feb 29, 2024

View reviewed changes

DevStephanie and others added 4 commits March 4, 2024 12:01

Merge branch 'main' into bqml_predict1

f0eaa6c

Merge branch 'main' into bqml_predict2

7f06521

feat: add predict by visit to samples/snippets/bqml_getting_started_t…

ca17b39

…est.py

Merge branch 'bqml_predict2' into bqml_predict1

190cf9e

tswast mentioned this pull request Mar 6, 2024

Bqml predict2 #405

Closed

4 tasks

tswast reviewed Mar 6, 2024

View reviewed changes

DevStephanie added 2 commits March 6, 2024 14:10

file

9df8bdd

file

1a25f5f

tswast reviewed Mar 6, 2024

View reviewed changes

file

daa3bdb

tswast approved these changes Mar 6, 2024

View reviewed changes

Merge branch 'main' into bqml_predict1

bde7a12

tswast added the automerge Merge the pull request once unit tests and other checks pass. label Mar 6, 2024

Merge branch 'main' into bqml_predict1

3613489

gcf-merge-on-green bot removed the automerge Merge the pull request once unit tests and other checks pass. label Mar 7, 2024

Merge branch 'main' into bqml_predict1

249631c

tswast added the automerge Merge the pull request once unit tests and other checks pass. label Mar 7, 2024

tswast requested a review from shobsi March 7, 2024 16:48

Merge branch 'main' into bqml_predict1

6ef78bb

gcf-merge-on-green bot removed the automerge Merge the pull request once unit tests and other checks pass. label Mar 7, 2024

Merge branch 'main' into bqml_predict1

aa6d323

tswast added the automerge Merge the pull request once unit tests and other checks pass. label Mar 7, 2024

gcf-merge-on-green bot removed the automerge Merge the pull request once unit tests and other checks pass. label Mar 8, 2024

Merge branch 'main' into bqml_predict1

defabf8

tswast merged commit 6a3b0cc into main Mar 8, 2024
10 of 13 checks passed

tswast deleted the bqml_predict1 branch March 8, 2024 17:19

release-please bot mentioned this pull request Mar 8, 2024

chore(main): release 0.24.0 #411

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: add predict sample to samples/snippets/bqml_getting_started_test.py #388

docs: add predict sample to samples/snippets/bqml_getting_started_test.py #388

DevStephanie commented Feb 22, 2024

snippet-bot bot commented Feb 22, 2024 •

edited

Loading

shobsi Feb 26, 2024

tswast Feb 27, 2024

DevStephanie Feb 28, 2024

tswast Feb 29, 2024

shobsi Feb 26, 2024

DevStephanie Feb 27, 2024

tswast Feb 27, 2024

DevStephanie Feb 27, 2024

DevStephanie Feb 27, 2024

tswast Feb 27, 2024

DevStephanie Feb 27, 2024

DevStephanie Feb 27, 2024

tswast Feb 27, 2024

tswast Feb 29, 2024

tswast Mar 6, 2024

tswast Mar 6, 2024

gcf-merge-on-green bot commented Mar 7, 2024

gcf-merge-on-green bot commented Mar 7, 2024

gcf-merge-on-green bot commented Mar 8, 2024


		predictions = model.predict(features)

		visitor_id = predictions.groupby(["country"])[["predicted_transactions"]].sum()

	# - roc_auc — The area under the ROC curve. This is the probability that a classifier is morepy confident that
	# - roc_auc — The area under the ROC curve. This is the probability that a classifier is more confident that

docs: add predict sample to samples/snippets/bqml_getting_started_test.py #388

docs: add predict sample to samples/snippets/bqml_getting_started_test.py #388

Conversation

DevStephanie commented Feb 22, 2024

snippet-bot bot commented Feb 22, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gcf-merge-on-green bot commented Mar 7, 2024

gcf-merge-on-green bot commented Mar 7, 2024

gcf-merge-on-green bot commented Mar 8, 2024

snippet-bot bot commented Feb 22, 2024 •

edited

Loading