Releases: googleapis/python-bigquery-dataframes

v1.0.0

25 Mar 17:17
1e2cd74

1.0.0 (2024-03-25)

⚠ BREAKING CHANGES

  • rename model parameter min_rel_progress to tol
  • early_stop setting no longer supported, always uses True
  • rename model parameter n_parallell_trees to n_estimators
  • rename class_weights to class_weight
  • rename learn_rate to learning_rate
  • PCA n_components supports float values and None, and defaults to None
  • rename various ml model parameters for consistency with sklearn (#491)
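
Call sites that still pass the old keyword names need updating before upgrading. The following is a minimal sketch of one way to do that, assuming the old and new spellings are exactly those in the list above. `migrate_model_kwargs` is a hypothetical helper, not part of bigframes, and #491 may rename parameters that are not listed here.

```python
# Old-to-new keyword names, copied from the 1.0.0 breaking-change list.
RENAMED_PARAMS = {
    "min_rel_progress": "tol",
    "n_parallell_trees": "n_estimators",
    "class_weights": "class_weight",
    "learn_rate": "learning_rate",
}

# Settings the release notes say are no longer supported.
REMOVED_PARAMS = {"early_stop"}


def migrate_model_kwargs(kwargs):
    """Return a copy of kwargs that uses the 1.0.0 names (hypothetical helper)."""
    migrated = {}
    for name, value in kwargs.items():
        if name in REMOVED_PARAMS:
            # early_stop is dropped; per the notes it now always behaves as True.
            continue
        new_name = RENAMED_PARAMS.get(name, name)
        if new_name in migrated:
            raise ValueError(f"both old and new spellings given for {new_name!r}")
        migrated[new_name] = value
    return migrated
```

For example, `migrate_model_kwargs({"learn_rate": 0.1, "early_stop": False})` yields `{"learning_rate": 0.1}`, which can then be unpacked into a model constructor.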

Features

  • Add configuration option to read_gbq (#401) (85cede2)
  • Add ml ARIMAPlus model params (#488) (352cb85)
  • Add ml KMeans model params (#477) (23a8d9a)
  • Add ml LogisticRegression model params (#481) (f959b65)
  • Add ml PCA model params (#474) (fb5d83b)
  • Add params for LinearRegression model (#464) (21b2188)
  • Add support for Python 3.12 (#231) (df2976f)
  • Allow assigning directly to Series.name property (#495) (ad0e99e)
  • Ensure Series.str.len() can get length of array columns (#497) (10c0446)
  • Option to use bq connection without check (#460) (0b3f8e5)
  • PCA n_components supports float values and None, and defaults to None (65c6f47)
  • Rename class_weights to class_weight (65c6f47)
  • Rename learn_rate to learning_rate (65c6f47)
  • Rename model parameter min_rel_progress to tol (65c6f47)
  • Rename model parameter n_parallell_trees to n_estimators (65c6f47)
  • Rename various ml model parameters for consistency with sklearn (#491) (65c6f47)
  • Support BQ regional endpoints for europe-west9, europe-west3, us-east4, and us-west1 (#504) (fbada4a)
  • Support dataframe.cov (#498) (c4beafd)
  • Support Series.dt.floor (#493) (2dd01c2)
  • Support Series.dt.normalize (#483) (0bf1e91)
  • Update plot sample to 1000 rows (#458) (60d4a7b)

Bug Fixes

  • early_stop setting no longer supported, always uses True (65c6f47)
  • Fix -1 offset lookups failing (#463) (2dfb9c2)
  • Plot.scatter c argument functionalities (#494) (d6ee994)
  • Properly support format param for numerical input. (#486) (ae20c35)
  • Re-enable to_csv and to_json related tests (#468) (2b9a01d)
  • Sampling plot cannot preserve ordering if index is not ordered (#475) (a5345fe)
  • Use actual BigQuery types rather than ibis types in to_pandas (#500) (82b4f91)

Dependencies

Documentation

  • Add code samples for metrics.{accuracy_score, confusion_matrix} (#478) (3e3329a)
  • Add code samples for metrics.{recall_score, precision_score, f1_score} (#502) (370fe90)
  • Improve API documentation (#489) (751266e)
  • Update bigquery connection documentation (#499) (4bfe094)
  • Update LLM + K-means notebook to handle partial failures (#496) (97afad9)

v0.26.0

20 Mar 04:32
4727563

0.26.0 (2024-03-20)

⚠ BREAKING CHANGES

  • exclude remote models for .register() (#465)

Features

  • (Series|DataFrame).plot (#438) (1c3e668)
  • read_gbq_table supports LIKE as an operator in filters (#454) (d2d425a)
  • Add DataFrame.pipe() method (#421) (95f5a6e)
  • Set force=True by default in DataFrame.peek() (#469) (4e8e97d)
  • Support datetime related casting in (Series|DataFrame|Index).astype (#442) (fde339b)
  • Support Series.dt.strftime (#453) (8f6e955)

Bug Fixes

  • Any() on empty set now correctly returns False (#471) (f55680c)
  • Df.drop_na preserves columns dtype (#457) (3bab1a9)
  • Disable to_json and to_csv related tests (#462) (874026d)
  • Exclude remote models for .register() (#465) (73fe0f8)
  • Fix broken link in covid notebook (#450) (adadb06)
  • Fix broken multiindex loc cases (#467) (b519197)
  • Fix grouping series on multiple other series (#455) (3971bd2)
  • Groupby aggregates no longer check if grouping keys are numeric (#472) (4fbf938)
  • Raise ValueError when read_pandas() receives a bigframes DataFrame (#447) (b28f9fd)
  • Series.(to_csv|to_json) leverages bq export (#452) (718a00c)
  • Warn when read_gbq / read_gbq_table uses the snapshot time cache (#441) (e16a8c0)

Documentation

  • Add code samples for ml.metrics.r2_score (#459) (85fefa2)
  • Add the docs for loc and iloc indexers (#446) (14ab8d8)
  • Add the pages for at and iat indexers (#456) (340f0b5)
  • Add version information to bug template (#437) (91bd39e)
  • Indicate that project and location are optional in example notebooks (#451) (1df0140)

v0.25.0

14 Mar 17:23
56de982

0.25.0 (2024-03-14)

Features

  • (Series|DataFrame).plot.(line|area|scatter) (#431) (0772510)
  • Support CMEK for remote_function cloud functions (#430) (2fd69f4)

v0.24.0

13 Mar 00:55
cba21ba

0.24.0 (2024-03-12)

⚠ BREAKING CHANGES

  • read_parquet uses a "pandas" engine to parse files by default. Use engine="bigquery" for the previous behavior
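
Code that depended on the old default can pin the engine in one place instead of editing every call. The following is a minimal sketch under that assumption. `pin_bigquery_engine` is a hypothetical helper, not part of bigframes, and relies only on the reader accepting the `engine` keyword named above.

```python
from functools import partial


def pin_bigquery_engine(read_parquet):
    """Wrap a read_parquet-style callable so it defaults to engine="bigquery".

    Hypothetical helper: callers can still override the engine by passing
    the keyword explicitly, because partial keywords are only defaults.
    """
    return partial(read_parquet, engine="bigquery")
```

The wrapped callable would be created once, for example from the session's `read_parquet`, and reused wherever the pre-0.24.0 behavior is required.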

Features

Bug Fixes

  • Move third_party.bigframes_vendored to bigframes_vendored (#424) (763edeb)
  • Only do row identity based joins when joining by index (#356) (76b252f)
  • Read_pandas inline respects location (#412) (ae0e3ea)

Documentation

  • Add predict sample to samples/snippets/bqml_getting_started_test.py (#388) (6a3b0cc)
  • Document minimum IAM requirement (#416) (36173b0)
  • Fix the note rendering for DataFrame methods: nlargest, nsmallest (#417) (38bd2ba)

v0.23.0

05 Mar 19:21
45e6229

0.23.0 (2024-03-05)

Features

  • Add ml.metrics.pairwise.euclidean_distance (#397) (1726588)
  • Add TextEmbedding model version support (#394) (e0f1ab0)

Bug Fixes

  • Code exception in remote_function now prevents retry and surfaces in the client (#387) (dd3643d)
  • Docs link for metrics.pairwise (#400) (a60aba7)

Dependencies

  • Update ibis to version 8.0.0 and refactor remote_function to use ibis UDF method (#277) (350499b)

Documentation

  • Update README to point to new summary pages (#402) (bfe2b23)

v0.22.0

27 Feb 23:00
20de7f4

0.22.0 (2024-02-27)

⚠ BREAKING CHANGES

  • rename cosine_similarity to paired_cosine_distances (#393)
  • move model optional args to kwargs (#381)
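
The rename matters because a distance and a similarity are not the same quantity. The following pure-Python sketch shows what a paired cosine distance computes, assuming the function follows the scikit-learn convention of the same name (one distance per row pair, equal to 1 minus the cosine similarity). It is an illustration only, not the bigframes implementation, and `paired_cosine_distances_sketch` is a made-up name.

```python
import math


def paired_cosine_distances_sketch(xs, ys):
    """Cosine distance between xs[i] and ys[i] for each i (illustration only)."""
    if len(xs) != len(ys):
        raise ValueError("xs and ys must contain the same number of vectors")
    distances = []
    for x, y in zip(xs, ys):
        dot = sum(a * b for a, b in zip(x, y))
        norm_x = math.sqrt(sum(a * a for a in x))
        norm_y = math.sqrt(sum(b * b for b in y))
        # Distance is 1 - similarity; zero vectors are not handled here
        # and would raise ZeroDivisionError.
        distances.append(1.0 - dot / (norm_x * norm_y))
    return distances
```

Orthogonal vectors give a distance of 1, and vectors pointing in the same direction give a distance close to 0.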

Features

  • Add DataFrame.corr() method (#379) (67fd434)
  • Add ml.metrics.pairwise.manhattan_distance (#392) (9d31865)
  • Enable regional endpoints for me-central2 (#386) (469674d)

Bug Fixes

  • Avoid ibis warning for "database" table() method argument (#390) (a0490a4)
  • Correct the numeric literal dtype (#365) (93b02cd)
  • Rename cosine_similarity to paired_cosine_distances (#393) (81ece46)

Performance Improvements

Dependencies

  • Add minimum version constraint for sqlglot to 19.9.0 (#389) (8b62d77)

Documentation

  • Add a code sample for creating a kmeans model (#267) (4291d65)
  • Fix bigframes.pandas.concat documentation (#382) (234b61c)

Miscellaneous Chores

Code Refactoring

v0.21.0

13 Feb 22:09
747e5f6

0.21.0 (2024-02-13)

Features

  • Add Series.cov method (#368) (443db22)
  • Add ml.llm.GeminiTextGenerator model (#370) (de1e0a4)
  • Add ml.metrics.pairwise.cosine_similarity function (#374) (126f566)
  • Add XGBoostModel (#363) (d5518b2)
  • Limited support of lambdas in Series.apply (#345) (208e081)
  • Support bigframes.pandas.to_datetime for scalars, iterables and series. (#372) (ffb0d15)
  • Support read_gbq wildcard table path (#377) (90caf86)

Bug Fixes

Documentation

  • Clarify ADC pre-auth in a non-interactive environment (#348) (99a9e6e)

v0.20.1

06 Feb 21:21
91596b8

0.20.1 (2024-02-06)

Performance Improvements

  • Make repr cache the block where appropriate (#350) (068879f)

Documentation

  • Add a sample to demonstrate the evaluation results (#364) (cff0919)
  • Fix the DataFrame.apply code sample (#366) (1866a26)

v0.20.0

30 Jan 20:58
18efb83

0.20.0 (2024-01-30)

Features

  • Add DataFrame.peek() as an efficient alternative to head() results preview (#318) (9c34d83)
  • Add ARIMA_EVALUATE options in forecasting models (#336) (73e997b)
  • Add Index constructor, repr, copy, get_level_values, to_series (#334) (e5d054e)
  • Improve error message for drive based BQ table reads (#344) (0794788)
  • Update cut to work without labels = False and show intervals as dict (#335) (4ff53db)

Bug Fixes

  • Change default connection name in getting_started.ipynb (#347) (677f014)
  • Series iteration correctly returns values instead of index (#339) (2c6af9b)

Documentation

  • Add code samples for Series.{between, cumprod} (#353) (09a52fd)

v0.19.2

22 Jan 19:08
ae43905

0.19.2 (2024-01-22)

Bug Fixes

  • Read_gbq large response issue (#332) (b8178b9)
  • Use object dtype for ARRAY columns in to_pandas() with pandas 1.x (#329) (374ddb5)

Documentation