Releases: googleapis/python-bigquery-dataframes
Releases · googleapis/python-bigquery-dataframes
v1.0.0
1.0.0 (2024-03-25)
⚠ BREAKING CHANGES
- rename model parameter
min_rel_progress
totol
early_stop
setting no longer supported, always usesTrue
- rename model parameter
n_parallell_trees
ton_estimators
- rename
class_weights
toclass_weight
- rename
learn_rate
tolearning_rate
- PCA
n_components
supports float value andNone
, default toNone
- rename various ml model parameters for consistency with sklearn (#491)
Features
- Add configuration option to read_gbq (#401) (85cede2)
- Add ml ARIMAPlus model params (#488) (352cb85)
- Add ml KMeans model params (#477) (23a8d9a)
- Add ml LogisticRegression model params (#481) (f959b65)
- Add ml PCA model params (#474) (fb5d83b)
- Add params for LinearRegression model (#464) (21b2188)
- Add support for Python 3.12 (#231) (df2976f)
- Allow assigning directly to Series.name property (#495) (ad0e99e)
- Ensure
Series.str.len()
can get length of array columns (#497) (10c0446) - Option to use bq connection without check (#460) (0b3f8e5)
- PCA
n_components
supports float value andNone
, default toNone
(65c6f47) - Rename
class_weights
toclass_weight
(65c6f47) - Rename
learn_rate
tolearning_rate
(65c6f47) - Rename model parameter
min_rel_progress
totol
(65c6f47) - Rename model parameter
n_parallell_trees
ton_estimators
(65c6f47) - Rename various ml model parameters for consistency with sklearn (#491) (65c6f47)
- Support BQ regional endpoints for europe-west9, europe-west3, us-east4, and us-west1 (#504) (fbada4a)
- Support dataframe.cov (#498) (c4beafd)
- Support Series.dt.floor (#493) (2dd01c2)
- Support Series.dt.normalize (#483) (0bf1e91)
- Update plot sample to 1000 rows (#458) (60d4a7b)
Bug Fixes
early_stop
setting no longer supported, always usesTrue
(65c6f47)- Fix -1 offset lookups failing (#463) (2dfb9c2)
- Plot.scatter
c
argument functionalities (#494) (d6ee994) - Properly support format param for numerical input. (#486) (ae20c35)
- Renable to_csv and to_json related tests (#468) (2b9a01d)
- Sampling plot cannot preserve ordering if index is not ordered (#475) (a5345fe)
- Use actual BigQuery types rather than ibis types in to_pandas (#500) (82b4f91)
Dependencies
Documentation
- Add code samples for metrics.{accuracy_score, confusion_matrix} (#478) (3e3329a)
- Add code samples for metrics.{recall_score, precision_score, f11_score} (#502) (370fe90)
- Improve API documentation (#489) (751266e)
- Update bigquery connection documentation (#499) (4bfe094)
- Update LLM + K-means notebook to handle partial failures (#496) (97afad9)
v0.26.0
0.26.0 (2024-03-20)
⚠ BREAKING CHANGES
- exclude remote models for .register() (#465)
Features
- (Series|DataFrame).plot (#438) (1c3e668)
read_gbq_table
supportsLIKE
as a operator infilters
(#454) (d2d425a)- Add DataFrame.pipe() method (#421) (95f5a6e)
- Set
force=True
by default inDataFrame.peek()
(#469) (4e8e97d) - Support datetime related casting in (Series|DataFrame|Index).astype (#442) (fde339b)
- Support Series.dt.strftime (#453) (8f6e955)
Bug Fixes
- Any() on empty set now correctly returns False (#471) (f55680c)
- Df.drop_na preserves columns dtype (#457) (3bab1a9)
- Disable to_json and to_csv related tests (#462) (874026d)
- Exclude remote models for .register() (#465) (73fe0f8)
- Fix broken link in covid notebook (#450) (adadb06)
- Fix broken multiindex loc cases (#467) (b519197)
- Fix grouping series on multiple other series (#455) (3971bd2)
- Groupby aggregates no longer check if grouping keys are numeric (#472) (4fbf938)
- Raise
ValueError
whenread_pandas()
receives a bigframesDataFrame
(#447) (b28f9fd) - Series.(to_csv|to_json) leverages bq export (#452) (718a00c)
- Warn when
read_gbq
/read_gbq_table
uses the snapshot time cache (#441) (e16a8c0)
Documentation
- Add code samples for
ml.metrics.r2_score
(#459) (85fefa2) - Add the docs for loc and iloc indexers (#446) (14ab8d8)
- Add the pages for at and iat indexers (#456) (340f0b5)
- Add version information to bug template (#437) (91bd39e)
- Indicate that project and location are optional in example notebooks (#451) (1df0140)
v0.25.0
v0.24.0
0.24.0 (2024-03-12)
⚠ BREAKING CHANGES
read_parquet
uses a "pandas" engine to parse files by default. Useengine="bigquery"
for the previous behavior
Features
- (Series|Dataframe).plot.hist() (#420) (4aadff4)
- Add detect_anomalies to ml ARIMAPlus and KMeans models (#426) (6df28ed)
- Add engine parameter to
read_parquet
(#413) (31325a1) - Add ml PCA.detect_anomalies method (#422) (8d82945)
- Support BYOSA in
remote_function
(#407) (d92ced2) - Support CMEK for BQ tables (#403) (9a678e3)
Bug Fixes
- Move
third_party.bigframes_vendored
tobigframes_vendored
(#424) (763edeb) - Only do row identity based joins when joining by index (#356) (76b252f)
- Read_pandas inline respects location (#412) (ae0e3ea)
Documentation
v0.23.0
0.23.0 (2024-03-05)
Features
- Add ml.metrics.pairwise.euclidean_distance (#397) (1726588)
- Add TextEmbedding model version support (#394) (e0f1ab0)
Bug Fixes
- Code exception in
remote_function
now prevents retry and surfaces in the client (#387) (dd3643d) - Docs link for metrics.pairwise (#400) (a60aba7)
Dependencies
Documentation
v0.22.0
0.22.0 (2024-02-27)
⚠ BREAKING CHANGES
- rename cosine_similarity to paired_cosine_distances (#393)
- move model optional args to kwargs (#381)
Features
- Add
DataFrames.corr()
method (#379) (67fd434) - Add ml.metrics.pairwise.manhattan_distance (#392) (9d31865)
- Enable regional endpoints for me-central2 (#386) (469674d)
Bug Fixes
- Avoid ibis warning for "database" table() method argument (#390) (a0490a4)
- Correct the numeric literal dtype (#365) (93b02cd)
- Rename cosine_similarity to paired_cosine_distances (#393) (81ece46)
Performance Improvements
Dependencies
Documentation
- Add a code sample for creating a kmeans model (#267) (4291d65)
- Fix
bigframes.pandas.concat
documentation (#382) (234b61c)
Miscellaneous Chores
Code Refactoring
v0.21.0
0.21.0 (2024-02-13)
Features
- Add
Series.cov
method (#368) (443db22) - Add ml.llm.GeminiTextGenerator model (#370) (de1e0a4)
- Add ml.metrics.pairwise.cosine_similarity function (#374) (126f566)
- Add XGBoostModel (#363) (d5518b2)
- Limited support of lambdas in
Series.apply
(#345) (208e081) - Support bigframes.pandas.to_datetime for scalars, iterables and series. (#372) (ffb0d15)
- Support read_gbq wildcard table path (#377) (90caf86)
Bug Fixes
Documentation
v0.20.1
v0.20.0
0.20.0 (2024-01-30)
Features
- Add
DataFrame.peek()
as an efficient alternative tohead()
results preview (#318) (9c34d83) - Add ARIMA_EVAULATE options in forecasting models (#336) (73e997b)
- Add Index constructor, repr, copy, get_level_values, to_series (#334) (e5d054e)
- Improve error message for drive based BQ table reads (#344) (0794788)
- Update cut to work without labels = False and show intervals as dict (#335) (4ff53db)
Bug Fixes
- Chance default connection name in getting_started.ipnyb (#347) (677f014)
- Series iteration correctly returns values instead of index (#339) (2c6af9b)