Skip to content

Releases: delta-io/delta-rs

rust-v0.5.0

28 Nov 16:23
cf84886
Compare
Choose a tag to compare

What's Changed

  • Add max and min values to Statistics by @viirya in #327
  • Use WebIdentityProvider for DynamoDb client in k8s by @rusty-jules in #328
  • bump rust version in preparation for the next release by @houqp in #329
  • fix automated rust release CD job by @houqp in #326
  • add pandas keyword to python package metadata by @houqp in #325
  • expose update_incremental API to python binding by @houqp in #332
  • update python related docs by @houqp in #331
  • Upgrade arrow, parquet and datafusion by @Dandandan in #335
  • added warning message if the detected glibc version is < 2.28 by @Smurphy000 in #334
  • Convert scalar value to correct type based on arrow data type. by @viirya in #336
  • Fix consecutive checkpoints by @mosyp in #333
  • Fix new clippy warnings coming up in CI by @xianwill in #341
  • perform incremental update after transaction commit by @houqp in #343
  • Add timestamp handling to checkpoint writer by @xianwill in #340
  • Add clear table state in load_version when no checkpoint found. by @zijie0 in #347
  • Low level create table by @Smurphy000 in #342
  • pub DeltaTable method to retrieve table configurations by @Smurphy000 in #356
  • Modify partition_values field type in Add/Remove actions. by @zijie0 in #354
  • fix sleep workaround in checkpoint test by @houqp in #360
  • Modify get_files_by_partitions to use partition values by @zijie0 in #362
  • Fix get_latest_version returning version < 0. by @zijie0 in #364
  • fix typo in python release CI config by @houqp in #365
  • cache cargo builds by @houqp in #359
  • Add '.tmp' suffix to temporary file of prepared commit by @mosyp in #366
  • support partition value string deserialization for float/double/date by @houqp in #363
  • Implement atomic put_obj. by @zijie0 in #367
  • Make Format.options to be required field by @mosyp in #370
  • Allow filesystem backend put_obj to overwrite existing by @mosyp in #376
  • Wrap DeltaTransactionError with DeltaTableError. by @zijie0 in #374
  • Refactoring of black, isort, mypy tools usages into pyproject.toml by @fvaleye in #378
  • Implement consistent behavior in Windows with regard to swap parameter. by @zijie0 in #379
  • Merge Cargo.toml into pyproject.toml by @fvaleye in #381
  • Update datafusion and ballista links in README by @ei-grad in #382
  • Add sts assume role credentials provider for S3 by @mosyp in #383
  • Reuse table/storage instances in checkpoints by @mosyp in #384
  • additional error handling to atomic_rename by @Marnixvdb in #386
  • Upgrade to DataFusion 5.0 by @Dandandan in #389
  • added initial commit info on create method for a DeltaTable by @Smurphy000 in #387
  • Google cloud by @blogle in #355
  • Remove version param from create_checkpoint_from_table by @mosyp in #399
  • Implement delete_objs in fs and s3 storage backends. by @zijie0 in #395
  • Add examples for reading delta table with Rust API. by @zijie0 in #400
  • Update pyproject definition in pyproject.toml by @fvaleye in #405
  • Use tokio::fs::rename in put_obj. by @zijie0 in #403
  • Fix duplicates on update call by @mosyp in #398
  • Add a Makefile build task in the Python binding by @fvaleye in #410
  • Add implementation for load_with_datetime in Python package. by @zijie0 in #411
  • Add filesystem argument for reading DeltaTable in Python binding by @fvaleye in #414
  • Fix reading nullable action fields from parquet by @mosyp in #417
  • Ensures that all table schemas are of StructType by @blogle in #415
  • Gcs writer bugs by @blogle in #412
  • Add S3StorageOptions to allow configuring S3 backend explicitly by @xianwill in #418
  • Read a DeltaTable using a Data Catalog by @fvaleye in #419
  • Change checkpoint creation logs from info to debug by @mosyp in #423
  • Add LICENSE file in the Python binding and refer it in the pyproject by @fvaleye in #422
  • Audit action field optionality by @fvaleye in #380
  • Introduce DeltaConfig and tombstones retention policy by @mosyp in #420
  • [README] Replace the inactive rust-dataframe with polars by @sa- in #426
  • Bump arrow to 6.0.0-SNAPSHOT and bring map support to schema by @mosyp in #375
  • Support partition value string deserialization for timestamp/binary by @zijie0 in #371
  • Document the valid primitive types by @Ekleog in #430
  • Add is_non_acquirable field to the dynamodb lock by @mosyp in #429
  • Clean up DeltaTransactionError by @mosyp in #432
  • Optimize remove action apply with early iteration exit #424 by @akshay26031996 in #431
  • Decode path in Add and Remove actions. by @zijie0 in #434
  • reenable datafusion integration with temporary fork by @houqp in #436
  • Add history command in delta-rs by @fvaleye in #428
  • Release Python binding version 0.5.3 by @fvaleye in #439
  • Add delete_lock and fix release_lock by @mosyp in #440
  • Fixing test to compare sorted vec by @akshay26031996 in #443
  • Batch-apply remove actions in tombstone handling by @dispanser in #444
  • Update datafusion links by @bbigras in #446
  • Run all tests under s3 feature flag by @mosyp in #447
  • Add maturin develop command with extras in Python binding by @fvaleye in #448
  • README: mark Checkpoint creation as done for Rust by @bbigras in #449
  • Fix broken tombstones metadata when extended_file_metadata is different between tomstones in state by @mosyp in #450
  • No tombstone loading by @dispanser in #445
  • return lazy iterator in get tombstone methods by @houqp in #452
  • Generate new session name on assume role credentials provider refresh by @mosyp in #451
  • Add pool_idle_timeout options for s3 and sts clients by @mosyp in #458
  • Do action reconciliation by @viirya in #456
  • Use action default stats by @viirya in #459
  • Add new module for DeltaTableState by @viirya in #464
  • Support hash lookup by path string for Remove action by @viirya in #462
  • Fix coverage of the Python tests by @fvaleye in #467
  • materialize tables in pyhton via native storage backend by @roeap in #463
  • Make file storage backend's atomic rename async by @viirya in #471
  • Add GCS feature to the Python Cargo.toml file by @kelvins in #476
  • Throw an error when filter key is not in partitioned columns. by @zijie0 in #475
  • Fix documentation for the DeltaStorageHandler by @fvaleye in #483
  • Update README.adoc by @dennyglee in #482
  • Update az...
Read more

python-v0.6.4

28 Nov 18:14
0a6ce46
Compare
Choose a tag to compare

What's Changed

  • fix(python): enforce reader protocol version by @wjones127 in #932
  • feat: rewrite operations by @roeap in #852
  • chore: bump datafusion and arrow by @roeap in #940
  • Allow for reading columns as dictionaries using to_pyarrow_dataset by @Kuhlwein in #941
  • Add TableProviderFactory and test for SQL to register tables dynamically at runtime by @avantgardnerio in #892
  • feat: improve write perfromance of DeltaFileSystemHandler by @roeap in #943
  • Update CONTRIBUTING.md by @wjones127 in #944
  • build(deps): bump serde_json from 1.0.87 to 1.0.88 by @dependabot in #947
  • build(deps): bump reqwest from 0.11.12 to 0.11.13 by @dependabot in #946
  • docs: add simple operations example by @roeap in #953
  • bump rust core version to 0.5.0 by @houqp in #961
  • pin glic_version version in dep to unblock creates.io release by @houqp in #967
  • remove all wildcard version pin in Cargo.toml by @houqp in #968
  • Bump version of the Python binding to 0.6.4 by @fvaleye in #970

New Contributors

Full Changelog: python-v0.6.3...python-v0.6.4

python-v0.6.3

09 Nov 17:33
760b30a
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: python-v0.6.2...python-v0.6.3

What's Changed

New Contributors

Full Changelog: python-v0.6.2...python-v0.6.3

python-v0.6.2

11 Oct 06:14
5ce4a40
Compare
Choose a tag to compare

What's Changed

Full Changelog: python-v0.6.1...python-v0.6.2

python-v0.6.1

06 Sep 15:55
45a0404
Compare
Choose a tag to compare

What's Changed

  • feat: add gcs integration tests by @roeap in #779
  • build(deps): bump lz4-sys from 1.9.2 to 1.9.4 in /aws/delta-checkpoint by @dependabot in #782
  • build(deps): bump lz4-sys from 1.9.2 to 1.9.4 in /delta-inspect by @dependabot in #783
  • build(deps): bump tokio from 1.20.1 to 1.21.0 by @dependabot in #790
  • build(deps): bump thiserror from 1.0.32 to 1.0.34 by @dependabot in #792
  • build(deps): bump pretty_assertions from 1.2.1 to 1.3.0 by @dependabot in #791
  • build(deps): bump anyhow from 1.0.62 to 1.0.64 by @dependabot in #793
  • build(deps): bump env_logger from 0.7.1 to 0.9.0 by @dependabot in #794
  • hotfix: python object store paths by @roeap in #787
  • prepare python release 0.6.1 by @roeap in #795

Full Changelog: python-v0.6.0...python-v0.6.1

python-v0.6.0

31 Aug 16:30
174cf84
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: python-v0.5.8...python-v0.6.0

python-v0.5.8

23 Jun 19:46
27c0cbb
Compare
Choose a tag to compare

What's Changed

  • Expose read and write options in public API by @george-zubrienko in #581
  • [proof] make sure lock at least expires once by @houqp in #591
  • Python API - delta.appendOnly enforcement by @WarSame in #590
  • Avoid building pandas and numpy from source by @wjones127 in #595
  • Introduce require_files for tracking the add files in table state by @mosyp in #594
  • Make sure pandas is optional by @wjones127 in #597
  • High level Delta Operations with Datafusion by @roeap in #584
  • Re-enable datafusion tests and improve supported types. by @roeap in #601
  • default to root for empty path in azure store by @roeap in #603
  • publish dynamodb_lock to crates.io by @houqp in #605
  • Configure Azure storage using a map (#555) by @Blajda in #598
  • Azure options by @roeap in #606
  • Update rusoto dependencies to 0.48 by @ahmedriza in #611
  • upgrade to datafusion 8 by @houqp in #612
  • fix: cap sphinx version to avoid bug in 5.0 by @wjones127 in #615
  • Provide Python aarch64 wheels for Linux. by @fvaleye in #613
  • Refactoring of the Python release Github action by @fvaleye in #616
  • fix: Use relative paths for add paths by @wjones127 in #618
  • Bin packing optimization by @Blajda in #607
  • feat: impl rename_noreplace with std::fs::hard_link by default by @wjones127 in #621
  • feat(python): validate schema in write_deltalake by @wjones127 in #624
  • Fix the AWS_REGION environment variable configuration in S3 backend by @fvaleye in #633
  • Refactor azure storage with crate updates by @roeap in #644
  • Defer creation of storage backend in DeltaTableBuilder by @Blajda in #639
  • fix: Add correct size and null paritition values to add actions by @wjones127 in #625
  • Bump flatbuffers from 0.8.4 to 2.1.2 in /aws/delta-checkpoint by @dependabot in #626
  • Bump hyper from 0.14.9 to 0.14.19 in /aws/delta-checkpoint by @dependabot in #628
  • Bump regex from 1.5.4 to 1.5.5 in /aws/delta-checkpoint by @dependabot in #629
  • Bump regex from 1.5.4 to 1.5.6 in /delta-inspect by @dependabot in #630
  • Bump thread_local from 1.1.3 to 1.1.4 in /aws/delta-checkpoint by @dependabot in #646
  • fix: Prevent warning spam when reading tables generated by delta 1.2.1 by @Tom-Newton in #651
  • refactor: move version field to DeltaTableState by @roeap in #649
  • feat: add enforce_retention_duration param to vacuum method by @houqp in #648
  • fix: read vacuumed delta log without _last_checkpoint by @roeap in #643
  • feat: Upgrade to arrow/parquet 15 and datafusion 9 by @xianwill in #652
  • Release of the Python binding version 0.5.8 by @fvaleye in #640

New Contributors

Full Changelog: python-v0.5.7...python-v0.5.8

python-v0.5.7

20 Apr 07:27
4aad6aa
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: python-v0.5.6...python-v0.5.7

python-v0.5.6

06 Feb 20:41
2fa81c6
Compare
Choose a tag to compare
  • Bump version of Python binding to 0.5.6 (#558)
  • Move delta-inspect to its own crate (#557)
  • Fix VACUUM by using table_uri when filtering files to delete (#551)
  • Formally verify S3 atomic rename (#540)
  • Implement missing Azure storage backend methods (#499)
  • Implement polling for table updates (#550)
  • Add target in Python release Github action workflow. (#548)

Credits:
QP Hou, Thomas Vollmer, David Blajda, Florian Valeye

Full Changelog: python-v0.5.5...python-v0.5.6

python-v0.5.5

24 Jan 17:42
8fb45b2
Compare
Choose a tag to compare
  • Add storage options for backends (#544)
  • Remove coupling of DynamoDbLockClient from S3 storage (#535)
  • add macOS 11 support in python binding release (#541)
  • Refresh Python usage documentation (#539)
  • [Python] Create PyArrow dataset fragments from delta log (#525)
  • Fix Delta metadata transaction schema (#531)
  • Add gcs test and improve credential error (#533)
  • Return complete history (#526)
  • Move dynamodb lock into its own crate (#508)
  • Add datafusion examples to docs (#519)
  • Fix S3 list_objs and cleanup_metadata (#518)
  • Add support for creating List and Map schema types (#517)
  • Update datafusion version to 6 (#516)
  • Retry S3 get request on 500 Internal Server Error (#510)
  • Fix memory overhead when creating checkpoint (#502)
  • Fix nullable partition values (#498)
  • Fix cleanup_expired_logs timestamp (#503)
  • Add bool config enableExpiredLogCleanup. (#500)
  • pin arrow to major version (#501)

Credits:
Florian Valeye, ahmedriza, Will Jones, Liang-Chi Hsieh, Gabriel J. Michael, Matthew Turner, Mykhailo Osypov, Andrei Ionescu, QP Hou

Full Changelog: python-v0.5.4...python-v0.5.5