Releases: Eventual-Inc/Daft
Releases · Eventual-Inc/Daft
v0.4.2
What's Changed 🚀
- build: Publish A Long Term Support CPU Release of Daft @samster25 (#3650)
✨ Features
- feat(connect):
printSchema
@andrewgazelka (#3617) - feat: Allow building probe table for either side of anti semi joins @colin-ho (#3643)
- feat(optimizer): Add join reordering as an optimizer rule @desmondcheongzx (#3642)
- feat(swordfish): Memory manager @colin-ho (#3599)
- feat(scantask-2): Implement new module for splitting Parquet ScanTask @jaychia (#3628)
- feat(scantask-1): add a config flag for new scantask splitting algorithm @jaychia (#3615)
- feat: Support intersect all and except distinct/all in DataFrame API @advancedxy (#3537)
- feat: support new PyIceberg IO properties and custom IOConfig in write_iceberg @kevinzwang (#3633)
- feat(expressions): Extend Expression.url.upload() to support row-specific URLs @desmondcheongzx (#3518)
🐛 Bug Fixes
- fix: special characters in GCS urls @kevinzwang (#3651)
- fix(swordfish): Track future poll times for explain analyze @colin-ho (#3511)
👷 CI
🔧 Maintenance
- chore: update PyO3 version to 0.23 @kevinzwang (#3647)
- chore: Fix parquet benchmark test @colin-ho (#3632)
- chore: Clean up join order iteration @desmondcheongzx (#3638)
⬆️ Dependencies
- build(deps-dev): bump moto[s3,server] from 5.0.21 to 5.0.26 @dependabot (#3640)
Full Changelog: v0.4.1...v0.4.2
v0.4.1
What's Changed 🚀
✨ Features
- feat(optimizer): Implement naive join ordering @desmondcheongzx (#3616)
- feat(connect): add more unresolved functions @andrewgazelka (#3618)
- feat(connect):
with_columns_renamed
@andrewgazelka (#3386) - feat(connect): read/write → csv, write → json @andrewgazelka (#3361)
🐛 Bug Fixes
🚀 Performance
- perf(optimizer): convert filter predicate to CNF to push through join @kevinzwang (#3623)
📖 Documentation
- docs: daft documentation v2 @ccmao1130 (#3595)
✅ Tests
- test(connect): verify
show()
output @andrewgazelka (#3610)
👷 CI
- ci: Output results in a CSV format @raunakab (#3625)
- ci: Add build step to run-cluster @raunakab (#3606)
🔧 Maintenance
- chore: Build progress bar only on first update @colin-ho (#3626)
- chore: Fix csv benchmark test @colin-ho (#3631)
Full Changelog: v0.4.0...v0.4.1
v0.4.0
What's Changed 🚀
💥 Breaking Changes
- feat: Default native runner @colin-ho (#3608)
- chore!: upgrade Ray pins and pyarrow pins @jaychia (#3612)
- chore!: drop support for Python 3.8 @kevinzwang (#3592)
- chore!: remove pyarrow-based file reader @kevinzwang (#3587)
✨ Features
- feat: Default native runner @colin-ho (#3608)
- feat(swordfish): Progress Bar @colin-ho (#3571)
- feat(connect): df.show @universalmind303 (#3560)
- feat(connect): support
DdlParse
@andrewgazelka (#3580) - feat(swordfish): Optimize grouped aggregations @colin-ho (#3534)
- feat(swordfish): Enable left/right joins to build probe table on either side @colin-ho (#3548)
- feat: Add DataType inference from Python types @jaychia (#3555)
- feat(shuffles): Locality aware pre shuffle merge @colin-ho (#3505)
- feat: Implement count-distinct for sql @raunakab (#3553)
- feat(connect): add drop support @andrewgazelka (#3345)
- feat: support for basic subquery execution @kevinzwang (#3536)
- feat(connect): add
df.filter
@andrewgazelka (#3346) - feat: Make serialization code not unwrap and panic on failures @raunakab (#3546)
- feat: Unity Catalog writes using
daft.DataFrame.write_deltalake()
@anilmenon14 (#3522) - feat(connect): add parquet support @andrewgazelka (#3360)
- feat: Add iterators to more types @raunakab (#3539)
- feat(optimizer): Add scaffolding to create join graphs from logical plans @desmondcheongzx (#3501)
- feat(tpcds-benchmarking): Add basic tpcds benchmarking for local testing @raunakab (#3509)
- feat(list): add fixed-size list support for value_counts @andrewgazelka (#3521)
- feat(parquet): Limit parallel tasks in remote parquet reader @colin-ho (#3490)
- feat(parquet): Target parquet writes by size bytes instead of rows @colin-ho (#3457)
- feat: cross join @kevinzwang (#3437)
- [FEAT] connect: remove excessive warnings from spark connect @universalmind303 (#3499)
- [CHORE] connect, test:
df.withColumn
@andrewgazelka (#3359) - [FEAT]: expr simplifier @universalmind303 (#3393)
- [FEAT] shuffle testing @raunakab (#3492)
- [FEAT]: add
coalesce
to dataframe and SQL @universalmind303 (#3482) - [FEAT] add register-table helper to sql-catalog @chuanlei-coding (#2837)
- [FEAT] Respect resource request for projections in swordfish @colin-ho (#3460)
- [FEAT] Enable Actor Pool UDFs by default @kevinzwang (#3488)
- [FEAT] connect: add modulus operator and withColumns support @andrewgazelka (#3351)
- [FEAT] connect: createDataFrame @andrewgazelka (#3363)
- [FEAT] Support parquet RLE decoding for booleans @desmondcheongzx (#3477)
- [FEAT] Cap parallelism on local parquet reader @colin-ho (#3310)
- [FEAT] connect: add binary operators @andrewgazelka (#3350)
- [FEAT] connect: support basic column operations @andrewgazelka (#3362)
- [FEAT] extend
build-commit
workflow to support different compile-archs @raunakab (#3459) - [FEAT] Add
count-distinct
aggregation @raunakab (#3455)
🐛 Bug Fixes
- fix(udf): udf call with empty table and batch size @kevinzwang (#3604)
- fix: use arrow's schema instead of spark's for local rel @universalmind303 (#3602)
- fix: guard concurrent extension datatype setting with a lock @jaychia (#3589)
- fix(parquet): Fix parquet reads of required fields nested within optional fields @desmondcheongzx (#3598)
- fix: boolean and/or expressions with null @kevinzwang (#3544)
- fix(run-cluster-workflow): Add null check when parsing metadata @raunakab (#3507)
- fix(tpcds): fix bugs in tpcds datagen script @universalmind303 (#3495)
- [BUG] Fix build commit workflow @raunakab (#3487)
- [BUG]: dont panic on count(distinct) @universalmind303 (#3481)
- [BUG] Block on parquet schema future in estimate_size_bytes @colin-ho (#3484)
🚀 Performance
- perf: filter null join key optimization rule @kevinzwang (#3583)
- perf: lazily import pyiceberg and unity catalog if available @jaychia (#3565)
♻️ Refactor
- refactor: allow InMemory to take in non python based entries @universalmind303 (#3554)
- refactor: create a rust based
PartitionSet
@universalmind303 (#3515) - refactor(swordfish): Generic broadcast state bridge @colin-ho (#3508)
📖 Documentation
- docs: update tpch benchmark link @ccmao1130 (#3542)
- docs: Enable Linting of docstrings @samster25 (#3506)
- [FEAT] Enable Actor Pool UDFs by default @kevinzwang (#3488)
✅ Tests
- test(connect): add more tests for
createDataFrame
@andrewgazelka (#3607) - test: Add more size estimation tests from our s3 bucket @jaychia (#3514)
👷 CI
- ci: Always download logs @jaychia (#3588)
- ci: Add ability to array-ify args and run multiple jobs @raunakab (#3584)
- ci: Add "build" label type to accepted PR titles @raunakab (#3541)
- ci: add a tool to launch workloads on cluster @jaychia (#3516)
- ci(release-drafter): use conventional commit labels @andrewgazelka (#3503)
🔧 Maintenance
- chore!: upgrade Ray pins and pyarrow pins @jaychia (#3612)
- chore: add warning for native runner @jaychia (#3613)
- chore!: drop support for Python 3.8 @kevinzwang (#3592)
- chore!: remove pyarrow-based file reader @kevinzwang (#3587)
- chore: Fix ordering in sql tests + pin docker images in read_sql tests @colin-ho (#3596)
- chore: move symbolic and boolean algebra code into new crate @kevinzwang (#3570)
- [CHORE] use conventional commits @andrewgazelka (#3493)
- [CHORE] connect, test:
df.withColumn
@andrewgazelka (#3359) - [CHORE] Add tests for parquet size estimations @jaychia (#3405)
- [CHORE] Move all python wrapping logic to separate module @raunakab (#3458)
Full Changelog: v0.3.15...v0.3.16
v0.3.15
Changes
✨ New Features
- [FEAT] run cluster on commit @raunakab (#3461)
- [FEAT]: Support
.clip
function @conradsoon (#3136) - [FEAT] Add cluster profiles @raunakab (#3426)
- [FEAT] add pyiceberg 0.8.0 support @rongfengliang (#3448)
- [FEAT] migrate schema inference → async, block at py boundary @andrewgazelka (#3432)
- [CHORE] connect:
df.schema
@andrewgazelka (#3353) - [CHORE] connect test:
df.get_attr
@andrewgazelka (#3349) - [FEAT] Get native execution enablement from DAFT_RUNNER @desmondcheongzx (#3409)
- [FEAT] Add ability to download log files from ray-cluster @raunakab (#3406)
- [FEAT] Add ability to run arbitrary command on a set working directory @raunakab (#3404)
- [FEAT] Add steps to spin up, submit job, and spin down ray clusters @raunakab (#3403)
- [CHORE] connect: add tests for
df.take()
method @andrewgazelka (#3385) - [FEAT] Create new run workflow @raunakab (#3402)
- [FEAT] Enable group by keys in aggregation expressions @kevinzwang (#3399)
- [FEAT] Build release python wheels and upload to AWS S3 @raunakab (#3398)
- [FEAT] connect: Add support for
select
@andrewgazelka (#3344) - [FEAT] connect: add
df.limit
anddf.first
@andrewgazelka (#3309) - [FEAT] connect:
to_daft_*
use ref instead of value @andrewgazelka (#3355) - [FEAT] connect: add alias support @andrewgazelka (#3342)
- [FEAT] Filter predicates in SQL join @kevinzwang (#3371)
- [FEAT] connect: collect @andrewgazelka (#3326)
🚀 Performance Improvements
- [PERF] Improve hash table probe side decisions for Swordfish @desmondcheongzx (#3327)
👾 Bug Fixes
- [BUG] Fix extension type display @jaychia (#3456)
- [BUG] Remove enum imports from match statements @raunakab (#3436)
- [BUG] Explicitly set IO config in unity catalog load table @colin-ho (#3453)
- [BUG] Include storage options in lance write commit @colin-ho (#3451)
- [BUG] Replace semicolons in filenames with underscore @raunakab (#3430)
- [BUG] Terminate nodes instead of stopping them @raunakab (#3427)
- [BUG] Fix run-cluster passing in environment variables wrongly @jaychia (#3422)
📖 Documentation
- [FEAT]: Support
.clip
function @conradsoon (#3136) - [DOCS] Shorten union of Literals @desmondcheongzx (#3449)
- [DOCS] Add missing list expression entries @desmondcheongzx (#3428)
🧰 Maintenance
- [CHORE] Add warning in PyRunner to switch to Native @colin-ho (#3472)
- [CHORE] Address comments on previous PR @raunakab (#3473)
- [CHORE] Write tpch parquet files one at a time @colin-ho (#3396)
- [CHORE] Remove CountMode and ResourceRequest from public API @desmondcheongzx (#3429)
- [CHORE] Add schemas for remaining local plan ops @colin-ho (#3446)
- [CHORE] Put empty table when building probe table @colin-ho (#3445)
- [CHORE] Explain block_on function in common-runtime @colin-ho (#3442)
- [CHORE] connect:
df.schema
@andrewgazelka (#3353) - [CHORE] Update execution config to turn on Ray tracing @jaychia (#3431)
- [CHORE] connect test:
df.get_attr
@andrewgazelka (#3349) - [CHORE] Cleanup ExprResolver @kevinzwang (#3401)
- [CHORE] connect: add tests for
df.take()
method @andrewgazelka (#3385) - [CHORE] Change IOConfig to be serialized into binary instead of JSON @kevinzwang (#3400)
- [CHORE] Pin PyIceberg version to <0.8 @kevinzwang (#3391)
- [CHORE] Add TPC-H queries in SQL @kevinzwang (#3392)
- [CHORE] connect: Optimize plans in connect @colin-ho (#3378)
- [CHORE] delete empty file xyz @andrewgazelka (#3370)
⬆️ Dependencies
14 changes
- Bump orjson from 3.10.11 to 3.10.12 @dependabot (#3464)
- Bump grpcio from 1.67.0 to 1.68.1 @dependabot (#3465)
- Bump arrow-buffer from 51.0.0 to 53.3.0 @dependabot (#3467)
- Bump regex-syntax from 0.7.5 to 0.8.4 @dependabot (#3468)
- Bump memmap2 from 0.9.4 to 0.9.5 @dependabot (#3470)
- Bump image from 0.25.4 to 0.25.5 @dependabot (#3471)
- Bump bytes from 1.7.1 to 1.8.0 @dependabot (#3411)
- Bump astral-sh/setup-uv from 3 to 4 @dependabot (#3410)
- Bump serde_json from 1.0.124 to 1.0.133 @dependabot (#3413)
- Bump sample-arrow2 from 0.1.0 to 0.17.2 @dependabot (#3414)
- Bump chrono-tz from 0.8.6 to 0.10.0 @dependabot (#3415)
- Bump azure-storage-blob from 12.17.0 to 12.24.0 @dependabot (#3416)
- Bump opencv-python from 4.8.1.78 to 4.10.0.84 @dependabot (#3417)
- Bump sqlalchemy from 2.0.25 to 2.0.36 @dependabot (#3418)
v0.3.14
Changes
✨ New Features
- [FEAT]: sql HAVING @universalmind303 (#3364)
- [FEAT] consolidate Spark session fixture into conftest.py @andrewgazelka (#3341)
- [FEAT]: allow for implicit coercion between str & date @universalmind303 (#3337)
- [FEAT] daft-connect range use python generator @andrewgazelka (#3308)
- [FEAT] Monotonically Increasing Id for Swordfish @colin-ho (#3180)
- [FEAT] Support for correlated subqueries in SQL (not yet executable) @kevinzwang (#3304)
- [FEAT]: SQL read_csv @itzhakstern (#3255)
- [FEAT] Daft Catalog API @jaychia (#3036)
- [FEAT]: allow
is_in
to take inVec<Expr>
instead ofExpr
@universalmind303 (#3294) - [FEAT] Lance writes for swordfish @colin-ho (#3299)
- [FEAT] Support for aggregation expressions that use multiple AggExprs @kevinzwang (#3296)
- [FEAT] SQL union/union all and sql intersect @universalmind303 (#3274)
👾 Bug Fixes
- [BUG] Implement deserialize for Python objects serialized as sequences @kevinzwang (#3339)
- [BUG]: tbl alias with join @universalmind303 (#3333)
- [BUG] Fixes regexp_replace expression @ConeyLiu (#3306)
- [BUG] Fix ray wait in RayPartitionSet @jaychia (#3251)
- [BUG] Partially qualified joins
join a.x = y
andjoin x = b.y
@universalmind303 (#3290) - [BUG] Check env in benchmarking script @colin-ho (#3297)
- [BUG] Fix writes for empty dataframes if target directory does not exist @colin-ho (#3278)
- [BUG]: panic in sql subquery @universalmind303 (#3291)
📖 Documentation
- [DOCS] Fix typo in limit example @colin-ho (#3303)
- [DOCS] Update incomplete SQL doc pages @willvo2004 (#3298)
🧰 Maintenance
- [CHORE]: prepare for nulls first/last kernels @universalmind303 (#3301)
- [CHORE] Fix join alias test @kevinzwang (#3335)
- Bump bytemuck from 1.16.3 to 1.19.0 @dependabot (#3171)
- [CHORE]: move utf8 functions from daft-dsl to daft-functions @ConeyLiu (#3101)
- [CHORE] Swordfish refactors @colin-ho (#3256)
- [CHORE]: better subquery handling @universalmind303 (#3295)
- [CHORE] Possibility to create environment with system installed uv @maruschin (#3281)
- [CHORE]: defer Expr subquery error until eval @universalmind303 (#3272)
- [CHORE] Fix style in workflow file @jaychia (#3284)
⬆️ Dependencies
12 changes
- Bump bytemuck from 1.16.3 to 1.19.0 @dependabot (#3171)
- Bump psycopg2-binary from 2.9.9 to 2.9.10 @dependabot (#3174)
- Bump codecov/codecov-action from 3 to 5 @dependabot (#3318)
- Bump slackapi/slack-github-action from 1.27.0 to 2.0.0 @dependabot (#3317)
- Bump lxml from 5.1.0 to 5.3.0 @dependabot (#3172)
- Bump moto[s3,server] from 5.0.2 to 5.0.21 @dependabot (#3312)
- Bump async-stream from 0.3.5 to 0.3.6 @dependabot (#3313)
- Bump unicode-normalization from 0.1.23 to 0.1.24 @dependabot (#3314)
- Bump tikv-jemallocator from 0.5.4 to 0.6.0 @dependabot (#3316)
- Bump sysinfo from 0.30.13 to 0.32.0 @dependabot (#3168)
- Bump pretty_assertions from 1.4.0 to 1.4.1 @dependabot (#3169)
- Bump lz4 from 1.26.0 to 1.28.0 @dependabot (#3170)
v0.3.13
Changes
✨ New Features
- [FEAT] Pre Shuffle Merge Strategy @colin-ho (#3191)
- [FEAT] Minimal indices dtype for FixedShapeSparseTensors @sagiahrac (#3149)
- [FEAT] implement range operation and data streaming @andrewgazelka (#3267)
- [FEAT] Support intersect as a DataFrame API @advancedxy (#3134)
- [FEAT] Adds a
read_generator
method that reads tables from a generator @colin-ho (#3258) - [FEAT] Add initial Spark Connect support @andrewgazelka (#3261)
👾 Bug Fixes
- [BUG] Always run telemetry codepath @jaychia (#3275)
- [BUG] Cleanup context side-effects @jaychia (#3270)
- [BUG]: bad merge from intersect PR @universalmind303 (#3273)
📖 Documentation
- [FEAT] Minimal indices dtype for FixedShapeSparseTensors @sagiahrac (#3149)
- [DOCS] Use an absolute path for the canonical link @desmondcheongzx (#3271)
- [DOCS] Set canonical link @desmondcheongzx (#3269)
🧰 Maintenance
- [CHORE] Allow manual launch of release-drafter.yml @jaychia (#3283)
- [CHORE] add daftrunner env to install and test step @colin-ho (#3279)
- [CHORE] Remove the concept of runner configs @jaychia (#3276)
- [CHORE] Expose read_sql partition bound strategy and default to min-max @colin-ho (#3246)
- [CHORE]: remove daft-table dependency from daft-logical-plan @universalmind303 (#3265)
- [CHORE] Remove daft-scan dependency from planning crates @kevinzwang (#3250)
v0.3.12
Changes
✨ New Features
- [FEAT]: Sql joins with duplicate cols @universalmind303 (#3241)
- [FEAT] Add tracing for runner @jaychia (#3113)
- [FEAT] add spark-connect protocol @andrewgazelka (#3189)
🚀 Performance Improvements
- [PERF] Harden GCP Retries @samster25 (#3253)
👾 Bug Fixes
- [BUG]: orderby with aggs @universalmind303 (#3190)
📖 Documentation
- [DOCS] Fix typo in write_parquet's parameters @desmondcheongzx (#3252)
- [DOCS] Changing docs for UDF @jaychia (#2880)
🧰 Maintenance
- [CHORE] implement mean and stddev for decimal @samster25 (#3159)
v0.3.11
Changes
✨ New Features
- [FEAT] Native Runner @colin-ho (#3178)
- [FEAT]: sql "extract" temporal function @universalmind303 (#3188)
🚀 Performance Improvements
- [PERF] Remove upfront buffer allocations for local CSV reader @desmondcheongzx (#3242)
📖 Documentation
- [FEAT] Native Runner @colin-ho (#3178)
- [DOCS] Update Iceberg roadmap on docs @kevinzwang (#3240)
- [FEAT]: sql "extract" temporal function @universalmind303 (#3188)
🧰 Maintenance
- [CHORE] Fix flaky test in test_decimal_to_decimal_cast @advancedxy (#3243)
- [CHORE] Split logical and physical plans into separate crates @kevinzwang (#3239)
v0.3.10
Changes
✨ New Features
- [FEAT] Overwrite mode for write parquet/csv @colin-ho (#3108)
- [FEAT] Support null equal safe join in SQL @advancedxy (#3166)
- [FEAT] Streaming Catalog Writes @colin-ho (#3160)
- [FEAT] Infer Azure storage account from uri @kevinzwang (#3165)
- [FEAT] Support null safe equal in joins @advancedxy (#3161)
- [FEAT] Support hive partitioned reads @desmondcheongzx (#3029)
- [FEAT] Add better detection of Ray Job environment @jaychia (#3148)
- [FEAT] Streaming physical writes for native executor @colin-ho (#2992)
- [FEAT]: Throw error for invalid ** usage outside folder segments (e.g. /tmp/**.csv) @conradsoon (#3100)
- [FEAT]: sql concat and stddev @universalmind303 (#3153)
- [FEAT]: Sql common table expressions (CTE's) @universalmind303 (#3137)
- [FEAT] enable decimal between @samster25 (#3154)
- [FEAT] dec128 math @samster25 (#3143)
- [FEAT] Support SQL
INTERVAL
@austin362667 (#3146) - [FEAT] Swordfish Stateful UDF support @kevinzwang (#3127)
- [FEAT]: sql cross join @universalmind303 (#3110)
- [FEAT] Add floor division @ConeyLiu (#3064)
- [FEAT] Compute pool for native executor @colin-ho (#2986)
🚀 Performance Improvements
- [PERF] Add a parallel local CSV reader @desmondcheongzx (#3055)
👾 Bug Fixes
- [BUG]: Sql groupby and orderby with aliases and projections @universalmind303 (#3177)
- [BUG] Separate PartitionTask done from results @jaychia (#3155)
- [BUG]: between panic on unsupported types @universalmind303 (#3150)
- [BUG] fix type widening for rem @samster25 (#3131)
📖 Documentation
- Temporal docs added to expressions.rst @sunaysanghani (#2487)
- [DOCS] Update banner on README.rst @ccmao1130 (#3130)
- [DOCS] Update Daft logo @ccmao1130 (#3129)
🧰 Maintenance
- [CHORE] Add tests for decimal casting @desmondcheongzx (#3179)
- [CHORE] Refactor RayRunner so that we can add tracing @jaychia (#3163)
- [CHORE] Swordfish specific test fixtures @colin-ho (#3164)
- [CHORE]: tpc-ds datagen @universalmind303 (#3103)
- [CHORE] Cancel tasks spawned on compute runtime @colin-ho (#3128)
- [CHORE] Enable debug in test profile @advancedxy (#3135)
- [FEATURE] add min_hash alternate hashers @andrewgazelka (#3052)
- [CHORE] (Revert:) Add rust cache to s3 build artifacts action @jaychia (#3147)
- [CHORE] Add rust cache to s3 build artifacts action @jaychia (#3144)
- [CHORE] Refactor shuffles to use a unified ShuffleExchange PhysicalPlan variant @jaychia (#3083)
⬆️ Dependencies
4 changes
- Bump orjson from 3.9.5 to 3.10.11 @dependabot (#3176)
- Bump adlfs from 2023.10.0 to 2024.7.0 @dependabot (#2547)
- Bump image from 0.24.9 to 0.25.4 @dependabot (#3088)
- Bump slackapi/slack-github-action from 1.26.0 to 1.27.0 @dependabot (#2776)
v0.3.9
Changes
✨ New Features
- [FEAT]: sql
IN
operator @universalmind303 (#3086) - [FEAT] Enable explode for swordfish @colin-ho (#3077)
- [FEAT]: add sql DISTINCT @universalmind303 (#3087)
- [FEAT] Enable concat for swordfish @colin-ho (#2976)
- [FEAT] Enable unpivot for swordfish @colin-ho (#3078)
- [FEAT] Outer joins for native executor @colin-ho (#2860)
- [FEAT] Enable pivot for swordfish @colin-ho (#3081)
- [FEAT] Enable sample for swordfish @colin-ho (#3079)
- [FEAT] Add stateful actor context and set CUDA_VISIBLE_DEVICES @kevinzwang (#3002)
- [FEAT]: sql tbl alias, and compount ident for joins @universalmind303 (#3066)
- [FEAT]: sql between @universalmind303 (#3062)
- [FEAT]: Interval dtype @universalmind303 (#3018)
- [FEAT] Enable to_json_string for physical plan @colin-ho (#3023)
- [FEAT]: Daft support for Azure storage for Unity Catalog
daft.read_deltalake
@anilmenon14 (#3025) - [FEAT] Iceberg MOR for streaming parquet @colin-ho (#2975)
- [FEAT] Include file paths as column from read_parquet/csv/json @colin-ho (#2953)
🚀 Performance Improvements
- [PERF] Remove stateful actor child materialization limit @kevinzwang (#3099)
👾 Bug Fixes
- [BUG] Bump up max_header_size @raunakab (#3068)
- [BUG] Autodetect AWS region during deltalake scan @kevinzwang (#3104)
- [BUG] Add over clause in read_sql percentile reads @colin-ho (#3094)
- [BUG] Disable Linux SSL CERT override @samster25 (#3098)
- [BUG] Fix into_partitions to use a more naive approach without materialization @jaychia (#3080)
- [BUG] Fix actor pool initialization in ray client mode @kevinzwang (#3028)
- [BUG]: joins with duplicate column names and qualified table expansion @universalmind303 (#3074)
- [BUG]: sql functions case sensitivit @universalmind303 (#3063)
- [BUG] Fix write_deltalake add action file path prefix @kevinzwang (#3053)
- [BUG] Fix intersection checking when unioning schemas @desmondcheongzx (#3039)
- [BUG] Sampling without replacement not working @colin-ho (#3035)
🧰 Maintenance
- [CHORE]: replace the
.venv
value with global variableVENV
@mohamedrezk122 (#3084) - [CHORE] Enable lancedb reads for native executor @colin-ho (#2925)
- [CHORE] Auto attach LLDB debugger to python #2940 @sagiahrac (#3020)
- [CHORE] Rename config.yaml to config.yml @samster25 (#3045)
- [CHORE] add config.yaml for issues @samster25 (#3044)
- [CHORE] validation on dropdown @samster25 (#3043)
- [CHORE] preserve quotes in yaml @samster25 (#3042)
- [CHORE] Checkbox for contribution @samster25 (#3041)
- [CHORE] update feature request @samster25 (#3040)