You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Let's investigate the level of abstraction on the write path.
Currently, we are doing schema-compatible checks, schema coercion, bin-packing, transformation, etc at different levels of the stack. It'll be good to optimize and see which functions can be pushed up the stack.
For example, here's what the overwrite path looks like
Feature Request / Improvement
Let's investigate the level of abstraction on the write path.
Currently, we are doing schema-compatible checks, schema coercion, bin-packing, transformation, etc at different levels of the stack. It'll be good to optimize and see which functions can be pushed up the stack.
For example, here's what the
overwrite
path looks like(copied over from #910 (review))
Another example #786 (comment)
More info
overwrite
checks schema compatibilityiceberg-python/pyiceberg/table/__init__.py
Lines 541 to 550 in 3f44dfe
_dataframe_to_data_files
bin-packs the pyarrow Tableiceberg-python/pyiceberg/io/pyarrow.py
Lines 2222 to 2225 in 3f44dfe
write_parquet
transforms table schemaiceberg-python/pyiceberg/io/pyarrow.py
Lines 2001 to 2008 in 3f44dfe
and
iceberg-python/pyiceberg/io/pyarrow.py
Lines 2011 to 2021 in 3f44dfe
The text was updated successfully, but these errors were encountered: