[Python] Converting data frame to Table with large nested column fails Invalid Struct child array has length smaller than expected
#32440
Labels
Invalid Struct child array has length smaller than expected
#32440
Hey,
I have a data frame for which one column is a nested struct array. Converting it to a
pyarrow.Table
fails if the data frame gets too big. I could reproduce the bug with a minimal example with anonymized data that is roughly similar to mine. When I set, e.g.,N_ROWS=500_000
, or smaller, it is working fine.The length is always smaller than expected by 1.
Expected behavior:
Run without errors or fail with a better error message.
System Info and Versions:
Apple M1 Pro but also happened on amd64 Linux machine on AWS
I could also reproduce with
pyarrow 8.0.0
Reporter: Simon Weiß
Related issues:
Invalid Struct child array has length smaller than expected
(is a clone of)Note: This issue was originally created as ARROW-17138. Please see the migration documentation for further details.
The text was updated successfully, but these errors were encountered: