You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Feb 18, 2024. It is now read-only.
> RUST_BACKTRACE=1 cargo run --release --example parquet_read test.parquet 0 0
Finished release [optimized] target(s) in 0.12s
Running `target/release/examples/parquet_read test.parquet 0 0`
thread 'main' panicked at 'index out of bounds: the len is 1000 but the index is 1000', /home/vincev/arrow2/src/io/parquet/read/primitive/basic.rs:67:40
stack backtrace:
0: rust_begin_unwind
at /rustc/a178d0322ce20e33eac124758e837cbd80a6f633/library/std/src/panicking.rs:515:5
1: core::panicking::panic_fmt
at /rustc/a178d0322ce20e33eac124758e837cbd80a6f633/library/core/src/panicking.rs:92:14
2: core::panicking::panic_bounds_check
at /rustc/a178d0322ce20e33eac124758e837cbd80a6f633/library/core/src/panicking.rs:69:5
3: arrow2::io::parquet::read::primitive::basic::extend_from_page
4: arrow2::io::parquet::read::primitive::iter_to_array
5: arrow2::io::parquet::read::page_iter_to_array
6: parquet_read::main
If I change the generator to produce 500 rows then the read works well.
I am using latest master version:
> git rev-parse HEAD
77650672233bd7bbb9839a2a616f11ebffa15807
The text was updated successfully, but these errors were encountered:
The
parquet_read
example is unable to read a parquet file generated with Pandas.To reproduce the problem I run the following script:
This generates the following output on my box:
> python gen.py Pandas version: 1.3.2 Numpy version: 1.21.2 Pyarrow version: 5.0.0 Wrote 1000 rows
Then I run
parquet_read
:If I change the generator to produce 500 rows then the read works well.
I am using latest master version:
The text was updated successfully, but these errors were encountered: