-
Notifications
You must be signed in to change notification settings - Fork 147
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RuntimeError: CUDF failure at: /__w/cudf/cudf/cpp/src/io/parquet/reader_impl_helpers.cpp:379: Invalid rowgroup index[BUG] #756
Comments
@Oussamakhammassi can you please tell us how did you install transformers4rec? are you using Please also start with https://github.com/NVIDIA-Merlin/Transformers4Rec/tree/main/examples/getting-started-session-based examples since the tutorial nbs have not been updated recently. |
Hi rnyak! pip install transformers4rec[nvtabular] No i'm not using merlin-pytorch image |
@Oussamakhammassi I'd recommend you to use docker image. Installing only if you want to install via pip you need to install rapids cudf and dask_cudf first (please see their doc here: https://docs.rapids.ai/install) and then install other Merlin libs as well:
|
Yess i did all that but still don't work! |
@Oussamakhammassi you need a compatible GPU and properly installed cuda driver to be able to import and use cudf library. what's your GPU specs? can you share the prints out of |
@Oussamakhammassi also can you please run this example notebooks first? https://github.com/NVIDIA-Merlin/Transformers4Rec/tree/main/examples/getting-started-session-based |
For the version, here's the output: For the example that you've sent to me, yes i did run it and it works well but i don't know why the other examples have this error |
Wed Nov 8 15:51:59 2023 +-----------------------------------------------------------------------------+ |
i worked with this, example notebooks are working fine, but when i run with custom data, it throws this error with i call trainer.evaluate() method. |
@Bharathjpv please share your error, and a reproducible toy example. we need to see what you are doing in your NVT and model training and eval pipeline to help you. thanks. |
Tried to run the tutorial of transformers4rec and i got this error
RuntimeError Traceback (most recent call last)
in
/usr/local/lib/python3.10/dist-packages/transformers/trainer.py in evaluate(self, eval_dataset, ignore_keys, metric_key_prefix)
3005 self._memory_tracker.start()
3006
-> 3007 eval_dataloader = self.get_eval_dataloader(eval_dataset)
3008 start_time = time.time()
3009
16 frames
/usr/local/lib/python3.10/dist-packages/cudf/io/parquet.py in _read_parquet(filepaths_or_buffers, engine, columns, row_groups, use_pandas_metadata, *args, **kwargs)
819 f"following positional arguments: {list(args)}"
820 )
--> 821 return libparquet.read_parquet(
822 filepaths_or_buffers,
823 columns=columns,
parquet.pyx in cudf._lib.parquet.read_parquet()
parquet.pyx in cudf._lib.parquet.read_parquet()
RuntimeError: CUDF failure at: /__w/cudf/cudf/cpp/src/io/parquet/reader_impl_helpers.cpp:379: Invalid rowgroup index
The text was updated successfully, but these errors were encountered: