Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Drop usage of ARFF files in favor of parquet #637

Open
PGijsbers opened this issue Sep 14, 2024 · 0 comments
Open

Drop usage of ARFF files in favor of parquet #637

PGijsbers opened this issue Sep 14, 2024 · 0 comments
Labels
data For issues with datasets in the current benchmark enhancement New feature or request

Comments

@PGijsbers
Copy link
Collaborator

OpenML will deprecate the use of ARFF files in the near future, and at some point new datasets will not be available in ARFF files. I think this is a good reason to drop ARFF support altogether, as the format is in general not really used that much anymore.
Benefits of only using parquet is faster downloads, less disk space, and (for us) less code to maintain.

@PGijsbers PGijsbers added enhancement New feature or request data For issues with datasets in the current benchmark labels Sep 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
data For issues with datasets in the current benchmark enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant