Set default value of avoid_duplicate_runs to false for run_model_on_task #1145

chadmarchand · 2022-06-26T02:23:51Z

Reference Issue

What does this PR implement/fix? Explain your changes.

Sets the default value of avoid_duplicate_runs in the run_model_on_task function to False. When true, this option avoids running an experiment that already exists on OpenML, but this requires an API key. This change means that an API key is not required unless explicitly setting avoid_duplicate_runs to true.

How should this PR be tested?

Tested that without an API key, the following code block does not return a 401 error any more after this change has been made.

from sklearn import ensemble
from openml import tasks, runs

clf = ensemble.RandomForestClassifier()
task = tasks.get_task(3954)
run = runs.run_model_on_task(clf, task)

I don't believe that an automated test should be required as we are just changing a default value and not changing any implementation, but please let me know if otherwise and I will add a test.

PGijsbers · 2023-02-21T08:47:21Z

Thanks for putting in the effort. After some more thought we have decided that instead we simply want to issue a warning if the user has avoid_duplicate set to true when no apikey is set. In the future we expect flow/exists to be callable without authentication and then we can remove the warning.

Set default value of avoid_duplicate_runs to false

eaa2ff6

chadmarchand mentioned this pull request Jun 26, 2022

Pre-commit hook fails due to import error #1146

Closed

PGijsbers closed this Feb 21, 2023

PGijsbers mentioned this pull request Feb 21, 2023

Add warning to run_model_on_task with avoid duplicates if no authentication #1210

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Set default value of avoid_duplicate_runs to false for run_model_on_task #1145

Set default value of avoid_duplicate_runs to false for run_model_on_task #1145

chadmarchand commented Jun 26, 2022

PGijsbers commented Feb 21, 2023

Set default value of avoid_duplicate_runs to false for run_model_on_task #1145

Set default value of avoid_duplicate_runs to false for run_model_on_task #1145

Conversation

chadmarchand commented Jun 26, 2022

Reference Issue

What does this PR implement/fix? Explain your changes.

How should this PR be tested?

PGijsbers commented Feb 21, 2023