Hi @aishwaryap,

I was reading about the recent code changes reported in #10, and unfortunately we get results that differ substantially from yours. I started dissecting the code to understand the reason for such discrepancies. From my understanding of the inference_runner.py script, you spawn several processes, each handling a portion of the tasks. However, I can see that the exception handling logic simply ignores an instance that raises an error: https://github.com/emma-simbot/teach/blob/speaker_tokens/src/teach/inference/inference_runner.py#L130

This is detrimental: if a dataset instance errors for whatever reason, its contribution to the overall metrics is silently dropped. The proper way to deal with this would be to skip that trajectory but still record the failure in the metrics. Ideally, such faulty trajectories would also be reported in the metrics file for future debugging, along the lines of the sketch below.
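A minimal sketch of what I mean; `run_instance`, `instances`, and the metrics layout are hypothetical placeholders, not the actual TEACh API:

```python
def run_instances(instances, run_instance):
    """Run every instance; count failures instead of dropping them."""
    metrics = {"success": {}, "failed": {}}
    for instance_id, instance in instances.items():
        try:
            metrics["success"][instance_id] = run_instance(instance)
        except Exception as exc:
            # The trajectory is skipped, but the failure still counts
            # toward the overall metrics and is kept for debugging.
            metrics["failed"][instance_id] = repr(exc)
    return metrics
```

This way the denominator of the success rate stays the full task set, and the metrics file lists exactly which trajectories errored.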
Am I missing something?
There is a separation here between possible errors from the model and errors from the rest of the inference code. Most of the error handling is inside InferenceRunner._run_edh_instance (here), where exceptions from invocations of the model should get logged in the metrics. Are you seeing specific errors at other places that are not getting caught?
That said, we should probably return the list of failed files from InferenceRunner._run() for debugging. I will work on this on Monday.
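Roughly like this (a sketch only; the argument names and surrounding class code are assumptions, not the current implementation):

```python
class InferenceRunner:
    # ... existing setup and _run_edh_instance elided ...

    def _run(self, instance_files):
        """Run all instances, returning the files that failed."""
        failed_files = []
        for instance_file in instance_files:
            try:
                self._run_edh_instance(instance_file)
            except Exception:
                # Collect errored instances instead of dropping them,
                # so they can be reported for debugging and re-runs.
                failed_files.append(instance_file)
        return failed_files
```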
For leaderboard evaluations, we are separately confirming that all inference files did get saved.