refactor: Do one prediction per input sequence, easier experimentation #27

carlosgjs · 2024-01-24T18:56:09Z

The prediction/evaluation interfaces used List[List[str]] as the prediction type to account for two things: (a) batch inference, i.e. generating docs for multiple code snippets in one pass, and (b) generating multiple predictions per input (sampling). Since we've now decided to do deterministic prediction (temp=0), there's no need for the second level.

This PR changes the prediction return type to List[str], which also makes the code simpler.
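To illustrate the shape change, here is a minimal sketch. The names `predict`, `generate_one`, and `flatten_legacy` are hypothetical and are not taken from this repository; the model call is replaced by a plain callable so the snippet runs on its own.

```python
from typing import Callable, List


def predict(snippets: List[str], generate_one: Callable[[str], str]) -> List[str]:
    """Return exactly one prediction per input snippet.

    `generate_one` is a hypothetical stand-in for a deterministic
    (temp=0) model call; it is an assumption, not the project's API.
    """
    return [generate_one(snippet) for snippet in snippets]


def flatten_legacy(nested: List[List[str]]) -> List[str]:
    """Adapt old-style List[List[str]] output to List[str].

    Keeps the first sample for each input, which is one plausible way
    to migrate stored results; other policies are possible.
    """
    return [samples[0] for samples in nested]
```

With a single level, callers can zip inputs and predictions directly, for example `for code, doc in zip(snippets, predict(snippets, generate_one))`, instead of indexing into an inner list.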

Additional changes:

- Replace the max_length parameter with max_new_tokens
- Break up the eval function into eval_promp for easier prompt experimentation
- Update the generate.ipynb notebook accordingly
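The practical difference between the two length parameters can be sketched as follows. This assumes they follow the semantics of Hugging Face transformers' `generate` for decoder-only models, where max_length bounds prompt plus generated tokens and max_new_tokens bounds only the generated tokens; the PR itself does not state which library is used. The helper `new_token_budget` is hypothetical and exists only for illustration.

```python
from typing import Optional


def new_token_budget(
    prompt_len: int,
    max_length: Optional[int] = None,
    max_new_tokens: Optional[int] = None,
) -> int:
    """Number of tokens the model may still generate (illustrative only).

    Assumed semantics:
    - max_new_tokens: budget is independent of the prompt length.
    - max_length: budget is what remains after the prompt is counted.
    """
    if max_new_tokens is not None:
        return max_new_tokens
    if max_length is not None:
        # A long prompt can consume the whole budget, leaving nothing to generate.
        return max(0, max_length - prompt_len)
    raise ValueError("set either max_length or max_new_tokens")
```

Under these assumptions, max_new_tokens gives every input the same generation budget regardless of how long the code snippet in the prompt is, which is likely why it suits prompt experimentation better.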

codecov-commenter · 2024-01-24T19:00:37Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (10294bc) 97.00% compared to head (6f2c5da) 97.17%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #27      +/-   ##
==========================================
+ Coverage   97.00%   97.17%   +0.16%     
==========================================
  Files           3        3              
  Lines         167      177      +10     
==========================================
+ Hits          162      172      +10     
  Misses          5        5


refactor: Do one prediction per input sequence, easier experimentation

6f2c5da

carlosgjs requested a review from anujsinha3 January 24, 2024 18:56

anujsinha3 approved these changes Jan 24, 2024

carlosgjs merged commit 3c7e0a0 into main Jan 25, 2024
9 checks passed

carlosgjs deleted the carlosg/onepred branch January 25, 2024 01:06
