Add `top_p_size` step fn, `StepFunctionArgs` class #206

gsarti · 2023-07-26T10:36:01Z

Description

This PR adds the following capabilities to the Inseq library:

A new top_p_size_fn (identifier: "top_p_size") step function returning the number of tokens needed to reach probability p for a specific generation step (e.g. 5 with p=0.95 means that the top 5 tokens in the probability distribution over the vocabulary are needed to reach a CDF of 95%)
The kl_divergence step function now supports a new parameter top_p: float defined in [0,1] (default 1, full distribution) to preserve only tokens in the top p of either the original or the contrastive output distributions before computing the KL divergence between the two.

🔥 Breaking change: This PR introduces a new StepFunctionArgs to better structure the inputs to step function methods. This doesn't change anything in the usage of pre-registered functions, but functions that were previously registered with explicit default params (attribution_model, forward_output, encoder_input_ids, etc.) will now break, and should be converted to use StepFunctionArgs. The step function registration tutorial will be updated accordingly.

gsarti added 2 commits July 26, 2023 12:06

Add top-p step fn and refactor of step fn args

c88cc51

Fix args wrongly passed by reference

20c78ff

gsarti merged commit 5ad7a7d into main Jul 26, 2023
4 checks passed

gsarti deleted the step-fn-top-p branch July 26, 2023 11:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `top_p_size` step fn, `StepFunctionArgs` class #206

Add `top_p_size` step fn, `StepFunctionArgs` class #206

gsarti commented Jul 26, 2023

Add top_p_size step fn, StepFunctionArgs class #206

Add top_p_size step fn, StepFunctionArgs class #206

Conversation

gsarti commented Jul 26, 2023

Description

Add `top_p_size` step fn, `StepFunctionArgs` class #206

Add `top_p_size` step fn, `StepFunctionArgs` class #206