Target prefix-constrained generation #172

gsarti · 2023-04-19T11:20:39Z

Description

This PR introduces a generate_from_target_prefix argument in the model.attribute method and the inseq attribute CLI. It allows a pre-specified generated_text to be used as a prefix for the model to complete, rather than being treated as a finished generation.

This functionality relies on the generate function in 🤗 transformers and is restricted to encoder-decoder models, since for decoder-only models the prefix is already controlled through the input_text parameter. The same behavior could already be achieved in inseq v0.4 by passing explicit decoder_input_ids in generation_args.

Current behavior:

>>> import inseq

>>> model = inseq.load_model("Helsinki-NLP/opus-mt-en-it", "saliency")
>>> prefix_ids = model.encode("Hey mondo!", as_targets=True)
>>> out = model.attribute(
...     "Hello world! My name is Inseq",
...     generation_args={"decoder_input_ids": prefix_ids.input_ids}
... )
>>> out.info["generated_texts"]
['Hey mondo! mi chiamo Inseq.']

New behavior using generate_from_target_prefix:

>>> import inseq

>>> model = inseq.load_model("Helsinki-NLP/opus-mt-en-it", "saliency")
>>> out = model.attribute(
...    "Hello world! My name is Inseq",
...    "Hey mondo!",
...    generate_from_target_prefix=True
... )
>>> out.info["generated_texts"]
['Hey mondo! mi chiamo Inseq.']

The new argument also works with batched inputs. Setting a target prefix to an empty string performs fully unconstrained generation for that input, as if no prefix were provided:

>>> import inseq

>>> model = inseq.load_model("Helsinki-NLP/opus-mt-en-it", "saliency")
>>> src = ["Hello world! My name is Inseq", "I love bugging around code all day"]
>>> prefix = ["Hey mondo!", ""]
>>> out = model.attribute(src, prefix, generate_from_target_prefix=True)
>>> out.info["generated_texts"]
['Hey mondo! mi chiamo Inseq', 'Adoro le cimici intorno al codice tutto il giorno']
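
For intuition about what "generating from a target prefix" means, here is a minimal toy sketch of prefix-constrained greedy decoding. It is not the inseq or 🤗 transformers implementation: generate_from_prefix, toy_next_token and the TOY_BIGRAMS table are hypothetical names made up for illustration, and the bigram lookup merely stands in for a real decoder. The point is that the decoder state is seeded with the forced prefix before decoding starts, and that an empty prefix reduces to ordinary unconstrained generation.

```python
from typing import Callable, List, Sequence

BOS, EOS = "<s>", "</s>"


def generate_from_prefix(
    next_token: Callable[[Sequence[str]], str],
    prefix: Sequence[str] = (),
    max_new_tokens: int = 20,
) -> List[str]:
    """Greedy decoding that continues from a forced target prefix."""
    # Seed the decoder with BOS plus the forced prefix tokens, so every
    # newly generated token is conditioned on the prefix.
    tokens = [BOS, *prefix]
    for _ in range(max_new_tokens):
        tok = next_token(tokens)
        if tok == EOS:
            break
        tokens.append(tok)
    return tokens[1:]  # drop BOS, keep prefix + continuation


# Hypothetical stand-in for a decoder: the next token depends only on the
# last token, via a fixed lookup table (an assumption for this toy example).
TOY_BIGRAMS = {
    BOS: "Ciao",
    "Ciao": "mondo!",
    "Hey": "mondo!",
    "mondo!": "mi",
    "mi": "chiamo",
    "chiamo": "Inseq",
    "Inseq": EOS,
}


def toy_next_token(tokens: Sequence[str]) -> str:
    return TOY_BIGRAMS.get(tokens[-1], EOS)


# Forced prefix: the continuation is appended to "Hey mondo!".
constrained = generate_from_prefix(toy_next_token, ["Hey", "mondo!"])
# Empty prefix: identical to plain unconstrained greedy decoding.
unconstrained = generate_from_prefix(toy_next_token, [])
print(" ".join(constrained))
print(" ".join(unconstrained))
```

In the toy table, the forced prefix changes only the start of the output, while the continuation is whatever the stand-in decoder would have produced after those tokens.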

Type of Change

🚀 New feature (non-breaking change which adds functionality)

Added generate_from_target_prefix to attribute and CLI

8e21da7

gsarti merged commit a4a43e2 into main Apr 19, 2023

gsarti deleted the gen-from-prefix branch April 19, 2023 11:34

gsarti added a commit that referenced this pull request Apr 19, 2023

Merge remote-tracking branch 'origin/main' into value-zeroing

ea30e50

* origin/main: Target prefix-constrained generation (#172)
