Constrained decoding fix #248

Open
mjpost opened this issue Mar 10, 2016 · 0 comments

mjpost commented Mar 10, 2016

Constrained decoding works by ensuring that LM state tuples match against the specified target side. This fails with KenLM, which maintains its own private state and doesn't expose the n-gram boundary words to the decoder (it uses the generic DP state mechanism instead). This could be fixed by always creating a dummy LM feature whenever constrained decoding is requested, which would do the needed state maintenance. That could be enabled at the sentence level, or via a decoder switch (meaning constrained decoding would only work when you've explicitly requested it, which might be a good thing).
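
Not a patch, just to make the idea concrete: a minimal sketch of what the dummy feature's state object might look like (all names here are hypothetical, not Joshua's actual FeatureFunction/DPState API). It carries the (n-1) boundary words itself, contributes no score, and exists only so that constrained decoding has an exposed state tuple to compare, independent of KenLM's private state.

```java
import java.util.Arrays;

/**
 * Hypothetical state object for a "dummy" LM feature. It maintains the
 * usual (n-1)-word target-side boundary state on each hypothesis so the
 * constrained decoder can match states against the reference, even when
 * the real LM (e.g., KenLM) keeps its state private.
 */
public class DummyLMState {
  private final String[] boundaryWords; // last (n-1) target words generated

  public DummyLMState(String[] boundaryWords) {
    this.boundaryWords = boundaryWords;
  }

  /** Extend the state with newly generated target words, keeping (order-1). */
  public DummyLMState extend(String[] newWords, int order) {
    int keep = order - 1;
    String[] combined = new String[boundaryWords.length + newWords.length];
    System.arraycopy(boundaryWords, 0, combined, 0, boundaryWords.length);
    System.arraycopy(newWords, 0, combined, boundaryWords.length, newWords.length);
    int from = Math.max(0, combined.length - keep);
    return new DummyLMState(Arrays.copyOfRange(combined, from, combined.length));
  }

  /** Constrained decoding compares states for exact boundary-word equality. */
  @Override
  public boolean equals(Object other) {
    return other instanceof DummyLMState
        && Arrays.equals(boundaryWords, ((DummyLMState) other).boundaryWords);
  }

  @Override
  public int hashCode() {
    return Arrays.hashCode(boundaryWords);
  }
}
```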

Another approach that might be easier would be to check edges against the input and block there, instead of checking states. However, the state constraint itself already has holes (there are cases where it can permit a hypothesis that doesn't exactly match the output), and I think edge-level blocking would be worse.
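
For comparison, a rough sketch of what edge-level blocking could look like (again with hypothetical names; the decoder's real hypergraph/edge classes differ). The check is deliberately loose, which illustrates the problem: without the surrounding state, an edge-level filter can't account for word order or position, so this direction likely admits even more mismatched hypotheses than state matching does.

```java
import java.util.List;

/**
 * Hypothetical edge-level filter for constrained decoding: reject an edge
 * outright if the terminal words it introduces cannot occur in the
 * reference, rather than waiting to compare LM states.
 */
public class EdgeConstraint {
  private final List<String> reference; // tokenized target-side constraint

  public EdgeConstraint(List<String> reference) {
    this.reference = reference;
  }

  /**
   * Permit the edge only if every terminal it adds appears somewhere in
   * the reference. Note this ignores order and position entirely, so it
   * still permits many hypotheses that don't match the reference.
   */
  public boolean permits(List<String> terminalWords) {
    return reference.containsAll(terminalWords);
  }
}
```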

mjpost self-assigned this Mar 10, 2016