You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
One thing you can try (especially if you're using a multilingual model like https://huggingface.co/nielsr/lilt-xlm-roberta-base), then you can remove token embeddings of tokens of languages that you don't need.
Hi,
I'm using Hugging Face libraries in order to run
LiLT
.How can I decrease inference time? Which code to use?
I've already try
BetterTransformer
(Optimum
) andONNX
but none of them acceptsLiLT
model.NotImplementedError: The model type lilt is not yet supported to be used with BetterTransformer.
KeyError: "lilt is not supported yet.
Thank you.
Note: I asked this question here, too: jpWang/LiLT#42
The text was updated successfully, but these errors were encountered: