How to decrease inference time of LiLT? #284

piegu · 2023-04-29T09:11:44Z

Hi,

I'm using Hugging Face libraries in order to run LiLT.
How can I decrease inference time? Which code to use?

I've already try BetterTransformer (Optimum) and ONNX but none of them accepts LiLT model.

BetterTransformer: NotImplementedError: The model type lilt is not yet supported to be used with BetterTransformer.
ONNX: KeyError: "lilt is not supported yet.

Thank you.

Note: I asked this question here, too: jpWang/LiLT#42

The text was updated successfully, but these errors were encountered:

piegu · 2023-05-02T09:43:10Z

Issue opened in the Optimum library: huggingface/optimum#1024

bkocis · 2023-06-27T07:53:42Z

Have you considered making a smaller model? What is your model size?

NielsRogge · 2023-07-03T08:38:34Z

One thing you can try (especially if you're using a multilingual model like https://huggingface.co/nielsr/lilt-xlm-roberta-base), then you can remove token embeddings of tokens of languages that you don't need.

See this blog post for more info: https://medium.com/@coding-otter/reduce-your-transformers-model-size-by-removing-unwanted-tokens-and-word-embeddings-eec08166d2f9

piegu changed the title ~~How to improve inference time of LiLT?~~ How to decrease inference time of LiLT? Apr 30, 2023

piegu mentioned this issue Apr 30, 2023

How to decrease inference time of LiLT? jpWang/LiLT#42

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to decrease inference time of LiLT? #284

How to decrease inference time of LiLT? #284

piegu commented Apr 29, 2023 •

edited

Loading

piegu commented May 2, 2023

bkocis commented Jun 27, 2023

NielsRogge commented Jul 3, 2023 •

edited

Loading

How to decrease inference time of LiLT? #284

How to decrease inference time of LiLT? #284

Comments

piegu commented Apr 29, 2023 • edited Loading

piegu commented May 2, 2023

bkocis commented Jun 27, 2023

NielsRogge commented Jul 3, 2023 • edited Loading

piegu commented Apr 29, 2023 •

edited

Loading

NielsRogge commented Jul 3, 2023 •

edited

Loading