
Why does it take so long to process a 1 minute video? #110

Closed
Root-FTW opened this issue Aug 16, 2023 · 3 comments

@Root-FTW

How can I make it run faster? It takes too long for a 1-minute video.

(screenshot attached)

I am using this CLI command:
whisper_timestamped --accurate video.mp4 --model large-v1 --output_format srt --vad False --device "cuda:0" --output_dir .

My PC:

Windows 11 PRO
GPU: GTX 1080 TI
CPU: i9-9900k
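
For what it's worth, most of the runtime here likely comes from the --accurate shortcut, which (per the project README) turns on beam search and best-of sampling. A minimal sketch of the same job through the Python API with plain greedy decoding, assuming the load_audio/load_model/transcribe interface documented in the whisper_timestamped README and that the beam_size/best_of options can be passed through:

    import whisper_timestamped as whisper

    # Decode the audio track (ffmpeg handles the mp4 container) and load the model on the GPU.
    audio = whisper.load_audio("video.mp4")
    model = whisper.load_model("large-v1", device="cuda:0")

    # Greedy decoding: no beam search and no best-of sampling, which is the
    # main speed lever compared to the --accurate shortcut.
    result = whisper.transcribe(model, audio, beam_size=None, best_of=None)

    print(result["text"])

Switching to a smaller checkpoint (medium or small) cuts the runtime further if the accuracy is still acceptable.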

@Jeronymous
Member

How long does it take? Dozens of minutes?

That is weird. Is there a chance you can share the video?
Is the transcription (almost) correct?

@IntendedConsequence

Something doesn't feel right. When I run with --model large-v3, VRAM shoots up to 10GB and everything is very slow. And yet, running the large-v3 model in whisper.cpp, or using the vanilla transformers pipeline (as in this notebook: https://huggingface.co/spaces/hf-audio/whisper-large-v3/blob/main/whisper_notebook.ipynb), both only use about 4GB of VRAM and are noticeably faster.
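
For reference, a minimal sketch of that transformers pipeline path, roughly what the linked notebook does, assuming a CUDA device and float16 weights:

    import torch
    from transformers import pipeline

    # Whisper large-v3 through the vanilla transformers ASR pipeline,
    # loaded in float16 on the GPU.
    pipe = pipeline(
        "automatic-speech-recognition",
        model="openai/whisper-large-v3",
        torch_dtype=torch.float16,
        device="cuda:0",
    )

    # Long-form audio is processed in 30-second chunks; timestamps are
    # returned per chunk.
    result = pipe("video.mp4", chunk_length_s=30, return_timestamps=True)
    print(result["text"])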

@Jeronymous
Member

Thanks for spotting that, @IntendedConsequence.

Unfortunately, the high VRAM consumption comes from the openai-whisper package itself 😬
In openai/whisper#1670 it is reported starting from version 20230918, but I tried earlier versions (back to 20230124) and saw the same thing: openai-whisper always hits ~10GB of VRAM for the large models.
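
A quick way to confirm that the peak usage comes from openai-whisper itself, without the whisper_timestamped wrapper, is to measure it directly; a minimal sketch, assuming a CUDA device and an openai-whisper version that ships the large-v3 checkpoint:

    import torch
    import whisper  # plain openai-whisper, no whisper_timestamped wrapper

    torch.cuda.reset_peak_memory_stats()

    model = whisper.load_model("large-v3", device="cuda")
    result = model.transcribe("video.mp4")

    # Peak allocated VRAM in GB; the reports above put this around ~10 GB
    # for the large models.
    print(torch.cuda.max_memory_allocated() / 1e9)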
