Memory requirements? #5
-
I attempted to run whisper on an audio file using the medium model, and I got this:
It eventually ran out of memory (this machine has 8 GB) and was killed by the OOM killer. Would it be possible to document the estimated memory requirements for running whisper? EDIT: it looks like the cache migration worked, but it's whisper itself that's ballooning memory.
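For anyone who wants to put a number on this before the OOM killer steps in, here is a minimal sketch for measuring the peak host RAM of a command on Linux or macOS. It relies only on the standard-library `resource` module; the helper name `peak_child_rss_mib` is made up for illustration, and it says nothing about GPU VRAM.

```python
import resource
import subprocess
import sys


def peak_child_rss_mib(cmd):
    """Run cmd to completion and return the peak resident set size in MiB.

    RUSAGE_CHILDREN reports the largest waited-for child of this process,
    so call this from a fresh interpreter for a clean reading.
    """
    subprocess.run(cmd, check=False)
    peak = resource.getrusage(resource.RUSAGE_CHILDREN).ru_maxrss
    # ru_maxrss is in kilobytes on Linux and in bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return peak / divisor


# Self-contained demo: a child interpreter that allocates about 50 MB.
demo_cmd = [sys.executable, "-c", "x = bytearray(50_000_000)"]
print(f"peak RSS: {peak_child_rss_mib(demo_cmd):.0f} MiB")

# To measure whisper itself, swap in its command line (audio.mp3 is a
# placeholder file name), e.g. ["whisper", "audio.mp3", "--model", "tiny"].
```

Starting with a small model and working upward gives a rough idea of how the footprint grows before trying the one that gets killed.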
-
I had no problems running the medium model on an 8 GB card (GTX 1070).
-
I'm interested in this too. What would be a reasonable time to process 2 minutes of recorded phone conversation? I am testing on a Windows 11 virtual machine with 4 GB of RAM, on a host with an i9-9900K CPU. It takes quite a while to process 2 minutes of audio, and the medium model throws a "not enough memory" error.
-
Additional testing on:
-
And more testing on different GPUs:
8vCORE avx512 T4 16GB
24vCORE avx512 RTX A5000 24GB
24vCORE avx512 A30 24GB
12vCORE avx512 A2 14GB
6vCORE avx512 A100 40GB
-
Is it possible to do batch processing on the audio files so that we can transcribe more audio files in less time?
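Not an official answer, but one thing that may help is loading the model once and reusing it for every file, which at least avoids paying the load cost per file. A minimal sketch, assuming the Python API (`whisper.load_model` and `model.transcribe`, whose result dict has a `"text"` key); the helper `transcribe_many` is a made-up name. Note that this is a sequential loop, not true batched inference on the GPU.

```python
from typing import Callable, Dict, Iterable


def transcribe_many(
    paths: Iterable[str],
    transcribe: Callable[[str], dict],
) -> Dict[str, str]:
    """Transcribe each file in turn with one already-loaded model.

    `transcribe` is any callable that takes a file path and returns a
    dict with a "text" key, such as the bound method `model.transcribe`.
    """
    results = {}
    for path in paths:
        results[str(path)] = transcribe(str(path))["text"]
    return results


# Usage with the real library (file names are placeholders):
#   import whisper
#   model = whisper.load_model("small")
#   texts = transcribe_many(["a.mp3", "b.mp3"], model.transcribe)
```

Passing the transcribe function in as an argument keeps the loop easy to try out with a stub before committing GPU time to it.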
-
@n ➜ /workspaces/whisper (main ✗) $ whisper Jtest.mp3 --model small
Why is the process KILLED? Weird.
-
I've just added an "Available models and languages" section in README.md; to quote:
English-only models: tiny.en, base.en, small.en, medium.en
Multilingual models: tiny, base, small, medium, large
The VRAM requirements are from simulations using torch.cuda.set_per_process_memory_fraction(), so they may not actually reflect what happens on, e.g., a GPU with exactly 5 GB of VRAM.
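For anyone who wants to try a similar simulation on their own card, here is a rough sketch of how a VRAM cap could be set with `torch.cuda.set_per_process_memory_fraction()`. The helper names `memory_fraction` and `cap_cuda_memory` are made up for illustration, and this is not necessarily how the README numbers were produced. As far as I understand, the cap applies to PyTorch's own allocator, so it is only an approximation of a physically smaller GPU.

```python
def memory_fraction(cap_gib: float, total_gib: float) -> float:
    """Fraction of total device memory that corresponds to cap_gib."""
    if cap_gib <= 0 or total_gib <= 0:
        raise ValueError("memory sizes must be positive")
    # The fraction cannot exceed 1.0, i.e. the whole device.
    return min(1.0, cap_gib / total_gib)


def cap_cuda_memory(cap_gib: float, device: int = 0):
    """Limit this process to roughly cap_gib of VRAM on the given device.

    Returns the fraction applied, or None when PyTorch or CUDA is missing.
    """
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    total_bytes = torch.cuda.get_device_properties(device).total_memory
    fraction = memory_fraction(cap_gib, total_bytes / 2**30)
    torch.cuda.set_per_process_memory_fraction(fraction, device)
    return fraction


# Example: simulate a card with about 5 GiB before loading a model.
print(cap_cuda_memory(5.0))
```

Calling `cap_cuda_memory` before loading the model means allocations beyond the cap should fail with an out-of-memory error instead of using the rest of the card.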