Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] Add support for insanely-fast-whisper over faster-whisper #53

Open
Timtam opened this issue Nov 17, 2023 · 2 comments
Open

[Feature] Add support for insanely-fast-whisper over faster-whisper #53

Timtam opened this issue Nov 17, 2023 · 2 comments
Assignees
Labels
enhancement New feature or request

Comments

@Timtam
Copy link

Timtam commented Nov 17, 2023

Description of the feature you are requesting:

I'm by far no expert, but I noticed that insanely-fast-whisper is now out which further reduces time to transcribe audio: https://github.com/Vaibhavs10/insanely-fast-whisper

Describe the solution you'd like

Switch from faster-whisper to insanely-fast-whisper as a permanent replacement.

Describe alternatives you've considered

Optionally enable the user to decide wether they want to run faster-whisper or insanely-fast-whisper via an environment variable or something similar.

Additional context

None

@Arche151
Copy link

Arche151 commented Nov 18, 2023

Just tried it out and it's reeeally good! Also, it uses the large-v3 model by default, instead of large-v2 like faster-whisper. Aand the developer said, he will add diarization this week!

Edit:

Insanely-fast-whisper was about 4x faster than faster-whisper in my test, but it also uses much more VRAM than faster-whisper - about 13GB, compared to faster-whisper's 4,7GB.

So yeah, being able to choose which whisper-implementation to use in an environment variable would be great!

@pluja
Copy link
Owner

pluja commented Nov 21, 2023

I will look into this, it should not be that hard to add it as an alternative whisper backend with the current transcription-api implementation.

I would like to keep faster-whisper as a backend since it allows the use of CPU, and insanely-fast-whisper only allows GPU. So it should be a dropdown that allows the user to choose which backend they want to use.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants