-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add transcriber tool #851
Merged
Merged
Add transcriber tool #851
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
… and isdir condition with mutiple processing in vosk_transcriber.py
Can the command |
@ls-milkyway yes, srt option gonna be there. |
nshmyrev
reviewed
Feb 28, 2022
@ls-milkyway, Hello, test the program please, need feedback :) |
nshmyrev
requested changes
Mar 22, 2022
…some variables as constans, changed spot of calculate(old)-get_result(new) method inside of main, changed type transmitted args between scripts and methods, changed function from print to log for output script info(ececution time and xRT)
nshmyrev
reviewed
Mar 29, 2022
…added into bin script with setup.py machinery; some algorithm mistakes were fixed
nshmyrev
requested changes
Apr 5, 2022
…rocessing args.lang and args.model_name
nshmyrev
reviewed
Apr 14, 2022
nshmyrev
reviewed
Apr 14, 2022
nshmyrev
reviewed
Apr 14, 2022
nshmyrev
reviewed
Apr 14, 2022
… method; reworked input args types for files or folders simultineously
nshmyrev
requested changes
Apr 18, 2022
nshmyrev
reviewed
Apr 19, 2022
…e_list and get_list_languages methods with using set; changed log msgs; moved download_model method inside get_model method; added error msgs for non-existend args
nshmyrev
requested changes
Apr 19, 2022
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Transcriber allows you transcribe your audiofile. The project contain ffmpeg transform that allows you transcribe all ffmpeg formats. Also in result data you can see time of transcription and xRT. Also in setup.py was added ability to run the aligner not only from the folder with it was added.
How to work:
Run the script(example 1 file): python3 vosk_transcriber.py moon.wav
You can get result in terminal or choose output file like txt or srt, it will look like:
python3 vosk_transcriber.py moon.wav -output moon.txt
and key -otype srt(default -otype is txt)
Run script(example file folder):
python3 vosk_transcriber.py ~/file_folder -output ~/results
In result you will get output folder with transcribed files in txt or srt format
To see available models -list_models;
Model vosk-model-small-en-us-0.15(https://alphacephei.com/vosk/models/vosk-model-small-en-us-0.15.zip) as
default model
If you wanna choose another model there are 2 ways:
a) -lang 'language' (available languages 'lang' - more info: https://alphacephei.com/vosk/models/model-list.json). It will load the smallest model for the given language.
b) -model_name 'name' of json file above. Example: -model_name vosk-model-small-tr-0.3