# Releases: fcakyon/pywhisper
## v1.0.0
### pywhisper

openai/whisper + extra features
### extra features

- no need for ffmpeg cli installation, pip install is enough
- srt export (see the sketch after this list)
- progress bar for `transcribe`
- continuous integration and package testing via github actions
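The srt export itself ships with the package; as an illustration of what it produces, here is a hand-rolled sketch that writes an `.srt` file from the segments `transcribe()` returns. The `to_srt_time` helper is hypothetical (not a pywhisper API), and the segment layout (`start`, `end`, `text` keys, with times in seconds) is assumed to match openai/whisper's output:

```python
import pywhisper

def to_srt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp, e.g. 00:01:02,345. (hypothetical helper)"""
    ms = int(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1_000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"

model = pywhisper.load_model("base")
result = model.transcribe("audio.mp3")

# each segment is assumed to carry start/end times in seconds plus the decoded text
with open("audio.srt", "w", encoding="utf-8") as f:
    for i, seg in enumerate(result["segments"], start=1):
        f.write(f"{i}\n")
        f.write(f"{to_srt_time(seg['start'])} --> {to_srt_time(seg['end'])}\n")
        f.write(seg["text"].strip() + "\n\n")
```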
### setup

```
pip install pywhisper
```

You may need Rust installed as well, in case `tokenizers` does not provide a pre-built wheel for your platform. If you see installation errors during the `pip install` command above, please follow the Getting started page to install the Rust development environment.
### command-line usage

The following command will transcribe speech in audio files, using the `medium` model:

```
pywhisper audio.flac audio.mp3 audio.wav --model medium
```
The default setting (which selects the `small` model) works well for transcribing English. To transcribe an audio file containing non-English speech, you can specify the language using the `--language` option:

```
pywhisper japanese.wav --language Japanese
```
Adding `--task translate` will translate the speech into English:

```
pywhisper japanese.wav --language Japanese --task translate
```
Run the following to view all available options:

```
pywhisper --help
```

See tokenizer.py for the list of all available languages.
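If you'd rather inspect the language list programmatically, the following sketch assumes pywhisper keeps openai/whisper's `tokenizer.LANGUAGES` mapping of language codes to names (an assumption about the fork, not a documented pywhisper API):

```python
from pywhisper.tokenizer import LANGUAGES  # assumed to mirror openai/whisper

# print every supported language code alongside its full name
for code, name in sorted(LANGUAGES.items()):
    print(f"{code}: {name}")
```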
### python usage

Transcription can also be performed within Python:

```python
import pywhisper

model = pywhisper.load_model("base")
result = model.transcribe("audio.mp3")
print(result["text"])
```
Internally, the `transcribe()` method reads the entire file and processes the audio with a sliding 30-second window, performing autoregressive sequence-to-sequence predictions on each window.
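To make the sliding-window idea concrete, here is a simplified, hedged sketch that decodes a long recording in fixed 30-second chunks. The real `transcribe()` is smarter (it shifts the window based on the timestamps it predicts), and the 16 kHz sample rate and `fp16` flag are assumptions carried over from openai/whisper:

```python
import pywhisper

model = pywhisper.load_model("base")
audio = pywhisper.load_audio("long_audio.mp3")  # assumed to return a 16 kHz mono waveform

CHUNK = 30 * 16000  # 30 seconds of samples at whisper's assumed 16 kHz rate
texts = []
for start in range(0, len(audio), CHUNK):
    # pad the last (possibly short) chunk up to exactly 30 seconds
    chunk = pywhisper.pad_or_trim(audio[start:start + CHUNK])
    mel = pywhisper.log_mel_spectrogram(chunk).to(model.device)
    result = pywhisper.decode(model, mel, pywhisper.DecodingOptions(fp16=False))
    texts.append(result.text)

print(" ".join(texts))
```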
Below is an example usage of `pywhisper.detect_language()` and `pywhisper.decode()`, which provide lower-level access to the model.
```python
import pywhisper

model = pywhisper.load_model("base")

# load audio and pad/trim it to fit 30 seconds
audio = pywhisper.load_audio("audio.mp3")
audio = pywhisper.pad_or_trim(audio)

# make log-Mel spectrogram and move to the same device as the model
mel = pywhisper.log_mel_spectrogram(audio).to(model.device)

# detect the spoken language
_, probs = model.detect_language(mel)
print(f"Detected language: {max(probs, key=probs.get)}")

# decode the audio
options = pywhisper.DecodingOptions()
result = pywhisper.decode(model, mel, options)

# print the recognized text
print(result.text)
```
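`DecodingOptions` also lets you pin the language or switch the task instead of relying on detection. The field names below (`language`, `task`, `fp16`) come from openai/whisper's `DecodingOptions` and are assumed to be unchanged in this fork:

```python
import pywhisper

model = pywhisper.load_model("base")
mel = pywhisper.log_mel_spectrogram(
    pywhisper.pad_or_trim(pywhisper.load_audio("japanese.wav"))
).to(model.device)

# skip language detection and translate Japanese speech into English;
# fp16=False avoids half-precision warnings when running on CPU
options = pywhisper.DecodingOptions(language="ja", task="translate", fp16=False)
print(pywhisper.decode(model, mel, options).text)
```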
### What's Changed
- initial commit by @fcakyon in #1
- add srt export, add cli test, improve test speed by @fcakyon in #2
- add progress bar with tqdm by @fcakyon in #3
### New Contributors

- @fcakyon made their first contribution in #1
Full Changelog: https://github.com/fcakyon/pywhisper/commits/1.0.0