Releases: Uberi/speech_recognition
Version 3.4.1
Changes:
- BREAKING CHANGE: AT&T STT API IS BEING SHUT DOWN SOON. (source)
  - For now, the `recognize_att` function will keep working, until the API itself is shut down.
  - It is best to transition over to IBM, Wit.ai, Google, CMU Sphinx, Bing Voice, or api.ai as soon as possible.
  - In most cases, you can simply rename `recognize_att` to a different service like `recognize_ibm`, then generate new API keys/tokens for it.
- DEPRECATED CLASS: `WavFile` has been renamed to `AudioFile`. `WavFile` will continue to work for the foreseeable future. New code should use `AudioFile`. `AudioFile` is the same as `WavFile`, but in addition to WAV, it also supports AIFF and FLAC files!
- New api.ai support, courtesy of @sbraden! See `recognize_api` in the library reference.
- New Microsoft Bing Voice Recognition API support! See `recognize_bing` in the library reference.
- Support for 8-bit unsigned WAV audio (thanks to @zhaoqf123 for reporting!).
- Faster, upgraded FLAC binaries, with Linux binaries using Holy Build Box for maximum distro compatibility.
- Updated setup process for Wit.ai.
- Update phrase retrieval for `recognize_ibm`, courtesy of Bhavik Shah from IBM.
- Documentation improvements and code cleanup.
- Clearer licensing information - see the README.
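For the AT&T shutdown noted above, one way to keep the switch to another service down to a one-word change is to look up the recognizer method by name. This is only a minimal sketch: `transcribe` and `FakeRecognizer` are hypothetical helpers, not part of the library, and the credential keyword names each `recognize_*` method expects should be taken from the library reference rather than from this example.

```python
def transcribe(recognizer, audio, service, **credentials):
    """Call recognizer.recognize_<service>(audio, **credentials).

    `recognizer` is assumed to be a speech_recognition.Recognizer (or any
    object exposing recognize_* methods); `service` is a suffix such as
    "ibm", "wit", "google", "sphinx", "bing", or "api".
    """
    method = getattr(recognizer, "recognize_" + service, None)
    if method is None:
        raise ValueError("unsupported service: " + service)
    return method(audio, **credentials)


class FakeRecognizer:
    """Hypothetical stand-in so the helper can be tried without network access."""

    def recognize_ibm(self, audio, username, password):
        return "ibm:" + audio


# Moving off AT&T then means changing the service name and the credentials.
result = transcribe(FakeRecognizer(), "hello", "ibm", username="u", password="p")
```
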
As always, you can upgrade with `pip install --upgrade speechrecognition`.
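For code that has to run against both older and newer versions during the `WavFile` to `AudioFile` transition, a small lookup can pick whichever class is available. The sketch below is an assumption about how one might do this, not something the library provides: `audio_file_class` is a hypothetical helper, and the two `SimpleNamespace` objects merely stand in for an old and a new `speech_recognition` module.

```python
from types import SimpleNamespace


def audio_file_class(sr_module):
    """Return AudioFile when the module defines it, else the older WavFile."""
    cls = getattr(sr_module, "AudioFile", None)
    if cls is None:
        cls = sr_module.WavFile
    return cls


# Stand-ins for a pre-3.4 module and a 3.4+ module (strings instead of classes).
old_module = SimpleNamespace(WavFile="WavFile")
new_module = SimpleNamespace(WavFile="WavFile", AudioFile="AudioFile")

picked_old = audio_file_class(old_module)
picked_new = audio_file_class(new_module)

# With the real library one would pass the imported module instead, e.g.
# audio_file_class(speech_recognition), and use the result as a context manager.
```
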
Version 3.3.3
Changes:
- Tiny fix to some error checking.
- The version number is all 3's now. This is pretty OK.
Version 3.3.2
Bugfix release!
- Fix `exception_on_overflow` shenanigans. This version will eliminate those pesky `ValueError`s.
- The overflow error should well and truly be gone now.
Special thanks to @michaelpri10 for reporting the `exception_on_overflow` bug.
Version 3.3.1
Version 3.3.0
Major changes since 3.2.1:
- Possible backward incompatibility: if PyAudio is not installed, `Microphone` now throws an `AttributeError` when created rather than not being defined.
  - This only requires changes if you are explicitly testing for the existence of the class, using something like `hasattr` or `getattr`.
- More robust error handling - always clean up PyAudio resources, no matter what error conditions arise.
- Better error checking - always verify PyAudio version.
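Because `Microphone` is now always defined, a `hasattr` check no longer reveals whether PyAudio is missing; the error only surfaces on creation. A minimal sketch of the adjusted check follows, assuming the caller wants `None` when no microphone can be opened. `open_microphone` and `missing_pyaudio` are hypothetical names used for illustration.

```python
def open_microphone(microphone_factory):
    """Return a microphone object, or None if creating it raises AttributeError.

    Pass the Microphone class itself (or any zero-argument callable).
    """
    try:
        return microphone_factory()
    except AttributeError:
        return None


def missing_pyaudio():
    # Simulates creating a Microphone on a system without PyAudio installed.
    raise AttributeError("PyAudio is not available")


unavailable = open_microphone(missing_pyaudio)
available = open_microphone(lambda: "mic")
```

With the real library the call would look like `open_microphone(speech_recognition.Microphone)`.
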
Version 3.2.1
Changes since 3.2.0:
- Significantly improved and reorganized documentation.
- Fix for overflow issue, courtesy of @jbwincek.
Version 3.2.0
Major changes since 3.1.3:
- Support for recognition using CMU Sphinx - do speech recognition while offline!
- English supported out of the box; French and Mandarin available for download. See the README for details.
- Automatic sample rate/sample width conversions; users shouldn't have to worry about audio formats at all.
- Lots of documentation improvements.
Version 3.1.3
- Work around an obscure standard library issue.
  - This workaround doesn't work on big-endian systems; use Python 3.5 to get the proper bugfix for that.
- Handle stereo WAV files correctly.
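To illustrate what handling stereo input can involve, here is a minimal sketch that downmixes a 16-bit stereo WAV to mono by averaging the two channels, using only the standard library. This is one common approach and is not necessarily how the library does it; `stereo_to_mono` is a hypothetical helper, and the sketch assumes 16-bit samples.

```python
import array
import io
import struct
import sys
import wave


def stereo_to_mono(wav_bytes):
    """Downmix a 16-bit stereo WAV (given as bytes) to a mono WAV (as bytes)."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as reader:
        if reader.getnchannels() != 2 or reader.getsampwidth() != 2:
            raise ValueError("expected 16-bit stereo audio")
        rate = reader.getframerate()
        samples = array.array("h", reader.readframes(reader.getnframes()))
    # WAV samples are little-endian; array uses the native byte order.
    if sys.byteorder == "big":
        samples.byteswap()
    # Samples are interleaved as left, right, left, right, ...
    mono = array.array(
        "h",
        ((samples[i] + samples[i + 1]) // 2 for i in range(0, len(samples), 2)),
    )
    if sys.byteorder == "big":
        mono.byteswap()
    out = io.BytesIO()
    with wave.open(out, "wb") as writer:
        writer.setnchannels(1)
        writer.setsampwidth(2)
        writer.setframerate(rate)
        writer.writeframes(mono.tobytes())
    return out.getvalue()


# Build a two-frame stereo WAV in memory and downmix it.
source = io.BytesIO()
with wave.open(source, "wb") as w:
    w.setnchannels(2)
    w.setsampwidth(2)
    w.setframerate(16000)
    w.writeframes(struct.pack("<4h", 100, 300, -100, -300))

with wave.open(io.BytesIO(stereo_to_mono(source.getvalue())), "rb") as r:
    mono_channels = r.getnchannels()
    mono_samples = struct.unpack("<2h", r.readframes(2))
```
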
Version 3.1.2
Changes since 3.1.0:
- Update documentation to account for new releases of Python, PyInstaller, and PyAudio.
- The new PyAudio version fixes an obscure overflow issue and also makes installation much easier on all platforms.
- New documentation for the updated PyAudio installation process.
- General documentation improvements.
- Handle errors better and add additional error checks.
Version 3.1.0
Changes since 3.0.0:
- Support for AT&T Speech to Text API.
- Small documentation tweaks.