Releases: Uberi/speech_recognition

Version 3.4.1

03 Apr 10:41

Changes:

  • BREAKING CHANGE: AT&T STT API IS BEING SHUT DOWN SOON. (source)
    • For now, the recognize_att function will keep working, until the API itself is shut down.
    • It is best to transition over to IBM, Wit.ai, Google, CMU Sphinx, Bing Voice, or api.ai as soon as possible.
    • In most cases, you can simply rename recognize_att to a different service like recognize_ibm, then generate new API keys/tokens for it.
  • DEPRECATED CLASS: WavFile has been renamed to AudioFile.
    • WavFile will continue to work for the foreseeable future. New code should use AudioFile.
    • AudioFile is the same as WavFile, but in addition to WAV, it also supports AIFF and FLAC files!
  • New api.ai support, courtesy of @sbraden! See recognize_api in the library reference.
  • New Microsoft Bing Voice Recognition API support! See recognize_bing in the library reference.
  • Support for 8-bit unsigned WAV audio (thanks to @zhaoqf123 for reporting!).
  • Faster, upgraded FLAC binaries, with the Linux binaries built using Holy Build Box for maximum distro compatibility.
  • Updated setup process for Wit.ai.
  • Updated phrase retrieval for recognize_ibm, courtesy of Bhavik Shah from IBM.
  • Documentation improvements and code cleanup.
  • Clearer licensing information - see the README.
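
The migration notes above can be sketched roughly as follows. This is a minimal illustration, not code from the library: audio_file_class and recognize_with are hypothetical helper names, and the credential keyword arguments are left to the caller because each recognize_* method takes its own (see the library reference). The module is passed in as a parameter (sr, as in "import speech_recognition as sr") so the helpers adapt to whichever version is installed.

```python
def audio_file_class(sr):
    """Return sr.AudioFile when available, else the deprecated sr.WavFile.

    `sr` is the imported speech_recognition module.
    """
    cls = getattr(sr, "AudioFile", None)
    if cls is None:
        # Releases before 3.4.1 only provide WavFile.
        cls = sr.WavFile
    return cls


def recognize_with(recognizer, service, audio, **credentials):
    """Call recognizer.recognize_<service>(audio, **credentials).

    Moving off AT&T then means changing `service` from "att" to, say, "ibm"
    and supplying that service's own keys/tokens as keyword arguments.
    """
    method = getattr(recognizer, "recognize_" + service)
    return method(audio, **credentials)
```

With the library installed, usage might look like: create recognizer = sr.Recognizer(), open the file with audio_file_class(sr)("speech.wav") as source, read it with recognizer.record(source), then call recognize_with(recognizer, "sphinx", audio). The file name is a placeholder.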

As always, you can upgrade with pip install --upgrade speechrecognition.

Version 3.3.3

05 Mar 18:38

Changes:

  • Tiny fix to some error checking.
  • The version number is all 3's now. This is pretty OK.

Version 3.3.2

04 Mar 03:20

Bugfix release!

  • Fix exception_on_overflow shenanigans. This version will eliminate those pesky ValueErrors.
  • The overflow error should well and truly be gone now.

Special thanks to @michaelpri10 for reporting the exception_on_overflow bug.

Version 3.3.1

26 Feb 05:34

Fix for list_microphone_names, courtesy of @ibutra. Fully compatible with 3.3.0.

See #85 for more details!

Version 3.3.0

20 Feb 21:13

Major changes since 3.2.1:

  • Possible backward incompatibility: if PyAudio is not installed, Microphone now raises an AttributeError when created rather than not being defined.
    • This only requires changes if you are explicitly testing for the existence of the class, using something like hasattr or getattr.
  • More robust error handling - always clean up PyAudio resources, no matter what error conditions arise.
  • Better error checking - always verify PyAudio version.
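
For code affected by the Microphone change above, one possible approach is to attempt construction and catch the error instead of testing for the attribute. A minimal sketch, assuming sr is the imported speech_recognition module; microphone_or_none is a hypothetical helper name, not part of the library.

```python
def microphone_or_none(sr):
    """Return a Microphone instance, or None when microphone support is unavailable.

    Covers both behaviours described in the notes:
    - before 3.3.0, Microphone is not defined when PyAudio is missing, so the
      attribute lookup itself raises AttributeError;
    - from 3.3.0, Microphone exists, but creating it raises AttributeError.
    """
    try:
        return sr.Microphone()
    except AttributeError:
        return None
```

Because both cases surface as AttributeError, the same helper should behave identically on either side of the 3.3.0 boundary.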

Version 3.2.1

20 Feb 21:17

Changes since 3.2.0:

  • Significantly improved and reorganized documentation.
  • Fix for overflow issue, courtesy of @jbwincek.

Version 3.2.0

20 Feb 21:05

Major changes since 3.1.3:

  • Support for recognition using CMU Sphinx - do speech recognition while offline!
    • English supported out of the box; French and Mandarin available for download. See the README for details.
  • Automatic sample rate/sample width conversions; users shouldn't have to worry about audio formats at all.
  • Lots of documentation improvements.
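
As a rough illustration of offline recognition, the sketch below transcribes a file with CMU Sphinx. It assumes sr is the imported speech_recognition module and that PocketSphinx is installed as described in the README; transcribe_offline is a hypothetical helper name. It uses the AudioFile class name, which per the 3.4.1 notes was called WavFile in earlier releases.

```python
def transcribe_offline(sr, path):
    """Transcribe an audio file with CMU Sphinx; return None if the speech is unintelligible.

    `sr` is the imported speech_recognition module.
    """
    recognizer = sr.Recognizer()
    # On releases before 3.4.1, use sr.WavFile here instead of sr.AudioFile.
    with sr.AudioFile(path) as source:
        audio = recognizer.record(source)  # read the whole file into memory
    try:
        return recognizer.recognize_sphinx(audio)
    except sr.UnknownValueError:
        # Raised when Sphinx could not make sense of the audio.
        return None
```

No sample rate or sample width handling appears in the sketch because, as noted above, the library performs those conversions itself.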

Version 3.1.3

05 Nov 02:12
  • Work around an obscure standard library issue.
    • The workaround doesn't apply on big-endian systems - use Python 3.5 there to get the proper bugfix.
  • Handle stereo WAV files correctly.

Version 3.1.2

02 Nov 05:54

Changes since 3.1.0:

  • Update documentation to account for new releases of Python, PyInstaller, and PyAudio.
    • The new PyAudio version fixes an obscure overflow issue and also makes installation much easier on all platforms.
    • New documentation for the updated PyAudio installation process.
  • General documentation improvements.
  • Handle errors better and add additional error checks.

Version 3.1.0

02 Nov 05:54

Changes since 3.0.0:

  • Support for AT&T Speech to Text API.
  • Small documentation tweaks.