Skip to content

Releases: Uberi/speech_recognition

Version 3.12.0

08 Dec 12:57
Compare
Choose a tag to compare

SpeechRecognition 3.12.0 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

New features

Groq Support

  • recognizer_instance.recognize_groq() (@ftnext in #797 & #803)
    • Use with an environment variable GROQ_API_KEY
  • Rename to recognizer_instance.recognize_openai() (Deprecate recognizer_instance.recognize_whisper_api()) (@ftnext in #801)
    • SpeechRecognition supports Whisper API served OpenAI and Groq
    • Use with an environment variable OPENAI_API_KEY
    • OpenAI's optional parameters partially supported (@ftnext in #802, Idea from @wronkiew #676)
  • Decision: make the responsibility of checking environment variables each library (not SpeechRecognition) (@ftnext in #804)
    • export XXX_API_KEY=... or os.environ["XXX_API_KEY"] = ...

Python 3.13 Support (experimental)

  • Install aifc and audioop in Python 3.13 by @ftnext in #781
  • Run tests on Python 3.13 (excluding whisper-local) by @ftnext in #785

Improvement

Cleanup extras

Others

  • Install PyAudio in unittest Action by @ftnext in #774
  • Adopt pytest by @ftnext in #779
  • Fix "SetuptoolsDeprecationWarning: bdist_wheel.universal is deprecated" by @ftnext in #773

Full Changelog: 3.11.0...3.12.0

Version 3.11.0

20 Oct 09:28
Compare
Choose a tag to compare

SpeechRecognition 3.11.0 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

New features

Improvements

  • Support Python maintenance: Add 3.12, Drop 3.8 by @ftnext in #770
  • Remove deprecated distutils @ftnext in #768 and #769
    • Support pip install SpeechRecognition[audio]

New Contributors

Full Changelog: 3.10.4...3.11.0

Version 3.10.4

05 May 04:42
Compare
Choose a tag to compare

SpeechRecognition 3.10.4 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

New features

Improvements

Full Changelog: 3.10.3...3.10.4

Version 3.10.3

30 Mar 15:17
Compare
Choose a tag to compare

SpeechRecognition 3.10.3 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

Improvements

  • Tweak installation by @ftnext in #740
    • Support pip install SpeechRecognition[whisper-local]
    • Support pip install SpeechRecognition[whisper-api]
  • Add tests with mock by @ftnext (#738, #739)

Full Changelog: 3.10.2...3.10.3

Version 3.10.2

28 Mar 13:57
Compare
Choose a tag to compare

SpeechRecognition 3.10.2 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

Bugfixes

  • Updated to the latest OpenAI API changes, and fixed #720 by @herrjemand in #729

New Contributors

  • @herrjemand made their first contribution in #729

Thanks to all contributors!

Full Changelog: 3.10.1...3.10.2

Version 3.10.1

06 Dec 13:10
Compare
Choose a tag to compare

SpeechRecognition 3.10.1 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

New features

  • Support Python 3.11

Improvements

Thanks to all contributors!

Full Changelog: 3.10.0...3.10.1

Version 3.10.0

14 Mar 12:04
Compare
Choose a tag to compare

SpeechRecognition 3.10.0 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

New features

Improvements

Thanks❤️

  • Replace with in-memory stream on recognize_whisper by @ftnext in #647
  • Remove prints that shouldn't be printed by default by @kuzmoyev in #651
  • Codebase is under refactoring...

Deprecations

  • Drop inactive Python by @ftnext in #650
    • SpeechRecognition currently supports Python 3.8+

New Contributors

Thanks to all contributors!

Full Changelog: 3.9.0...3.10.0

Version 3.9.0

10 Jan 12:12
Compare
Choose a tag to compare

SpeechRecognition 3.9.0 was out on December 2022🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

New features

Thanks for making SpeechRecognition even more wonderful! 🙌

Bugfixes & improvements

Thanks!👏

Documentation improvements

Thanks!❤️

Improvements for developers

New Contributors

Thanks to all contributors!

Full Changelog: 3.8.1...3.9.0

Version 3.8.1

05 Dec 14:05
3.8.1
Compare
Choose a tag to compare

Lots of changes since June! Summary below. Get all of these and more with a quick pip install --upgrade SpeechRecognition.

  • Snowboy hotwords support for highly efficient, performant listening (thanks @beeedy!). This is implemented as the snowboy_configuration parameter of recognizer_instance.listen.
  • Configurable Pocketsphinx models - you can now specify your own acoustic parameters, language model, and phoneme dictionary, using the language parameter of recognizer_instance.recognize_sphinx (thanks @frawau!).
  • audio_data_instance.get_segment(start_ms=None, end_ms=None) is a new method that can be called on any AudioData instance to get a segment of the audio starting at start_ms and ending at end_ms. This is really useful when you want to get, say, only the first five seconds of some audio.
  • The stopper function returned by listen_in_background now accepts one parameter, wait_for_stop (defaulting to True for backwards compatibility), which determines whether the function will wait for the background thread to fully shutdown before returning. One advantage is that if wait_for_stop is False, you can call the stopper function from any thread!
  • New example, demonstrating how to simultaneously listen to and recognize speech with the threaded producer/consumer pattern: threaded_workers.py.
  • Various improvements and bugfixes:
    • Python 3 style type annotations in library documentation.
    • recognize_google_cloud now uses the v1 rather than the beta API (thanks @oort7!).
    • recognize_google_cloud now returns timestamp info when the show_all parameter is True.
    • recognize_bing won't time out as often on credential requests, due to a longer default timeout.
    • recognize_google_cloud timeouts respect recognizer_instance.operation_timeout now (thanks @reefactor!).
    • Any recognizers using FLAC audio were broken inside Linux on Docker - this is now fixed (thanks @reefactor!).
    • Various documentation and lint fixes (thanks @josh-hernandez-exe!).
    • Lots of small build system improvements.

Version 3.7.1

27 Jun 05:43
3.7.1
Compare
Choose a tag to compare

As usual, get it with pip install --upgrade SpeechRecognition

  • New grammar parameter for recognizer_instance.recognize_sphinx - now, you can specify a JSGF or FSG grammar to PocketSphinx (thanks @aleneum!).
  • Update PyAudio to version 0.2.11 - this fixes a couple memory management issues users have been experiencing.
  • Update FLAC to 1.3.2 on all platforms - this will make it easier to support more audio formats in the near future.
  • Fixes for various APIs on Python 3.6+ - small changes in urllib.request behavior made requests fail in certain situations.
  • Fixes for Bing Speech API timing out due to some backwards incompatible changes to their API.
  • Restore original IBM audio segmentation behaviour - previously, it would stop recognizing after the first pause. Now, it will recognize all speech in the input audio, as it did before IBM's changes.
  • Fix links in PocketSphinx docs and library reference. Add-on language models now available from Google Drive, including the now-officially-supported Italian model.
  • New troubleshooting entries for JACK server in README.
  • Documentation and build process updates.