Releases: Uberi/speech_recognition
Version 3.12.0
SpeechRecognition 3.12.0 is out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!
What's Changed
New features
Groq Support
- recognizer_instance.recognize_groq() (@ftnext in #797 & #803)
  - Use with an environment variable GROQ_API_KEY
- Rename recognizer_instance.recognize_whisper_api() to recognizer_instance.recognize_openai(); the old name is deprecated (@ftnext in #801)
- Decision: checking environment variables is the responsibility of each underlying library, not SpeechRecognition (@ftnext in #804)
  - Set the key with export XXX_API_KEY=... or os.environ["XXX_API_KEY"] = ...
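As a rough illustration of the environment-variable workflow described above, the sketch below checks for the key before calling recognize_groq(). The helper names require_api_key and transcribe_with_groq are hypothetical and are not part of SpeechRecognition. Since the library leaves the check to the underlying client, a fail-fast check like this is an optional application-side choice. The sketch assumes the Groq client dependency is installed and that the input is a WAV file.

```python
import os


def require_api_key(name: str) -> str:
    """Return the API key stored in environment variable `name`, or raise.

    Hypothetical application-side helper; SpeechRecognition itself does not
    perform this check.
    """
    key = os.environ.get(name, "").strip()
    if not key:
        raise RuntimeError(f"Set {name} first, e.g. export {name}=...")
    return key


def transcribe_with_groq(wav_path: str) -> str:
    """Transcribe a WAV file with recognize_groq() (hypothetical wrapper)."""
    # Imported lazily so the key check can be used without the package.
    import speech_recognition as sr

    require_api_key("GROQ_API_KEY")
    recognizer = sr.Recognizer()
    with sr.AudioFile(wav_path) as source:
        audio = recognizer.record(source)  # read the whole file
    return recognizer.recognize_groq(audio)
```

Calling transcribe_with_groq("speech.wav") without GROQ_API_KEY set raises a RuntimeError before any audio is read.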
Python 3.13 Support (experimental)
- Install aifc and audioop in Python 3.13 by @ftnext in #781
- Run tests on Python 3.13 (excluding whisper-local) by @ftnext in #785
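For context, aifc and audioop were removed from the standard library in Python 3.13, which is why they now have to be installed separately. The following sketch, which is not part of SpeechRecognition, shows one way to check whether the current interpreter can import them. The function name missing_audio_modules is an assumption made for illustration.

```python
import importlib.util
import sys

# Modules removed from the standard library in Python 3.13.
REMOVED_AUDIO_MODULES = ("aifc", "audioop")


def missing_audio_modules() -> list:
    """Return which of the removed audio modules cannot be found here."""
    return [
        name
        for name in REMOVED_AUDIO_MODULES
        if importlib.util.find_spec(name) is None
    ]


if __name__ == "__main__":
    missing = missing_audio_modules()
    version = ".".join(map(str, sys.version_info[:2]))
    if missing:
        print(f"Python {version}: missing {', '.join(missing)}")
    else:
        print(f"Python {version}: aifc and audioop are importable")
```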
Improvement
Cleanup extras
Others
- Install PyAudio in unittest Action by @ftnext in #774
- Adopt pytest by @ftnext in #779
- Fix "SetuptoolsDeprecationWarning: bdist_wheel.universal is deprecated" by @ftnext in #773
Full Changelog: 3.11.0...3.12.0
Version 3.11.0
SpeechRecognition 3.11.0 is out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!
What's Changed
New features
- Add stream= kwarg to Recognizer.listen by @clusterfudge in #757
Improvements
- Support Python maintenance: Add 3.12, Drop 3.8 by @ftnext in #770
- Remove deprecated distutils by @ftnext in #768 and #769
  - Support pip install SpeechRecognition[audio]
New Contributors
- @clusterfudge made their first contribution in #757
Full Changelog: 3.10.4...3.11.0
Version 3.10.4
SpeechRecognition 3.10.4 is out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!
What's Changed
New features
- Support M1 Mac by @ftnext in #745
- Enable to pass SSL endpoint to cope with WinError 10054 by @ftnext in #754
Improvements
- Migrate from TravisCI by @ftnext in #741 #742
- Fix code according to flake8
- Run unittests on Windows in CI by @ftnext in #746
Full Changelog: 3.10.3...3.10.4
Version 3.10.3
SpeechRecognition 3.10.3 is out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!
What's Changed
Improvements
- Tweak installation by @ftnext in #740
  - Support pip install SpeechRecognition[whisper-local]
  - Support pip install SpeechRecognition[whisper-api]
- Add tests with mock by @ftnext (#738, #739)
Full Changelog: 3.10.2...3.10.3
Version 3.10.2
SpeechRecognition 3.10.2 is out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!
What's Changed
Bugfixes
New Contributors
- @herrjemand made their first contribution in #729
Thanks to all contributors!
Full Changelog: 3.10.1...3.10.2
Version 3.10.1
SpeechRecognition 3.10.1 is out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!
What's Changed
New features
- Support Python 3.11
Improvements
Thanks to all contributors!
Full Changelog: 3.10.0...3.10.1
Version 3.10.0
SpeechRecognition 3.10.0 is out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!
What's Changed
New features
Improvements
Thanks❤️
- Replace with in-memory stream on recognize_whisper by @ftnext in #647
- Remove prints that shouldn't be printed by default by @kuzmoyev in #651
- Codebase is under refactoring...
Deprecations
New Contributors
Thanks to all contributors!
Full Changelog: 3.9.0...3.10.0
Version 3.9.0
SpeechRecognition 3.9.0 was released in December 2022🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!
What's Changed
New features
Thanks for making SpeechRecognition even more wonderful! 🙌
- Add recognize_tensorflow by @chriamue in #296
- Add recognize_vosk by @mytja in #513
- Add recognize_amazon and recognize_assemblyai by @chrisspen in #434
- Add recognize_whisper by @joy-void-joy in #625
Bugfixes & improvements
Thanks!👏
- Update to speechContext formatting for recognize_google_cloud by @dcam0050 in #304
- Fix for OSError: [Errno -9988] Stream closed Error by @chriamue in #306
- Add parameter to change profanity filter level for Google Speech Recognition by @jorgegarciadev in #363
- Updating Wit API version (20160526 -> 20170307) by @Franck-Dernoncourt in #344
- Google cloud speech library by @frnsys in #406
- Fix large cpu consumption in snowboy detect by @Aculeasis in #395
- Replace Bing Speech API with Azure Speech API by @lastcoolnameleft in #389
- Removed duplicate code by @jhoelzl in #321
- fix recognize_google_cloud by @alinerguio in #601
- Pin pocketsphinx temporarily by @ftnext in #627
- Specify fp16 parameter for whisper by @ftnext in #630
Documentation improvements
Thanks!❤️
- Update pocketsphinx.rst by @fygul in #396
- docs: fix simple typo, covnert -> convert by @timgates42 in #536
- Update pocketsphinx.rst by @fygul in #435
Improvements for developers
- Fix Travis build by @native-api in #418 (Thanks!)
- Fix unit tests of recognize_google method by @ftnext in #619
New Contributors
- @dcam0050 made their first contribution in #304
- @chriamue made their first contribution in #296
- @jorgegarciadev made their first contribution in #363
- @Franck-Dernoncourt made their first contribution in #344
- @fygul made their first contribution in #396
- @frnsys made their first contribution in #406
- @Aculeasis made their first contribution in #395
- @lastcoolnameleft made their first contribution in #389
- @native-api made their first contribution in #418
- @mytja made their first contribution in #513
- @alinerguio made their first contribution in #601
- @chrisspen made their first contribution in #434
- @timgates42 made their first contribution in #536
- @joy-void-joy made their first contribution in #625
Thanks to all contributors!
Full Changelog: 3.8.1...3.9.0
Version 3.8.1
Lots of changes since June! Summary below. Get all of these and more with a quick pip install --upgrade SpeechRecognition.
- Snowboy hotwords support for highly efficient, performant listening (thanks @beeedy!). This is implemented as the snowboy_configuration parameter of recognizer_instance.listen.
- Configurable Pocketsphinx models - you can now specify your own acoustic parameters, language model, and phoneme dictionary, using the language parameter of recognizer_instance.recognize_sphinx (thanks @frawau!).
- audio_data_instance.get_segment(start_ms=None, end_ms=None) is a new method that can be called on any AudioData instance to get a segment of the audio starting at start_ms and ending at end_ms. This is really useful when you want to get, say, only the first five seconds of some audio.
- The stopper function returned by listen_in_background now accepts one parameter, wait_for_stop (defaulting to True for backwards compatibility), which determines whether the function will wait for the background thread to fully shut down before returning. One advantage is that if wait_for_stop is False, you can call the stopper function from any thread!
- New example, demonstrating how to simultaneously listen to and recognize speech with the threaded producer/consumer pattern: threaded_workers.py.
- Various improvements and bugfixes:
  - Python 3 style type annotations in library documentation.
  - recognize_google_cloud now uses the v1 rather than the beta API (thanks @oort7!).
  - recognize_google_cloud now returns timestamp info when the show_all parameter is True.
  - recognize_bing won't time out as often on credential requests, due to a longer default timeout.
  - recognize_google_cloud timeouts respect recognizer_instance.operation_timeout now (thanks @reefactor!).
  - Any recognizers using FLAC audio were broken inside Linux on Docker - this is now fixed (thanks @reefactor!).
  - Various documentation and lint fixes (thanks @josh-hernandez-exe!).
  - Lots of small build system improvements.
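To make the get_segment(start_ms=None, end_ms=None) behaviour described above more concrete, here is a standalone sketch of how a millisecond range can be mapped onto raw mono PCM bytes. It illustrates the arithmetic only and is not the library's implementation. The function name segment_pcm and its parameters are assumptions, and it rounds down to whole samples so that slices stay aligned.

```python
def segment_pcm(frame_data, sample_rate, sample_width, start_ms=None, end_ms=None):
    """Return the bytes of mono PCM audio between start_ms and end_ms.

    sample_rate is in samples per second, sample_width in bytes per sample.
    None for start_ms means "from the beginning"; None for end_ms means
    "to the end", mirroring the defaults in the release note.
    """
    def to_byte_offset(ms):
        # Convert milliseconds to a whole number of samples, then to bytes.
        return (ms * sample_rate // 1000) * sample_width

    start = 0 if start_ms is None else to_byte_offset(start_ms)
    end = len(frame_data) if end_ms is None else to_byte_offset(end_ms)
    return frame_data[start:end]


if __name__ == "__main__":
    # Ten seconds of silent 16 kHz, 16-bit mono audio.
    ten_seconds = bytes(10 * 16000 * 2)
    first_five = segment_pcm(ten_seconds, 16000, 2, end_ms=5000)
    print(len(first_five))  # 5 s * 16000 samples/s * 2 bytes = 160000
```

With the library itself, the corresponding call on an AudioData instance would be audio_data_instance.get_segment(end_ms=5000).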
Version 3.7.1
As usual, get it with pip install --upgrade SpeechRecognition
- New grammar parameter for recognizer_instance.recognize_sphinx - now, you can specify a JSGF or FSG grammar to PocketSphinx (thanks @aleneum!).
- Update PyAudio to version 0.2.11 - this fixes a couple memory management issues users have been experiencing.
- Update FLAC to 1.3.2 on all platforms - this will make it easier to support more audio formats in the near future.
- Fixes for various APIs on Python 3.6+ - small changes in urllib.request behavior made requests fail in certain situations.
- Fixes for Bing Speech API timing out due to some backwards incompatible changes to their API.
- Restore original IBM audio segmentation behaviour - previously, it would stop recognizing after the first pause. Now, it will recognize all speech in the input audio, as it did before IBM's changes.
- Fix links in PocketSphinx docs and library reference. Add-on language models now available from Google Drive, including the now-officially-supported Italian model.
- New troubleshooting entries for JACK server in README.
- Documentation and build process updates.