Releases · Uberi/speech_recognition

08 Dec 12:57

ftnext

3.12.0

85ba2d2

Version 3.12.0 Latest

Latest

SpeechRecognition 3.12.0 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

New features

Groq Support

recognizer_instance.recognize_groq() (@ftnext in #797 & #803)
- Use with an environment variable GROQ_API_KEY
Rename to recognizer_instance.recognize_openai() (Deprecate recognizer_instance.recognize_whisper_api()) (@ftnext in #801)
- SpeechRecognition supports Whisper API served OpenAI and Groq
- Use with an environment variable OPENAI_API_KEY
- OpenAI's optional parameters partially supported (@ftnext in #802, Idea from @wronkiew #676)
Decision: make the responsibility of checking environment variables each library (not SpeechRecognition) (@ftnext in #804)
- export XXX_API_KEY=... or os.environ["XXX_API_KEY"] = ...

Python 3.13 Support (experimental)

Install aifc and audioop in Python 3.13 by @ftnext in #781
Run tests on Python 3.13 (excluding whisper-local) by @ftnext in #785

Improvement

Cleanup extras

Tidy dependencies: requests by @ftnext in #780
Extract pocketsphinx extra by @ftnext in #787

Others

Install PyAudio in unittest Action by @ftnext in #774
Adopt pytest by @ftnext in #779
Fix "SetuptoolsDeprecationWarning: bdist_wheel.universal is deprecated" by @ftnext in #773

Full Changelog: 3.11.0...3.12.0

Contributors

wronkiew and ftnext

Assets 2

20 Oct 09:28

ftnext

3.11.0

b0c91ae

Version 3.11.0

SpeechRecognition 3.11.0 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

New features

Add stream= kwarg to Recognizer.listen by @clusterfudge in #757

Improvements

Support Python maintenance: Add 3.12, Drop 3.8 by @ftnext in #770
Remove deprecated distutils @ftnext in #768 and #769
- Support pip install SpeechRecognition[audio]

New Contributors

@clusterfudge made their first contribution in #757

Full Changelog: 3.10.4...3.11.0

Contributors

clusterfudge and ftnext

Assets 2

05 May 04:42

ftnext

3.10.4

75a7f6b

Version 3.10.4

SpeechRecognition 3.10.4 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

New features

Support M1 Mac by @ftnext in #745
Enable to pass SSL endpoint to cope with WinError 10054 by @ftnext in #754
- Thanks to @Hexer-7 #751

Improvements

Migrate from TravisCI by @ftnext in #741 #742
- Fix code according to flake8
Run unittests on Windows in CI by @ftnext in #746

Full Changelog: 3.10.3...3.10.4

Contributors

ftnext and Hexer-7

Assets 2

30 Mar 15:17

ftnext

3.10.3

297b939

Version 3.10.3

SpeechRecognition 3.10.3 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

Improvements

Tweak installation by @ftnext in #740
- Support pip install SpeechRecognition[whisper-local]
- Support pip install SpeechRecognition[whisper-api]
Add tests with mock by @ftnext (#738, #739)

Full Changelog: 3.10.2...3.10.3

Contributors

ftnext

Assets 2

28 Mar 13:57

ftnext

3.10.2

c028409

Version 3.10.2

SpeechRecognition 3.10.2 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

Bugfixes

Updated to the latest OpenAI API changes, and fixed #720 by @herrjemand in #729

New Contributors

@herrjemand made their first contribution in #729

Thanks to all contributors!

Full Changelog: 3.10.1...3.10.2

Contributors

yackermann

Assets 2

06 Dec 13:10

ftnext

3.10.1

1b737c5

Version 3.10.1

SpeechRecognition 3.10.1 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

New features

Support Python 3.11

Improvements

Refactor recognize_google by @ftnext in #721

Thanks to all contributors!

Full Changelog: 3.10.0...3.10.1

Contributors

ftnext

Assets 2

14 Mar 12:04

ftnext

3.10.0

8b07762

Version 3.10.0

SpeechRecognition 3.10.0 was out🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

New features

Support Whisper API by @ftnext in #669

Improvements

Thanks❤️

Replace with in-memory stream on recognize_whisper by @ftnext in #647
Remove prints that shouldn't be printed by default by @kuzmoyev in #651
Codebase is under refactoring...

Deprecations

Drop inactive Python by @ftnext in #650
- SpeechRecognition currently supports Python 3.8+

New Contributors

@kuzmoyev made their first contribution in #651

Thanks to all contributors!

Full Changelog: 3.9.0...3.10.0

Contributors

kuzmoyev and ftnext

Assets 2

10 Jan 12:12

ftnext

3.9.0

3f4162f

Version 3.9.0

SpeechRecognition 3.9.0 was out on December 2022🎉
Get all of these and more with a quick pip install --upgrade SpeechRecognition.
Enjoy!

What's Changed

New features

Thanks for making SpeechRecognition even more wonderful! 🙌

Add recognize_tensorflow by @chriamue in #296
Add recognize_vosk by @mytja in #513
Add recognize_amazon and recognize_assemblyai by @chrisspen in #434
Add recognize_whisper by @joy-void-joy in #625

Bugfixes & improvements

Thanks!👏

Update to speechContext formatting for recognize_google_cloud by @dcam0050 in #304
Fix for OSError: [Errno -9988] Stream closed Error by @chriamue in #306
Add paramater to change profanity filter level for Google Speech Recognition by @jorgegarciadev in #363
Updating Wit API version (20160526 -> 20170307) by @Franck-Dernoncourt in #344
Google cloud speech library by @frnsys in #406
Fix large cpu consumption in snowboy detect by @Aculeasis in #395
Replace Bing Speech API with Azure Speech API by @lastcoolnameleft in #389
Removed duplicate code by @jhoelzl in #321
fix recognize_google_cloud by @alinerguio in #601
Pin pocketsphinx temporarily by @ftnext in #627
Specify fp16 parameter for whisper by @ftnext in #630

Documentation improvements

Thanks!❤️

Update pocketsphinx.rst by @fygul in #396
docs: fix simple typo, covnert -> convert by @timgates42 in #536
Update pocketsphinx.rst by @fygul in #435

Improvements for developers

Fix Travis build by @native-api in #418 (Thanks!)
Fix unit tests of recognize_google method by @ftnext in #619

New Contributors

@dcam0050 made their first contribution in #304
@chriamue made their first contribution in #296
@jorgegarciadev made their first contribution in #363
@Franck-Dernoncourt made their first contribution in #344
@fygul made their first contribution in #396
@frnsys made their first contribution in #406
@Aculeasis made their first contribution in #395
@lastcoolnameleft made their first contribution in #389
@native-api made their first contribution in #418
@mytja made their first contribution in #513
@alinerguio made their first contribution in #601
@chrisspen made their first contribution in #434
@timgates42 made their first contribution in #536
@joy-void-joy made their first contribution in #625

Thanks to all contributors!

Full Changelog: 3.8.1...3.9.0

Contributors

Franck-Dernoncourt, chrisspen, and 13 other contributors

Assets 2

05 Dec 14:05

Uberi

3.8.1

b24d057

Version 3.8.1

Lots of changes since June! Summary below. Get all of these and more with a quick pip install --upgrade SpeechRecognition.

Snowboy hotwords support for highly efficient, performant listening (thanks @beeedy!). This is implemented as the snowboy_configuration parameter of recognizer_instance.listen.
Configurable Pocketsphinx models - you can now specify your own acoustic parameters, language model, and phoneme dictionary, using the language parameter of recognizer_instance.recognize_sphinx (thanks @frawau!).
audio_data_instance.get_segment(start_ms=None, end_ms=None) is a new method that can be called on any AudioData instance to get a segment of the audio starting at start_ms and ending at end_ms. This is really useful when you want to get, say, only the first five seconds of some audio.
The stopper function returned by listen_in_background now accepts one parameter, wait_for_stop (defaulting to True for backwards compatibility), which determines whether the function will wait for the background thread to fully shutdown before returning. One advantage is that if wait_for_stop is False, you can call the stopper function from any thread!
New example, demonstrating how to simultaneously listen to and recognize speech with the threaded producer/consumer pattern: threaded_workers.py.
Various improvements and bugfixes:
- Python 3 style type annotations in library documentation.
- recognize_google_cloud now uses the v1 rather than the beta API (thanks @oort7!).
- recognize_google_cloud now returns timestamp info when the show_all parameter is True.
- recognize_bing won't time out as often on credential requests, due to a longer default timeout.
- recognize_google_cloud timeouts respect recognizer_instance.operation_timeout now (thanks @reefactor!).
- Any recognizers using FLAC audio were broken inside Linux on Docker - this is now fixed (thanks @reefactor!).
- Various documentation and lint fixes (thanks @josh-hernandez-exe!).
- Lots of small build system improvements.

Assets 2

27 Jun 05:43

Uberi

3.7.1

566a70b

Version 3.7.1

As usual, get it with pip install --upgrade SpeechRecognition

New grammar parameter for recognizer_instance.recognize_sphinx - now, you can specify a JSGF or FSG grammar to PocketSphinx (thanks @aleneum!).
Update PyAudio to version 0.2.11 - this fixes a couple memory management issues users have been experiencing.
Update FLAC to 1.3.2 on all platforms - this will make it easier to support more audio formats in the near future.
Fixes for various APIs on Python 3.6+ - small changes in urllib.request behavior made requests fail in certain situations.
Fixes for Bing Speech API timing out due to some backwards incompatible changes to their API.
Restore original IBM audio segmentation behaviour - previously, it would stop recognizing after the first pause. Now, it will recognize all speech in the input audio, as it did before IBM's changes.
Fix links in PocketSphinx docs and library reference. Add-on language models now available from Google Drive, including the now-officially-supported Italian model.
New troubleshooting entries for JACK server in README.
Documentation and build process updates.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What's Changed

New features

Improvement

Contributors

What's Changed

New features

Improvements

New Contributors

Contributors

What's Changed

New features

Improvements

Contributors

What's Changed

Improvements

Contributors

What's Changed

Bugfixes

New Contributors

Contributors

What's Changed

New features

Improvements

Contributors

What's Changed

New features

Improvements

Deprecations

New Contributors

Contributors

What's Changed

New features

Bugfixes & improvements

Documentation improvements

Improvements for developers

New Contributors

Contributors

Releases: Uberi/speech_recognition

Version 3.12.0

What's Changed

New features

Improvement

Contributors

Version 3.11.0

What's Changed

New features

Improvements

New Contributors

Contributors

Version 3.10.4

What's Changed

New features

Improvements

Contributors

Version 3.10.3

What's Changed

Improvements

Contributors

Version 3.10.2

What's Changed

Bugfixes

New Contributors

Contributors

Version 3.10.1

What's Changed

New features

Improvements

Contributors

Version 3.10.0

What's Changed

New features

Improvements

Deprecations

New Contributors

Contributors

Version 3.9.0

What's Changed

New features

Bugfixes & improvements

Documentation improvements

Improvements for developers

New Contributors

Contributors

Version 3.8.1

Version 3.7.1