SpeechRecognition v3.8.1 Release Notes
Release Date: 2017-12-05 // about 8 years ago-
โฌ๏ธ Lots of changes since June! Summary below. Get all of these and more with a quick
pip install --upgrade SpeechRecognition.- ๐ Snowboy hotwords support for highly efficient, performant listening (thanks @beeedy!). This is implemented as the
snowboy_configurationparameter ofrecognizer_instance.listen. - ๐ง Configurable Pocketsphinx models - you can now specify your own acoustic parameters, language model, and phoneme dictionary, using the
languageparameter ofrecognizer_instance.recognize_sphinx(thanks @frawau!). audio_data_instance.get_segment(start_ms=None, end_ms=None)is a new method that can be called on any AudioData instance to get a segment of the audio starting atstart_msand ending atend_ms. This is really useful when you want to get, say, only the first five seconds of some audio.- The
stopperfunction returned bylisten_in_backgroundnow accepts one parameter,wait_for_stop(defaulting toTruefor backwards compatibility), which determines whether the function will wait for the background thread to fully shutdown before returning. One advantage is that ifwait_for_stopisFalse, you can call thestopperfunction from any thread! - New example, demonstrating how to simultaneously listen to and recognize speech with the threaded producer/consumer pattern: threaded_workers.py.
- ๐ Various improvements and bugfixes:
- Python 3 style type annotations in library documentation.
recognize_google_cloudnow uses the v1 rather than the beta API (thanks @oort7!).recognize_google_cloudnow returns timestamp info when theshow_allparameter isTrue.recognize_bingwon't time out as often on credential requests, due to a longer default timeout.recognize_google_cloudtimeouts respectrecognizer_instance.operation_timeoutnow (thanks @reefactor!).- Any recognizers using FLAC audio were broken inside Linux on Docker - this is now fixed (thanks @reefactor!).
- Various documentation and lint fixes (thanks @josh-hernandez-exe!).
- Lots of small build system improvements.
- ๐ Snowboy hotwords support for highly efficient, performant listening (thanks @beeedy!). This is implemented as the
Previous changes from v3.7.1
-
โฌ๏ธ As usual, get it with
pip install --upgrade SpeechRecognition- ๐ New
grammarparameter forrecognizer_instance.recognize_sphinx- now, you can specify a JSGF or FSG grammar to PocketSphinx (thanks @aleneum!). - โก๏ธ Update PyAudio to version 0.2.11 - this fixes a couple memory management issues users have been experiencing.
- โก๏ธ Update FLAC to 1.3.2 on all platforms - this will make it easier to support more audio formats in the near future.
- ๐ Fixes for various APIs on Python 3.6+ - small changes in
urllib.requestbehavior made requests fail in certain situations. - ๐ Fixes for Bing Speech API timing out due to some backwards incompatible changes to their API.
- โช Restore original IBM audio segmentation behaviour - previously, it would stop recognizing after the first pause. Now, it will recognize all speech in the input audio, as it did before IBM's changes.
- ๐ Fix links in PocketSphinx docs and library reference. Add-on language models now available from Google Drive, including the now-officially-supported Italian model.
- ๐ New troubleshooting entries for JACK server in README.
- ๐ Documentation and build process updates.
- ๐ New