Speech recognition module for Python, supporting several engines and APIs, online and offline.
This is more of a maintenance release, but a few features slipped in as well:
recognizer_instance.recognize_google_cloud (thanks @Thynix!), plus documentation and examples.
speech_recognition.Microphone - this should fully resolve all the "Invalid sample rate" issues from PyAudio.
recognizer_instance.recognize_sphinx.
EOFError upon encountering malformed audio files; a proper exception message is now given.
recognizer_instance.recognize_houndify (thanks @tb0hdan!).
recognize_sphinx now supports keyword-based matching via the keywords=[("cat", 30), ("potato", 45)] parameter.
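To illustrate the malformed-audio note above: with Python's standard wave module, truncated input tends to surface as a bare EOFError, which says little about what went wrong. The sketch below is not this library's code. It is a minimal illustration, using only the standard library, of catching the low-level errors and re-raising them with a descriptive message. The helper names open_wav_or_explain and make_silent_wav are hypothetical.

```python
import io
import wave


def open_wav_or_explain(data: bytes) -> wave.Wave_read:
    """Open in-memory WAV data, turning low-level parse errors into a clear message.

    Hypothetical helper for illustration; not part of any library.
    """
    try:
        return wave.open(io.BytesIO(data), "rb")
    except (wave.Error, EOFError) as exc:
        # Truncated input surfaces as EOFError; a bad header as wave.Error.
        raise ValueError(
            "audio data could not be read as PCM WAV; "
            "check whether the file is corrupted or in another format"
        ) from exc


def make_silent_wav(num_frames: int = 160) -> bytes:
    """Build a tiny valid mono 16-bit WAV in memory for demonstration."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as writer:
        writer.setnchannels(1)
        writer.setsampwidth(2)
        writer.setframerate(16000)
        writer.writeframes(b"\x00\x00" * num_frames)
    return buffer.getvalue()
```

Calling open_wav_or_explain(b"") raises ValueError with the readable message instead of a bare EOFError, while valid data opens normally.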
recognize_api function will keep working if you're on a paid API.AI plan, and we will not be removing it until the service is shut down entirely.
phrase_time_limit option for listening functions, to limit phrase lengths to a certain number of seconds.
recognizer_instance.operation_timeout - this can be used to ensure long requests always take finite time.
recognize_ibm now opts out of request logging by default, for improved user privacy (thanks @michellemorales!). This is a breaking change if you previously relied on request logging behaviour.
listen() sometimes didn't terminate on finite-length streams.
Bugfix release.
Changes:
sessionId field, so we'll just add that in (thanks @jhoelzl!).
Changes:
Maintenance release:
recognizer_instance.recognize_bing.
Bugfix release:
We're now GPG signing all our release tags. Under the releases page, you should see the following:
This tells you that GitHub thinks the Git tag is the same as the one we intended to release.
This key can also be found on the SKS keyservers, and you can import it with the following command:
gpg --keyserver x-hkp://pool.sks-keyservers.net --recv-keys 0x5F56B350
The packages on PyPI are signed as well - the signature can be downloaded under the "pgp" link on the SpeechRecognition PyPI page.
Quick bugfix release on the heels of yesterday's big one:
monotonic library on Python 2 - if you have monotonic installed in Python 2, recognize_bing will work faster!
recognize_bing already does the things that would make it fast, so the library is unnecessary.
Changes:
recognize_att function will keep working, until the API itself is shut down.
recognize_att to a different service like recognize_ibm, then generate new API keys/tokens for it.
WavFile has been renamed to AudioFile.
WavFile will continue to work for the foreseeable future. New code should use AudioFile.
AudioFile is the same as WavFile, but in addition to WAV, it also supports AIFF and FLAC files!
recognize_api in the library reference.
recognize_bing in the library reference.
recognize_ibm, courtesy of Bhavik Shah from IBM.
As always, you can upgrade with pip install --upgrade speechrecognition.
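For readers wondering how a rename like WavFile to AudioFile can stay backwards compatible, here is one common pattern, sketched under assumptions. The two classes below are simplified stand-ins, not the library's real implementation. The deprecation warning is also an assumption: these notes say only that the old name keeps working.

```python
import warnings


class AudioFile:
    """Stand-in for an audio source class; it only stores the path here."""

    def __init__(self, filename: str) -> None:
        self.filename = filename


class WavFile(AudioFile):
    """Old name kept as a thin subclass so existing code keeps working."""

    def __init__(self, filename: str) -> None:
        # Assumption for illustration: nudge callers toward the new name.
        warnings.warn(
            "WavFile has been renamed to AudioFile",
            DeprecationWarning,
            stacklevel=2,
        )
        super().__init__(filename)
```

With this shape, old code that constructs WavFile("speech.wav") still gets an object that passes isinstance checks against AudioFile, and new code can use AudioFile directly.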
Changes:
Bugfix release!
exception_on_overflow shenanigans. This version will eliminate those pesky ValueErrors.
Special thanks to @michaelpri10 for reporting the exception_on_overflow bug.