Streaming transcriber with whisper
--frame
option to change the number of minimum frames of mel spectrogram input for Whisper (default: 3000. i.e. 30 seconds)data_type
in class Context
for websocket (#36)--no-vad
option and add --vad
option to set threshold (86f38c6ca91a2bc9ff54836906f734ea7ceae502)--allow-padding
and add --max_nospeech_skip
option (Resolve #13)None
Set the tag to install easily the release version like this.
pip install -U git+https://github.com/shirayu/[email protected]