WhisperX Versions Save

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

v3.1.1

11 months ago
  • translate functionality added
  • fix word timestamp bug (words no longer have consecutive timestamps)

v3.1.0

1 year ago
  • 70x real time transcription, <8GB gpu memory requirement ⚡️⚡️
  • each transcript segment is a sentence (using nltk.sent_tokenize)
  • diarization now assigned per sentence (and outputted to srt)
  • clean up on alignment logic

v3.0.2

1 year ago

torch2.0, python3.10

v3.0.1

1 year ago
  • fix pickling error (set num_workers=0) to perform dataloading iwth main process
  • add basic diarization
  • pad language detection if less than 30s

v3.0.0

1 year ago

batched inference with faster-whisper backend

v2.0.1

1 year ago

v1.0.0

1 year ago