Whishper Save Abandoned

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Project README

whishper banner

Whishper

Whishper (formerly known as Web Whisper Plus) is a complete transcription suite. In simple words, it is a frontend for the Whisper model family, but with batteries included!

Show me the screenshots!

Self-hosting docs

Features

  • Transcribe any media to text: audio, video, etc.
    • Transcribe from URLs (any source supported by yt-dlp).
    • Upload a file to transcribe.
  • Download transcriptions in many formats: TXT, JSON, VTT, SRT or copy the raw text to your clipboard.
  • Translate your transcriptions to any language supported by Libretranslate.
  • Edit your subtitles in a comfy and complete web UI!
    • Transcription highlighting based on media position.
    • CPS (Characters per second) warnings.
    • Segment splitting.
    • Segment insertion.
    • Subtitle language selection.
  • 100% Local: transcription, translation and subtitle edition happen 100% on your machine (can even work offline!).
  • Fast: uses FasterWhisper as the Whisper backend: get much faster transcription times on CPU!
  • Quick setup: the setup involves just a docker-compose and you're good to go!

Roadmap

  • Full-text search all transcriptions
  • Support for GPU acceleration.
  • Audio recording from the browser.
  • Can we do something with seamless_communication?

Self hosting

Check out the self-hosting documentation here.

Project structure

Whishper is a collection of pieces that work together. The three main pieces are:

  • Transcription-API: This is the API that enables running Faster-Whisper. You can find it in the transcription-api folder.
  • Whishper-Backend: This is the backend that coordinates frontend calls, database, and tasks. You can find it in backend folder.
  • Whishper-Frontend: This is the frontend (web UI) of the application. You can find it in frontend folder.
  • Translation (3rd party): This is the libretranslate container that is used for translating subtitles.
  • MongoDB (3rd party): This is the database that stores all the information about your transcriptions.
  • Nginx (3rd party): This is the proxy that allows running everything from a single domain.

Contributing

Contributions are welcome! Feel free to open a PR with your changes, or take a look at the issues to see if there is something you can help with.

Development setup

Check out the development documentation here.

Screenshots

These screenshots are available on the official website, click any of the following links to see:

Credits

Open Source Agenda is not affiliated with "Whishper" Project. README Source: pluja/whishper
Stars
217
Open Issues
13
Last Commit
6 months ago
License

Open Source Agenda Badge

Open Source Agenda Rating