Pandrator Versions Save

Pandrator aspires to be a user-friendly app with a graphical interface and a one-click installer that creates high-quality speech from text in multiple languages (audiobooks, speech synchronised with subtitles and more) using local models (XTTS, Silero or VoiceCraft), plus voice cloning, LLM pre-processing, RVC enhancement, and automatic evaluation

v.0.0.7

1 month ago

This release addresses primarily VoiceCraft and its recent updates. It adds VoiceCraft model selection to the GUI as well as advanced generation settings. The selected model will be downloaded automatically.

The .exe one-click installer files were created using pyinstaller from pandrator-start-minimal_xtts.py, pandrator_start_minimal_silero.py and pandrator_start_minimal_voicecraft.py in the repository. Please remember to run them as an administrator if you want them to install git, ffmpeg and/or calibre.

It's possible that your antivirus software flags them as a threat. In that case, add them as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.

v.0.0.6

1 month ago

New features:

  • Support for EPUB files using ebook-convert from Calibre.

Improvements:

  • Enhanced sentence splitting logic.

The .exe one-click installer files were created using pyinstaller from pandrator-start-minimal_xtts.py, pandrator_start_minimal_silero.py and pandrator_start_minimal_voicecraft.py in the repository. Please remember to run them as an administrator if you want them to install git, ffmpeg and/or calibre.

It's possible that your antivirus software flags them as a threat. In that case, add them as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.

v0.0.5

2 months ago

New features:

  • The ability to use PDF files as input (they are converted to .txt, and before the final conversion happens you can see a preview, enable or disable paragraph retention and edit the text). It is not perfect and uses a relatively simple conversion method, so results may vary depending on the layout complexity of the input PDF and other factors. You may use the LLM workflow to try and remove OCR artifacts/misspelled words etc. I'm looking for a better conversion method, and if you have any suggestions, please let me know.
  • The option to select an external (remote) XTTS server, for example hosted on a service like RunPod or a Google Colab like this one created by the author of XTTS Api Server.

Fixes:

  • Corrected dependencies (ffmpeg-python).
  • Improved lowering of the original track's volume during subtitle speech segments when mixing the synchronized audio output with a video track.
  • Minor UI improvements.

The .exe files were created using pyinstaller from pandrator-start-minimal_xtts.py, pandrator_start_minimal_silero.py and pandrator_start_minimal_voicecraft.py in the repository. Please remember to run them as an administrator.

It's possible that your antivirus software flags them as a threat. In that case, add them as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.

v0.0.4

2 months ago
  • Added the VoiceCraft model through the VoiceCraft API Server I made for this purpose. You can install Pandrator with VoiceCraft using the pandrator_start_minimal_voicecraft.exe.
  • It is now possible to generate speech from an .srt file and automatically mix it with a video's sound track.

The .exe files were created using pyinstaller from pandrator-start-minimal_xtts.py, pandrator_start_minimal_silero.py and pandrator_start_minimal_voicecraft.py in the repository. Please remember to run them as an administrator.

It's possible that your antivirus software flags them as a threat. In that case, add them as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.

v0.0.3

2 months ago

I added processing of .srt subtitle files. If a subtitle file is chosen, the output will be synchronized to its timestamps, effectively creating dubbing that you can, for example, mix with a video's sound track. Additionally, you may now choose the TTS engine - XTTS or Silero. Silero produces worse results (though still decent), but can run on a cpu, so it is recommended for lower-end systems. There are two executables now - one installs (and later runs) Pandrator with the XTTS Api Server, the other with the Silero API server. You may of course install another TTS server manually later. The .exe files were created using pyinstaller from pandrator-start-minimal_xtts.py and pandrator_start_minimal_silero.py in the repository. Please remember to run them as an administrator. It's possible that your antivirus software flags them as a threat. In that case, add them as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.

v0.0.2

2 months ago

I added the option to edit sentences from the GUI, addressed some playback issues and added a few tooltips explaining several options in the GUI. The .exe file was created using pyinstaller from pandrator-start-minimal.py in the repository. Please remember to run it as an administrator. It's possible that your antivirus software flags it as a threat. In that case, add it as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.

v0.0.1

2 months ago

This is the initial release. The .exe was created using pyinstaller from pandrator-start-minimal.py in the repository. Please remember to run it as an administrator. It's possible that your antivirus software flags it as a threat. In that case, add it as an exception. If you're not comfortable doing that, review the code in the repository and perform a manual installation.