Show HN: Voice-Pro – AI Voice Cloning

abuskorea | github.com
271 points | 189 comments

Voice-Pro is a Python-based tool that lets users generate and manipulate audio files using text-to-speech and voice cloning. It offers a user-friendly interface for creating speech from text, modifying existing audio, and experimenting with different voice styles.

Imagine creating a podcast where Mark Zuckerberg interviews Elon Musk, using their actual voices.

What sounds like science fiction is now reality.

Voice-Pro is an open-source Gradio WebUI that breaks the boundaries of audio manipulation.

Powered by Whisper engines for speech recognition and F5-TTS for zero-shot cloning, this tool turns voice replication into child's play.

Key Features:

- Zero-shot Voice Cloning

- Voice Changer with 50+ Celebrity Voices

- YouTube Audio Downloading

- Vocal Isolation

- Multi-Language Text-to-Speech (Edge-TTS, F5-TTS)

- Multi-Language Translation

- Powered by Whisper Engines (Whisper, Faster-Whisper, Whisper-Timestamped)

Video Demos:

1. Voice-Pro Usage Tutorial: https://youtu.be/z8g8LMhoh_o

2. Voice Cloning Celebrity Podcast Demo: https://youtu.be/Wfo7vQCD4no

3. Full Demo Playlist: https://www.youtube.com/playlist?list=PLwx5dnMDVC9Y7dAjm9r26...

Whether you're a content creator, developer, or audio experimentation enthusiast, Voice-Pro provides a user-friendly interface to push the boundaries of audio manipulation.

GitHub: https://github.com/abus-aikorea/voice-pro

GitHub - abus-aikorea/voice-pro: Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.