Story

Show HN: Voice-Pro – AI Voice Cloning

abuskorea Thursday, November 28, 2024

Imagine creating a podcast where Mark Zuckerberg interviews Elon Musk – using their actual voices?

What sounds like science fiction is now reality.

Voice-Pro is an open-source Gradio WebUI that breaks the boundaries of audio manipulation.

Powered by cutting-edge Whisper engines, this tool turns voice replication into child's play.

Key Features:

- Zero-shot Voice Cloning

- Voice Changer with 50+ Celebrity Voices

- YouTube Audio Downloading

- Vocal Isolation

- Multi-Language Text-to-Speech (Edge-TTS, F5-TTS)

- Multi-Language Translation

- Powered by Whisper Engines (Whisper, Faster-Whisper, Whisper-Timestamped)

Video Demos:

1. Voice-Pro Usage Tutorial: https://youtu.be/z8g8LMhoh_o

2. Voice Cloning Celebrity Podcast Demo: https://youtu.be/Wfo7vQCD4no

3. Full Demo Playlist: https://www.youtube.com/playlist?list=PLwx5dnMDVC9Y7dAjm9r26...

Whether you're a content creator, developer, or audio experiment enthusiast,

Voice-Pro provides a user-friendly interface to push the boundaries of audio manipulation.

GitHub: https://github.com/abus-aikorea/voice-pro

Summary
The linked article is about a voice recognition and generation system called "Voice-Pro" developed by ABUS (AI Korea). The system aims to provide high-quality voice synthesis and speech recognition capabilities, catering to various applications such as chatbots, virtual assistants, and audio content creation. The article provides an overview of the system's features, including its natural language understanding, text-to-speech, and voice conversion capabilities, as well as its ability to handle multiple languages. The repository on GitHub offers access to the project's source code and documentation, allowing developers to explore and utilize the Voice-Pro system in their own projects.
270 187
Summary
github.com
Visit article Read on Hacker News Comments 187