Show HN: Voice-Pro – AI Voice Cloning

github.com

270 points · abuskorea · 9 days ago

Imagine creating a podcast where Mark Zuckerberg interviews Elon Musk – using their actual voices?

What sounds like science fiction is now reality.

Voice-Pro is an open-source Gradio WebUI that breaks the boundaries of audio manipulation.

Powered by cutting-edge Whisper engines, this tool turns voice replication into child's play.

Key Features:

- Zero-shot Voice Cloning

- Voice Changer with 50+ Celebrity Voices

- YouTube Audio Downloading

- Vocal Isolation

- Multi-Language Text-to-Speech (Edge-TTS, F5-TTS)

- Multi-Language Translation

- Powered by Whisper Engines (Whisper, Faster-Whisper, Whisper-Timestamped)

Video Demos:

1. Voice-Pro Usage Tutorial: https://youtu.be/z8g8LMhoh_o

2. Voice Cloning Celebrity Podcast Demo: https://youtu.be/Wfo7vQCD4no

3. Full Demo Playlist: https://www.youtube.com/playlist?list=PLwx5dnMDVC9Y7dAjm9r26...

Whether you're a content creator, developer, or audio experiment enthusiast,

Voice-Pro provides a user-friendly interface to push the boundaries of audio manipulation.

GitHub: https://github.com/abus-aikorea/voice-pro


189 comments
vunderba · 9 days ago
I do think that voice cloning for personal usage has actual genuine uses - in fact there was a relatively interesting news article about a person who was irrevocably losing their voice who had their vocal pattern cloned.

https://www.voanews.com/a/illness-took-away-her-voice-ai-cre...

That being said, it does seem a bit bizarre that the repo's home page is proudly trumpeting the ability to co-opt other people's identities without their permission (and yes your unique vocal pattern is definitely part of your identity - I mean it's used in some forms of biometric data). They're doing the project a bit of a disservice.

Show replies

bguberfain · 8 days ago
Thanks for sharing this! But I have some doubts about hidden installation procedures. It imports all functions from one_click (from one_click import *), which points to a compiled file. It then runs functions like install_webui and install_extra_packages. At least suspicious.

Show replies

shannifin · 9 days ago
I don't have much real use for celebrity voices (other than fun experimentation), but I'd love to be able to clone my own voice and character voices for the purposes of creating audiobooks / audioplays without having to pay monthly fees with monthly usage limits. So I'm excited by this sort of project!

P.S. Are there any tools for synthetic voice creation? Maybe melding two or more voices together, or just exploring latent space? Would be fun for character creation to create completely new voices.

Show replies

deskr · 9 days ago
Isn't it funny how some text changes the voice in your head? Now you're hearing the best voice. It's amazing. I tell you. It's the greatest voice. Everybody’s talking about it. They are saying it's incredible. They say they've never heard as beautiful a voice before.

Show replies

wutwutwat · 8 days ago
> Windows Defender may give a warning about untrusted application and disallow further execution of Voice-Pro. If SmartScreen security level is set to "Warn", just click "More info" and then click "Run anyway". If SmartScreen is set to level "Block" there will be no button to run the installation. In this case, open the properties of the start.bat file, and check "Unblock", apply the change and run the start.bat again.

https://github.com/abus-aikorea/voice-pro?tab=readme-ov-file...

hard pass and anyone who reads this and continues is bonkers

Show replies