- Joined
- Jan 12, 2022
After reading the Elevenlabs.io thread people were posting songs that were ran through an AI to have Kanye West and a few other singers sing different songs
I'll add more later
Which made me think "How can I make my own?" It's actually very simple and fun once you get it all working.
To just jump straight into it you can use the Google Colab link here (Requires Google account) which has nearly everything someone would need to do get Kanye West AI singing.
But if you don't want to use Google Colabs and want to host locally:
Install https://github.com/34j/so-vits-svc-fork and other prerequisites. Instructions to get this working are on the Github. This has a little bit more extra than the Google Colab version.
Thanks to @Colon capital V for suggesting Ultimate Vocal Remover this tool is great but not perfect at separating the vocals from songs, it does occasionally leave some of the instrumentals which is a little annoying but also funny listening to the AI trying to guess how to vocalize these errors. It is highly recommended to use something like this
The hardest part was finding models to use. Hugging Face has a few (Biden, Trump and Obama | GLaDOS, Bob Odenkirk and a few more | Cartman, David Bowie and a few more | Kanye West (Mega link) ). I do not know how to train models or even really where to start to train them so I can't really help there.
The left side is the most important. This is how I have mine currently set up but ignore the Pitch slider that needs to be tweaked depending on the source audio but if you don't want to fuck around with that just tick "Auto predict F0" doesn't give the best result but it's good enough to get something that might be worth using. I have no idea what the other sliders do so don't ask me.

I recommend having "Auto play" off and just placing the output audio into Audacity(Or something similar) along with the instrumentals. If you use the instrumental output from Ultimate Vocal Remover it will be perfectly synced and ready to export. If you don't disable the auto play feature you will be forced to listen to the output without a way of stopping and the audio can get really distorted which can cause a painful high pitch.
This is what I have done so far. Some of them are really bad but it's something
If there is something I missed and there are quite a few people having the same issues. I'll have to assist and then fix the OP.
Also just for the sake of sources label your shit properly so others can find the original source etc. etc. etc
Which made me think "How can I make my own?" It's actually very simple and fun once you get it all working.
To just jump straight into it you can use the Google Colab link here (Requires Google account) which has nearly everything someone would need to do get Kanye West AI singing.
But if you don't want to use Google Colabs and want to host locally:
Install https://github.com/34j/so-vits-svc-fork and other prerequisites. Instructions to get this working are on the Github. This has a little bit more extra than the Google Colab version.
Thanks to @Colon capital V for suggesting Ultimate Vocal Remover this tool is great but not perfect at separating the vocals from songs, it does occasionally leave some of the instrumentals which is a little annoying but also funny listening to the AI trying to guess how to vocalize these errors. It is highly recommended to use something like this
The hardest part was finding models to use. Hugging Face has a few (Biden, Trump and Obama | GLaDOS, Bob Odenkirk and a few more | Cartman, David Bowie and a few more | Kanye West (Mega link) ). I do not know how to train models or even really where to start to train them so I can't really help there.
The left side is the most important. This is how I have mine currently set up but ignore the Pitch slider that needs to be tweaked depending on the source audio but if you don't want to fuck around with that just tick "Auto predict F0" doesn't give the best result but it's good enough to get something that might be worth using. I have no idea what the other sliders do so don't ask me.

I recommend having "Auto play" off and just placing the output audio into Audacity(Or something similar) along with the instrumentals. If you use the instrumental output from Ultimate Vocal Remover it will be perfectly synced and ready to export. If you don't disable the auto play feature you will be forced to listen to the output without a way of stopping and the audio can get really distorted which can cause a painful high pitch.
This is what I have done so far. Some of them are really bad but it's something
Shoop Shoop Song - Cher (Not the best example)
The Gambler - Kenny Rogers
I think I'm going to Kill Myself - Elton John
Sultans of Swing - Dire Straits
SMB3 Mastubatory Madness - I don't know I got it off youtube
Piano Man - Billy Joel
Now You're a Man - Orgazmo
Pomfpomfpomf =3
The Gambler - Kenny Rogers
I think I'm going to Kill Myself - Elton John
Sultans of Swing - Dire Straits
SMB3 Mastubatory Madness - I don't know I got it off youtube
Piano Man - Billy Joel
Now You're a Man - Orgazmo
Pomfpomfpomf =3
America Fuck Yeah! - Team America World Police
The Gambler - Kenny Rogers
I Think I'm Going to Kill Myself - Elton John
Sultans of Swing - Dire Straits
Pomfpomfpomf =3
The Gambler - Kenny Rogers
I Think I'm Going to Kill Myself - Elton John
Sultans of Swing - Dire Straits
Pomfpomfpomf =3
America Fuck Yeah - Team America World Police
Sultans of Swing - Dire Straits
Sultans of Swing - Dire Straits
I think I'm Going to Kill Myself - Elton John
The Gambler - Kenny Rogers
If there is something I missed and there are quite a few people having the same issues. I'll have to assist and then fix the OP.
Also just for the sake of sources label your shit properly so others can find the original source etc. etc. etc
Last edited: