Optimizing Settings in AI Voice Changer Client
HTML-код
- Опубликовано: 24 июн 2023
- Links referenced in the video:
w-okada's github repo: github.com/w-okada/voice-changer
AI Voice Changer Client Installation: • Realtime AI Voice Chan...
Get it working in Discord: • How to Connect AI Voic...
Hardware for my PC:
Graphics Card - amzn.to/3pcREux
CPU - amzn.to/43O66Ir
Cooler - amzn.to/3p98TwX
RAM - amzn.to/3NBAsIq
SSD Storage - amzn.to/42NgMFR
Power Supply (PSU) - amzn.to/3NBAsIq
PC Case - amzn.to/447499T
Mother Board - amzn.to/3CziMXI
Alternative prebuilds to my PC:
Corsair Vengeance i7400 - amzn.to/3p64r22
MSI MPG Velox - amzn.to/42MnJHl
Cheapest PC recommended:
Cyberpower 3060 - amzn.to/3XjtZoP
Come join The Learning Journey!
Discord - / discord
Github - github.com/JarodMica
TikTok - / jarodsjourney
If you found anything helpful, please consider supporting me and the content I am trying to produce!
www.buymeacoffee.com/jarodsjo... Наука
Great video. thank you
I'm one of the VCClient and RVC contributors. There are some additions to the content of the video.
Regarding the difference between the f0 estimator harvest and crepe, in addition to the sound quality, harvest uses a CPU and crepe uses a GPU. Crepe can improve latency if you have a good GPU.
In sever mode you can choose the sound driver. VCClient measures latency within VCClient, but additional latency is added when connecting to other devices.
Besides MME, WASAPI and ASIO can be selected, so if you can use them, I recommend using them.
For the protect item in advanced options, if protect is set to less than 0.5, the ratio of retrieved features will be reduced in cases where f0 estimation is unsuccessful (silence or breath sounds).
I've seen, appreciate your work and thank you for the additional information!
Would be better if next to every option will be ? icon when on hover you will see popup with explanation. It will help a lot.
Yo app just mines BTC stop the cap.
@@meoqtx proof?
@@paradym777 My Premium Kaspersky version 😀
From a musician experience: if you have ASIO supporting soundcard - use ASIO instead of MME. It decreases the audio delay provided by audio tract (e.g. on my PC guitar/mic recording delay for 1024 samples chunk is 180ms for standard MME, and 14ms for ASIO). Theoretically WASAPI can also work fast however I don't have WASAPI supported hardware.
It stutters for me when use my asio soundcard or just doesnt work at all
@@realxdey same here
Where do I change the settings for asio?
ASIO4All an any other ASIO doesnt work.
Thanks for all the help! You have responded to all of the comments and provided everything Ive needed. Anyways keep up the great work and keep doing what you are doing 👍
I'm so lost. My window is just empty when I click on the native client exe...
Thank you so much for making this video. I have a RTX 3090 and was wondering why no matter how much I messed with the Extra and Chunk settings, the voice still sounded distorted, but it was probably because I had the pitch too high where the voice probably wasn’t trained (I was using above 20 pitch) I didn’t know that most voices are trained no higher than 12. I’ll mess around more with this program tomorrow with what I learned from this video.
Tech is getting so cool, great video!
Great video as always glad you went through everything with a good explanation for everything! Keep up the great work, and i am excited for what will come after RVC!
This worked so great on my first Voice Changing experience. Other videos on your channel are also great! Thank you very much.
hearing senchou speaks english just feels weird, not in a bad way it's just feels like im hearing something im not supposed to hear in my entire life
Great video! I'm glad you showed what this program is capable of on a 4090. It seems we're not quite there yet with AI voices. I wonder if this is a small hurdle that will be overcome soon or a insurmountable mountain like hands are to AI art.
Ah, actually some AI voice models that I've tried are actually pretty scary accurate, meaning my models need a bit more training lol. I would say we're not that far away from indistinguishable voices
i literally just upgraded from my 1050ti to a 4070 today just to use this + other AI tools. love these tutorials
GZ! Very solid card right there :D
i'm probably going to buy an rtx 4090 for AI stuff too. otherwise i'd buy a 7900xtx. it costs half as much, has the same amount of vram and the performance of a 4080. too bad amd sucks with these things.
man im using intel HD card 💀, will it work?💀
I'm still using rtx 2060
I'm using 920mx😀
Great! can't wait to try this out to see if it improves performance. If it doesn't I might just install windows 11 to match your settings exactly.
Thanks, I really need to try this one.
you're a legend!! insane video quality and tutorial, can't believe i found pure gold at 4 am.
i guess youtube can also be a chad and recommend really good content wow
same...
If anyone has issues exporting an ONNX file and getting an error message in the GUI (it usually just says error message: no error message), but if you check in the console it says that pytorch has tried to allocate VRAM and has failed. A quick workaround for this that worked for me was changing in the GUI to use my CPU instead of my GPU and then exporting the ONNX worked. Afterwards you can change it back to your GPU.
Smart, didn't even cross my mind xd
i still get an error in the GUI..
if it helps,this is what is shown on the console : "[Voice Changer] get_onnx ex: Can't get source for . TorchScript requires source access in order to carry out compilation, make sure original .py files are available."
yooooooooo thank you so much
What do you mean changing the gui?
@@Nebularban changing the setting in the GUI
Damnn, the intro transition was goood
Hear Botan/Marine speaking clear English is kind of weird&awesome at the same time XD
great video! keep it up!
Great video !
Thank you Jard!!!
you saved my day !
Very helpful 👌 . Just where can you get models?
Using a RTX 3070 with:
Chunk 256
Extra 131072
Sounds perfect on these even with the half second delay!
Bro. You are too good . A wizard
😍 Thank You
Hi im the one on your discord server who created a ticket about the gl not loading and you said to change the chunk to max, i just switched the audio to server and it worked perfectly for me, i guess theres a weird thing going on with client option. Just wanted to put it out there
Its cool bro, As expected I want to prank my virtual friends using a voice changer😂😂
I subscribe you channel😂😂
I hope they will make it in VST3 format so i can just put it on the daw track which my microphone is routed through. it would be so amazing holyy
Considering how calculation heavy this thing is it's unlikely.
I might try to make a VST3 client for this stuff.. but I couldn't find any API description on their page.
why cant you just use VAC and use VAC Line in as your input mic in your DAW?
@@PsychicType hmm ill try that ty!!!
Do you think we could get a download for the Marine voice?
I've found using this in games causes your mic to cut out and stutter, as well as spiking CPU usage. I hope in the near future it's even more optimized ^^
It's to be expected. The results outside of games are when it's able to fully utilize your GPU.
While playing a game, the game is eating some amount of resources. Ecspecially so if the game isn't able to run at a steady framerate, which very well causes the voice client to have unreliable processing times.
Have you figure a solution for that yet? Bc recording or even ingame mic check, it sounds fine. But people ingame hear it stutter alot
I tried this tool out. Honestly... I can see this kind of tool being put on a list of banned software at some point in the future. I gave it a go mostly to see if the DirectML implementation works on intel Arc and... well it didn't. Which is fair. I have a 1080ti as well so that's no biggy.
Ah, I think that's a slippery slope. If it does get banned, that leaves only bad actors to use it as nothing will ever stop them lol.
It's still hardware intensive atm so newer Nvidia are still needed
I find that the DirectML version (AMD) tends to randomly stop working and sometimes my GPU driver would crash, so that could use some more work to make it stable. And thanks for this guide! I barely know what each option really means.
Wait is your gpu recognized in the software?
@@jesuisunroi5543 It is, it's an RX 6900XT.
how do u get it to use ur gpu mine only uses the cpu even on the directml version 😭😭
@DesuVR wtf my rx 6800xt is not how did you do 💀
@@jesuisunroi5543 Honestly, I don't know if it's even using the GPU at all, I get ~50-80% CPU usage when RVC is active. I thought you meant if the software worked at all on AMD, my mistake. ^^;
However, I got it running okay-enough on my 5800X3D with these settings; INDEX: 0, F0 Det.: harvest, S.Thresh.: 0, CHUNK: 320, EXTRA: 4096
thx now my voice changer finally works good
best youtuber i downaldo it🤩
Praying for the day it works on AMD, i'm currently stuck with a 5700XT.
it feels like the voice changer is picking up too much background noises and saying things I didnt say is that normal?
this is where my vtuber career starts
1650 Super and even 1060 are doing surprisingly fine using this tool.
Awesome to hear! I had great success on my 2070 super so I'm glad to hear the pascal cards are still chugging along
hello mind telling me what settings you are using? i have a 1660 super and its super laggy nothing is understandable
@@raidentatsunoko same, i have 1660, but voice very bad and laggy
@@soluckymoon sad, still same to me.
@@raidentatsunoko have youn found a fix?
i seem to have a problem with delay, theres a long delay before the voice starts working, is there a way i can fix it?
thank you
because the RES comes out with 8000 up to 29000 you can lower it because it takes a long time to load the voice and sometimes you don't even hear anything
please do you have a fix for:
[Voice Changer] Pipeline is not initialized.
[Voice Changer] Waiting generate pipeline...
pls it would be much appreciated
Great instructions! I saw you don't have any of the original models that come with the software loaded up anymore, how did you delete those? I can't seem to figure out how to get rid of them lol
You can just overwrite them by uploading new ones :)!
@@Jarods_Journey Gotcha, it didn't seem to work when I did that so maybe I'll try reinstalling or using an older version. I downloaded whatever the most recent version was as of yesterday (7/10/23) and the UI looked a bit different than yours too so maybe some changes the dev made broke overwriting. Thanks man, appreciate you!
How can you find free rvc models made from other people? Love your videos.
@Jarods Journey do you have any resources for voice models, or do you train your own? All of your voice models sound much better than what I could find online. What settings do you use to train your models at?
I train all of my models using RVC v2. No real special settings, I just clean and curate my data so it's crystal clear input audio.
@@Jarods_Journey Trained a couple models already, came out great. Thanks for all the help between comments and videos.
HOW DO U CLEAN DATA@@Jarods_Journey
For High cpu usage problems....
1) Buy $10k PC setup. jk
2. Set index to 0. Even setting it to Index: 0.1 maxes out CPU
3) Set Extra to a lower amount. Higher Extra uses more CPU. Lower Extra uses more GPU. ( use 16384, any lower and doesn't decreases cpu usage, any higher and cpu usage doubles.
4) Use crepe_full (uses most gpu)
4) S.R. to 48000. (dont go beyond as echo forms.)
this works for amd gpu, i was getting crazy delays. followed this settings and now its only 2 seconds delay. thank you!
thx bro now im only getting a 1/2 second delay on amd gpu
question, how to train? what does train do?
i followed all previous tutorial and the voice output only sounds distorted repetitive. im using ryzen 3600x with gtx 1050ti 16gb ram.
If you've already trained, you might need better audio samples, more samples, or adjust settings to better fit your system via chunks/extra
fun thing to test
my voice echoes and i can hear what i say again 2-3 times any suggestions?
Nice
Yea my friends also wanted to try this out and it was hilarious to see that the GDrive link had too much traffic 💀
Thanks so much for explaining this! What parts of my computer would I need to upgrade to make this go faster?
GPU 100%
Just a quick question, the voice changer works perfectly but it always cuts off at the end of the sentence. Any clues why?
+1
same
Omg i have never heard an english fluent Suisei before
Im having issues where everything comes out in short breaths or stutters, barely comprehensible, I have pretty good specs and I tinkered around with as many settings as I could but I always get the same result, how do I fix this??
Same
do you have any idea what happened? The voice changer was running pretty well a couple weeks ago but it looks like they had an update and now all the voices seem slurred and choppy, it was perfectly fine a couple weeks ago and I've made no changes to the settings, i don't know what happened, it affected all my models
Not sure, this would only have happened if you had installed a newer version of it. I would recommend just use and stay with the version that worked for you previously as there seems to be issues using other versions
@@Jarods_Journey I’ll try and play around with it, does it remove your models if you install it again?
@@kongk5772 yup, you gotta start from scratch
@@Jarods_Journey so far I've redownloaded everything, and everything is up to date, I optimized my settings the same as the older versions but for some reason, the millisecond per response keeps stacking, all the way up to 30k ms per response all the while it's eating my CPU, this didn't happen before and I'm not sure why, do you have a fix for this?
which version is the one that works for you?
even at highest chunk i cant get it to sound good
I assumed that the clear setting button at the top would clear out a slot for instance I've been testing a bunch of slots and I want to set them back to the default blank but apparently clear setting does not do what I thought it did in fact it seemed to screw up the whole program.
Cool video. Ive been wondering if i can use it to talk to other people as i dont really want people to hear my voice. Like in discord or something.
Mb i found the vid
@@AxooD1 what’s the title of the vid
I forgot, has there been a place for a collection of trained voices so far? I definitely don't have the system nor time to find a character, compile voice lines and then train off that.
AI Hub has a comprehensive collection. I can't post their discord invite here, but it isn't hard to find.
@@gjvyigfghjghivff how to join their server? can u give me the invite code only. not the full link
@@ozymogaming It's "aihub"
There is in fact and it's the AI Hub discord. I plan on making a vid of it in the future just quickly going over it
there is an echo after the first sentence and it gets so squeeky till u cant hear it anymore
Hello sir, do I need to download any extra files prior to downloading these as shown in this video, or will everything work just fine when following the same process in this video? I have heard you talking about downloading RVC first or maybe I am mistaken.
Thank you.
You will need to download or train AI models for voices. It only comes with 4 preinstalled
Thank you, sir, for replying. But I am done downloading the software but I think the voice changes often say things that I didn't say and weird noises and twist echo. I have an AMD potato laptop with just 4gb vram and 32gb ram, in the gpu drawer I see just gpu on/off option no room for choosing between GPU or CPU not sure why that is like that on AMD played with it all day yet the software hasn't been able to say a single word of mine only making a weird noise and short utterances. Not sure though, do you think upgrading the AMD drive would make any much difference? Or maybe time to change to Nvidia.
Note. The laptop comes with a hybrid gpu AMD Rx / Vega 8. I think the Vega is interfering. 🤯
@@ivw1286I have a rented A100, a $10,000 GPU, and it still struggles a little. I don't think you meet the minimum requirements to even run the program.
Why is my voice changer says no embedder? When i changed into a custome voice it says failed idk why it says fail i cant even hear the voice changer
Can you make a video on how to train a custom model for this? Thanks!
Completed :), check out the RVC videos playlist
i have to figure out a way to offload the processing to my laptop so it doesn't bug out when playing demanding games. is there a way to access it from a lan client?
ps: i thought the directml version worked for my 6900xt, but it's processing on the cpu. but, on the plus side, converting to omnx literally halves the load on the cpu. when running this exclusively, i managed to get a total lag of 400ms buffer + processing. not bad for an octacore cpu (5800x3d)
there is no way to mute the output feedback, right?
so that, when you talk, you don't actually hear yourself back but still can be recorded or heard by other people....
man i really wanted this to work for me but for some reason as soon as i press start it just starts saying random voice lines like someone else is speaking on my mic and i dont even know what could be causing that
SAME did u figure it out?
The delay for amd cards are not good, hopefully they will work on this matter soon
Where did you get your Marine model?
Hi! just wanted to ask, can I use this app to change my pre-recorded voice?
Thanks for the video, but when I browse and select the onnx file in one of the slots, do I just leave the index blank? Because if I select the onnx file, then click upload and try to use the voice it doesn't work. Also, with piper it seems to require a json file as well but how is this generated? rvc doesn't give this to me.
8:38 Alas, Houshou Marine finally speaks English🙏🙏🙏
RVC Quality should be in HIGH if you want to use this in a social game as male - female ect.
You might not notice much of a difference but there's a world of difference actually, people notice.
What you do is you keep it in HIGH and the program will make sure the output is always the best that way people don't suspect a thing.
Doesn't sound that different and it's not worth the hit on CPU resources.
I compared High and low to my friends (markeplier model) and they said there is not deference at all.
edit: I'm using 1060
@@leexy3395 You are very very wrong in the context I spoke of, many people have good hearing, of course your friends who aren't paying much attention or taking is seriously won't notice a difference. But people in the context I mentioned absolutely will.
"And it's going to give you the best audio 👹" 9:05
Great video!, btw how to fix the voice repeating issue?
Are you using vb audio cable? Or have the setting to hear the output of your mic on?
One day I'll make my own ai voice using my voice to then mimic myself with an ai voice of myself.
Hi Jarod. I'm trying to use my bluetooth mic as an input for voice but it is not showing in the input options! Any help in this regard will be appreciated!
I'm here because I can't even get it running 😭
I have tried to install it and get it to run, but something is missing and I have not seen a video on installing and getting the client to run.
CHEERS!
I notice my voice cuts out a lot while speaking. So it misses a few words sometimes. Whats the best way to solve that?
I'm also having this issue, so I'm tagging on to your comment.
Hello, I have a GPU but for some reason it’s not selectable in the processing option
Can you tell me why when I say something through the AI it keeps repeating it?
I wish I could play around with this, but sadly I have a shitty *INTEL* GPU PC. (Intel HD Graphics 4600, yep… it’s shit).
But yeah… this is a pretty impressive tool.
Hello and nice video! i have a big problem with my output lagg (I hear my voice after few seconds like 10-12) i have buf:300ms and res:like 10k or something how can i fix that?
Sometimes it lags a bit, give it a minute or so and the res will go down. I turn off my mic and it speeds up the process tremendously
i really want to try this but my 1060 says no.
hey wanted to ask where you can all that extra models ?
Great Video as always! but im having a problem with my gpu, i cant seem to find my current gpu (AMD) on the options for GPU (and i mean cpu is the only one appearing), how do i make it appear to be an option?
You will have to download the directml version of the package, it should appear there so I've heard after this version is installed.
You ever figure out how to get it to work on AMD?
@@faded8975 download the directml version not the gpu cuda one
I got issues and need serious help...
issue 1: The AI is using the CPU when I have the program selected for "GPU 0" which is my main GPU.
issue 2: The audio is choppy, it cuts too often.
Chunk 192, Extra 8192. F0 Det rvmpe_onnx. Run audio off server.
Can you tell me the voice model and settings you used in the opening? please.
Index setting still matters when using model without index
If you use a voice changer, you won't be able to laugh, because when you laugh your voice will become like a robot
so whenever i boot it up and do all the settings and whatnot, it never changes my voice even if i follow every step, i can hear my voice yeah and ive already fiddled with the tune, yet nothing changes even if i turn it on
I tried running this, but its only white screen, what does it mean?
What CPU model did you use?
Is microphone important on sound quality too? Because I'm using my headset microphone and it sounds ass and robotic
Yes, but model quality play a role here too. If you can upgrade your mic, it may help out but it may not if the models are not good.
hey, is there any difference in method which rvc works here or in any other fork ?
Hearing Botan in English seems so wow to me ww
Whenever i run it on my pc the voices sounds TOO robotic, training them would make a dif?
my voice changer worked perfectly when i just installed it, but after reopening it, it started to give errors like "GL is not supported" and refuse to work. any suggestions how to fix this?
Hey I tried to use this today but while my mic is on sometimes out of nowhere there is a robotic static.. is there any way to fix it?
bro ur setup is insane, can u benchmark some games lmao