F5-TTS & E2 TTS Google Colab Tutorial

Neural Falcon

8K views

Published: Dec 25, 2024

Comments • 93

@AbdullahJahangirr 28 days ago ⁺¹
Happy 1k subs
@neuralfalcon 28 days ago
Thank you bro
@angelochu3156 2 months ago ⁺⁴
I watched many videos about F5-TTS on YouTube. You are the only one who clearly compares the original sound and the cloned sound for the viewer. Keep up the good work!
@neuralfalcon 2 months ago
Glad I could help!
@Dex383-d8d 1 month ago ⁺¹
I have already tried everything in the video and it is indeed very easy to use. The AI has its problems, but I guess it will improve over time. The voice cloning part works 100 out of 10: I managed to confuse a friend with his own voice speaking in another language, which was very funny.
Thank you very much for the video and for taking the time to respond to my first comment.
@mekkicharfi5454 2 months ago ⁺¹
Thank you very much, especially for your patience
@QHawk7 1 month ago ⁺¹
*Great video, thanks. Try dubbing a short documentary and importing a deep voice. Let's see what we can do with all the AI tools and Colabs available at the moment.*
@MR.VAN1979 1 month ago ⁺²
Your videos bring a lot of value to the community and are worthy of 1 subscription, 1 like, and 1 comment. I wish you good health and hope you make many more valuable videos for everyone to learn from and follow.
@dkerdnase 2 months ago ⁺¹
Thank you so much man! You're awesome!
@xenn2996 2 months ago ⁺¹
thanks for the tutorial
@neuralfalcon 2 months ago
Happy to help
@411KJB 2 months ago ⁺¹
Excellent!
@lsgzmc5806 1 month ago ⁺¹
Please make a video on how to use the multi-speech option of this model. I'm having trouble using it.
@neuralfalcon 1 month ago
11:18, watch this video: ruclips.net/video/6i0cXSvyz98/видео.htmlsi=IZ8FKfAD7l0sqmgV
@neuralfalcon 1 month ago ⁺¹
Use the format {emotion_name} your_text.
For example:
If the emotion is "happy": {happy} I won a prize.
For multiple emotions: {happy} I'm happy. {angry} I'm angry. {sad} I'm sad.
There’s no set order. Just put the emotion you need in curly braces before each sentence, like {emotion} your_text.
Make sure each reference audio file is labeled with the same name as its emotion_name.
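A minimal sketch of how the {emotion} markup above could be split into pieces, assuming emotion names are single words made of letters, digits, or underscores. `split_by_emotion` is a hypothetical helper written for illustration; it is not a function from F5-TTS or from the notebook, and it only shows which text would go with which labeled reference clip.

```python
import re

# Hypothetical helper (not part of F5-TTS): splits "{emotion} text" markup
# into (emotion, text) pairs so each piece can be matched to the reference
# clip that carries the same emotion name.
SEGMENT_RE = re.compile(r"\{(\w+)\}\s*([^{}]+)")


def split_by_emotion(script):
    """Return a list of (emotion_name, text) tuples in the order written."""
    return [(name, text.strip()) for name, text in SEGMENT_RE.findall(script)]


segments = split_by_emotion("{happy} I'm happy. {angry} I'm angry. {sad} I'm sad.")
for emotion, text in segments:
    print(emotion, "->", text)
```

Text that appears before the first {emotion} tag is ignored by this sketch, so every sentence should start with a tag.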
@lsgzmc5806 1 month ago ⁺¹
@neuralfalcon thanks for helping me out
@syntaxstreets 2 months ago
2nd audio and first model super
@vodkalikpatates 21 days ago
Thank you for the video! It's really helpful! 🙌 How can I use it with another model? For example, I want to try "F5-TTS-Turkish". How can I add it properly?
@neuralfalcon 21 days ago
Search on Google to find out whether someone has trained an F5-TTS model for Turkish, or train your own model.
To learn how to train in different languages, watch this video:
ruclips.net/video/UO4usaOojys/видео.htmlsi=uzMKfs6sdDloKU9a
@vodkalikpatates 21 days ago
@@neuralfalcon There actually is a Turkish language model. I meant to ask how I can use it with your code, since the UI doesn't have a custom model option.
@Jerometk 14 days ago
Do you have the same thing for lip sync? Something on Google Colab or similar? I want to lip-sync audio to a video, not an image.
@neuralfalcon 14 days ago
Yes, we have Wav2Lip.
github.com/Rudrabha/Wav2Lip
My Google Colab link:
github.com/NeuralFalconYT/wav2lip
@harshvaghanii 1 month ago
I got an error in the second step: name 'base_path' is not defined
@neuralfalcon 1 month ago
That's because you forgot to run the cell above, which sets base_path = "/content".
Run that cell first, then run the next one.
@EphemeralInferno 12 days ago
When I do it, it says
"No module named onx"
@neuralfalcon 12 days ago
Yep, new bug
@QHawk7 1 month ago ⁺¹
Can I get this to work on Kaggle?
@neuralfalcon 1 month ago ⁺¹
Yes
@QHawk7 1 month ago
@@neuralfalcon
How?
@neuralfalcon 1 month ago
@@QHawk7
github.com/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb
You may need to run:
!sudo apt install ffmpeg
Ensure you are connected to a GPU runtime.
You may also need to install torch if PyTorch is not pre-installed on Kaggle by default.
github.com/SWivid/F5-TTS
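A small sketch for checking the two prerequisites mentioned above before running the notebook, using only the Python standard library. `check_prerequisites` is a name made up for this example; it only reports whether `ffmpeg` is on the PATH and whether the `torch` package can be imported, and it assumes nothing about how Kaggle images are built.

```python
import importlib.util
import shutil


def check_prerequisites():
    """Report whether ffmpeg and PyTorch are already available here."""
    return {
        # shutil.which returns None when the executable is not on PATH
        "ffmpeg": shutil.which("ffmpeg") is not None,
        # find_spec returns None when the package is not installed
        "torch": importlib.util.find_spec("torch") is not None,
    }


status = check_prerequisites()
for name, found in status.items():
    print(f"{name}: {'found' if found else 'missing, install it first'}")
```

If either entry is reported as missing, run the matching install step from the reply above before starting the app.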
@Carlon15 1 month ago
Can you make a video about how to train your model in a different language, please?
@neuralfalcon 1 month ago
github.com/SWivid/F5-TTS/discussions/143
ruclips.net/video/RQXHKO5F9hg/видео.html
@neuralfalcon 1 month ago
Watch this video: ruclips.net/video/GmketyZW2c4/видео.html
@neuralfalcon 1 month ago
Watch this video: ruclips.net/video/UO4usaOojys/видео.html
@snakezo4218 2 months ago
Is there a way to speak with our own voice and transfer it to the cloned voice, so that it reproduces the emotion and tone?
For example, if I act like an angry person, can the cloned voice reproduce that angry delivery?
@neuralfalcon 1 month ago
Easy. Record a short, 15-second audio clip in which you speak in a specific tone, such as angry, sad, or happy. Use this audio as the reference in F5-TTS, and the output voice will match your chosen emotion, such as anger.
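A rough sketch for checking the length of a reference clip before uploading it, using Python's built-in `wave` module (uncompressed WAV files only). `clip_seconds` and `is_short_enough` are hypothetical helpers, and treating 15 seconds as an upper limit is an assumption based on the reply above, not a documented F5-TTS rule.

```python
import os
import tempfile
import wave


def clip_seconds(path):
    """Length of an uncompressed WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_short_enough(path, max_seconds=15.0):
    """True when the clip is non-empty and at most max_seconds long (assumed limit)."""
    return 0 < clip_seconds(path) <= max_seconds


# Demo on a generated 2-second silent clip, so the sketch runs without a real recording.
demo_path = os.path.join(tempfile.mkdtemp(), "reference.wav")
with wave.open(demo_path, "wb") as wav:
    wav.setnchannels(1)        # mono
    wav.setsampwidth(2)        # 16-bit samples
    wav.setframerate(16000)    # 16 kHz
    wav.writeframes(b"\x00\x00" * 16000 * 2)

print(clip_seconds(demo_path), is_short_enough(demo_path))
```

Replace `demo_path` with the path to your own recording to check it.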
@hiepinh5599 22 days ago
Can I train it with my own voice, for example an Optimus voice?
@neuralfalcon 22 days ago
Yes, 100%. Copy this notebook and use F5-TTS: colab.research.google.com/github/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb
@hiepinh5599 21 days ago
I checked your Colab, and it doesn't work
@neuralfalcon 21 days ago
It's working
@hiepinh5599 14 days ago
@@neuralfalcon Thank you, it worked. Now I have a checkpoint file trained with F5-TTS, but I don't know how to run inference with it. Can you help me? I need a Python script.
@pneuma23093 2 months ago ⁺¹
2:57 That's Dva right?
@neuralfalcon 2 months ago ⁺¹
Yes
@Deewayne94 15 days ago
Hello, can I also clone a voice in French? 😊
@neuralfalcon 15 days ago
Yes, but you either need to train the model in French yourself or wait for someone else to do it.
The best option right now is to pay for a service like ElevenLabs.io to clone your voice.
@neuralfalcon 12 days ago
huggingface.co/RASPIAUDIO/F5-French-MixedSpeakers-reduced
@asfandsherazkhan9135 1 month ago
Can we dub into another language, for example from English to Hindi?
@neuralfalcon 1 month ago
It only supports English and Chinese, but you can train it in other languages. Watch this video to learn how:
ruclips.net/video/GmketyZW2c4/видео.html
@neuralfalcon 12 days ago ⁺¹
huggingface.co/SPRINGLab/F5-Hindi-24KHz
@neuralfalcon 11 days ago
F5-TTS Hindi: ruclips.net/video/Pb3Zx562Juw/видео.html
@kanavwastaken 2 months ago
Can you please make it work on LightningAI bro?
@neuralfalcon 2 months ago
github.com/NeuralFalconYT/F5-TTS-Demo/blob/main/F5-TTS-lightning-ai.ipynb
Download this notebook and upload it to lightning.ai/. Make sure to switch to GPU.
@411KJB 1 month ago
Link no longer works. Any new links?
@neuralfalcon 1 month ago
colab.research.google.com/github/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb
Or follow the official instructions:
github.com/SWivid/F5-TTS
@411KJB 1 month ago
It was PERFECT for that window though and I thank you so much.
@PratikshaPatil-r9o 2 months ago
Hey, is the process the same for E2?
@neuralfalcon 2 months ago
Yes, it's the same. Just choose the E2-TTS button.
@snakezo4218 1 month ago
I tried it. Is it possible to make it speak with a French accent? It still has difficulty. Or can I contact the creator to ask him the question?
@neuralfalcon 1 month ago
ruclips.net/video/RQXHKO5F9hg/видео.html
@neuralfalcon 1 month ago
Watch this video: ruclips.net/video/UO4usaOojys/видео.html
@neuralfalcon 12 days ago
huggingface.co/RASPIAUDIO/F5-French-MixedSpeakers-reduced
@priyakumari-ky4nn 1 month ago
Can F5-TTS support Hindi voices? Please answer.
@neuralfalcon 1 month ago ⁺¹
It only supports English and Chinese, but you can train it in other languages. Watch this video to learn how:
ruclips.net/video/GmketyZW2c4/видео.html
@priyakumari-ky4nn 1 month ago
@@neuralfalcon Could you please make a video on a realistic Hindi TTS voice?
@neuralfalcon 1 month ago
Watch this video: ruclips.net/video/UO4usaOojys/видео.html
@neuralfalcon 12 days ago
huggingface.co/SPRINGLab/F5-Hindi-24KHz
@neuralfalcon 11 days ago
F5-TTS Hindi: ruclips.net/video/Pb3Zx562Juw/видео.html
@RostinSino 2 months ago
Does it work in Indonesian? 🙏
@neuralfalcon 2 months ago
For now it's a big "no". You need to train it on Indonesian from scratch. You can use ElevenLabs, but it's paid.
@neuralfalcon 1 month ago ⁺¹
Watch this video: ruclips.net/video/GmketyZW2c4/видео.html
@neuralfalcon 1 month ago
Watch this video: ruclips.net/video/UO4usaOojys/видео.html
@abhishekkumar-bz1ql 2 months ago
Will it work with Hindi?
@neuralfalcon 2 months ago ⁺¹
For now it's a big no. You need to train it for other languages from scratch.
@abhishekkumar-bz1ql 2 months ago
@@neuralfalcon Do you know how to train it? Or is there a reference video?
@neuralfalcon 2 months ago
@@abhishekkumar-bz1ql github.com/SWivid/F5-TTS/issues/87
@neuralfalcon 1 month ago
Watch this video: ruclips.net/video/UO4usaOojys/видео.html
@neuralfalcon 11 days ago
F5-TTS Hindi: ruclips.net/video/Pb3Zx562Juw/видео.html
@QHawk7 1 month ago ⁺¹
*Is it multi-language?*
@neuralfalcon 1 month ago ⁺¹
Only English and Chinese
@neuralfalcon 1 month ago
Watch this video: ruclips.net/video/UO4usaOojys/видео.html
@weini-sf3pu 2 months ago
When I use Generate TTS, I get the error "FileNotFoundError: [Errno 2] No such file or directory: 'nvidia-smi'". Can you help me?
@neuralfalcon 2 months ago
@@weini-sf3pu Yes, send a screenshot to NeuralFalcon@proton.me
@neuralfalcon 2 months ago
@@weini-sf3pu First, you need a GPU to use 'nvidia-smi'. If you are running in a Jupyter notebook, use '!nvidia-smi'.
If you are running in a terminal, just use 'nvidia-smi'. Otherwise you can skip this step.
In that case, find the CUDA version some other way so you can install the matching PyTorch build.
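A minimal sketch of how a script could avoid that FileNotFoundError: look for `nvidia-smi` on the PATH first and only call it when it exists. `gpu_summary` is a hypothetical helper written for this example and is not taken from the notebook.

```python
import shutil
import subprocess


def gpu_summary():
    """Return the nvidia-smi report as text, or None when it cannot be run."""
    if shutil.which("nvidia-smi") is None:
        # The tool is not installed or not on PATH, likely a CPU-only runtime.
        return None
    result = subprocess.run(["nvidia-smi"], capture_output=True, text=True)
    return result.stdout if result.returncode == 0 else None


summary = gpu_summary()
print(summary if summary else "nvidia-smi not available; skipping the GPU check")
```

Checking with `shutil.which` first means a missing tool is reported as a normal result instead of crashing the cell.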
@Dex383-d8d 2 months ago
Why did the page ask me for permission to use my microphone? Do not open the pinned link, you will probably be hacked... The video seemed useful, but better not to risk it.
@neuralfalcon 2 months ago ⁺¹
Thank you for your comment! It sounds like you might not be familiar with how Gradio applications work. The page requests microphone permission because the app needs to record or upload audio in order to clone it. Our code prioritizes recording audio before launching the app, which is why microphone access is required. If you're interested, you can learn more about this in the Gradio documentation here: www.gradio.app/docs/gradio/audio.
@Dex383-d8d 2 months ago
@@neuralfalcon Thank you very much for replying to my comment. I will read the documentation. It is true that I am not familiar with the application.
@Ice_camp 2 months ago
uncheck remove silence
@neuralfalcon 2 months ago
You can uncheck the "remove silence" option, though that may leave silence in the generated audio.
