F5-TTS & E2 TTS Google Colab Tutorial

  • Published: Dec 25, 2024

Comments • 93

  • @AbdullahJahangirr
    @AbdullahJahangirr 28 days ago +1

    Happy 1k subs

  • @angelochu3156
    @angelochu3156 2 months ago +4

    I watched many videos about F5-TTS on YouTube. You are the only one who clearly compares the original voice and the cloned voice for the viewer. Keep up the good work!

  • @Dex383-d8d
    @Dex383-d8d 1 month ago +1

    I have already tried everything in the video and it is indeed very easy to use. The AI has its problems, but I guess it will improve over time. The voice cloning part works 100 out of 10: I managed to confuse a friend with his own voice speaking in another language, which was very funny.
    Thank you very much for the video and for taking the time to respond to my first comment.

  • @mekkicharfi5454
    @mekkicharfi5454 2 months ago +1

    Thank you very much and especially for your patience

  • @QHawk7
    @QHawk7 1 month ago +1

    *Great video, thanks. Try dubbing a short documentary and importing a deep voice; let's see what we can do with all the AI tools and Colabs available at this moment.*

  • @MR.VAN1979
    @MR.VAN1979 1 month ago +2

    Your videos bring a lot of value to the community and are worthy of 1 subscription, 1 like, and 1 comment. I wish you good health and make many valuable videos for everyone to learn and follow.

  • @dkerdnase
    @dkerdnase 2 months ago +1

    Thank you so much man! You're awesome!

  • @xenn2996
    @xenn2996 2 months ago +1

    thanks for the tutorial

  • @411KJB
    @411KJB 2 months ago +1

    Excellent!

  • @lsgzmc5806
    @lsgzmc5806 1 month ago +1

    Please make a video on how to use the multi-speech option of this model. I'm having trouble using it.

    • @neuralfalcon
      @neuralfalcon 1 month ago

      Watch this video at 11:18: ruclips.net/video/6i0cXSvyz98/видео.htmlsi=IZ8FKfAD7l0sqmgV

    • @neuralfalcon
      @neuralfalcon 1 month ago +1

      Use the format {emotion_name} your_text.
      For example:
      If the emotion is "happy": {happy} I won a prize.
      For multiple emotions: {happy} I'm happy. {angry} I'm angry. {sad} I'm sad.
      There’s no set order. Just indicate the needed emotion in curly braces before each sentence, like {emotion} your_text.
      Make sure you label those reference audio files the same as your emotion_name.
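The {emotion_name} your_text convention described above can be sketched as a small parser. This is only an illustration: `split_by_emotion` and `missing_references` are hypothetical helper names, not part of F5-TTS, and the sketch assumes emotion names consist of letters, digits, or underscores.

```python
import re

# Hypothetical helpers, not part of F5-TTS. They only illustrate the
# "{emotion} text" convention described in the reply above.
TAG = re.compile(r"\{(\w+)\}")

def split_by_emotion(script):
    """Split a script into (emotion, text) pairs, in order of appearance."""
    parts = TAG.split(script)
    # parts[0] is any text before the first tag; after that the list
    # alternates between an emotion name and the text that follows it.
    segments = []
    for i in range(1, len(parts), 2):
        text = parts[i + 1].strip()
        if text:
            segments.append((parts[i], text))
    return segments

def missing_references(segments, reference_names):
    """Return emotions used in the script that have no matching reference label."""
    return sorted({emotion for emotion, _ in segments} - set(reference_names))

script = "{happy} I'm happy. {angry} I'm angry. {sad} I'm sad."
segments = split_by_emotion(script)
print(segments)
# Assume only two reference clips were labeled so far.
print(missing_references(segments, ["happy", "angry"]))
```

Checking the script against the reference labels before generating can catch a mismatch between an emotion tag and the name given to its reference audio.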

    • @lsgzmc5806
      @lsgzmc5806 1 month ago +1

      @neuralfalcon thx for helping me out

  • @syntaxstreets
    @syntaxstreets 2 months ago

    2nd audio and first model super

  • @vodkalikpatates
    @vodkalikpatates 21 days ago

    Thank you for the video! It's really helpful! 🙌 How can I use it with another model? For example, I want to try "F5-TTS-Turkish". How can I add it properly?

    • @neuralfalcon
      @neuralfalcon 21 days ago

      Search Google to see whether someone has already trained an F5-TTS model for Turkish, or train your own model.
      To learn how to train in different languages watch this video:
      ruclips.net/video/UO4usaOojys/видео.htmlsi=uzMKfs6sdDloKU9a

    • @vodkalikpatates
      @vodkalikpatates 21 days ago

      @neuralfalcon There actually is a Turkish language model. I meant to ask how I can use it with your code, since there is no custom model option in the UI.

  • @Jerometk
    @Jerometk 14 days ago

    Do you have the same thing for lip sync? Something on Google Colab or similar? I want to lip-sync audio to a video, not an image.

    • @neuralfalcon
      @neuralfalcon 14 days ago

      Yes, we have Wav2Lip.
      github.com/Rudrabha/Wav2Lip
      My Google Colab link:
      github.com/NeuralFalconYT/wav2lip

  • @harshvaghanii
    @harshvaghanii 1 month ago

    I've got an error in second step saying -> name 'base_path' is not defined

    • @neuralfalcon
      @neuralfalcon 1 month ago

      That happens because you skipped the cell above, which sets base_path = "/content".
      Run that cell first, then run the next one.
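The cell ordering described above can be sketched as follows. The value of `base_path` comes from the reply; the second cell and the folder name "F5-TTS" are assumptions for illustration and may not match the actual notebook.

```python
# Cell 1: define the working directory first (value taken from the reply above).
base_path = "/content"

# Cell 2: any later cell that builds on base_path.
# "F5-TTS" is only an illustrative folder name, not confirmed by the notebook.
project_dir = f"{base_path}/F5-TTS"
print(project_dir)
```

If the second cell runs in a fresh runtime before the first, Python raises NameError: name 'base_path' is not defined, which is the error reported in the question.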

  • @EphemeralInferno
    @EphemeralInferno 12 days ago

    When I do it, it says
    "No module named onx"

  • @QHawk7
    @QHawk7 1 month ago +1

    Can I get this to work on Kaggle?

    • @neuralfalcon
      @neuralfalcon 1 month ago +1

      Yes

    • @QHawk7
      @QHawk7 1 month ago

      @neuralfalcon
      How?

    • @neuralfalcon
      @neuralfalcon 1 month ago

      @QHawk7
      github.com/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb
      You may need to run:
      !sudo apt install ffmpeg
      Ensure you are connected to a GPU runtime.
      You may also need to install torch if PyTorch is not pre-installed on Kaggle by default.
      github.com/SWivid/F5-TTS
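Before running the notebook on Kaggle, the two prerequisites mentioned above (ffmpeg and PyTorch) can be checked with a short sketch like this one. `check_environment` is a hypothetical helper name; it only reports what is present and installs nothing.

```python
import importlib.util
import shutil

def check_environment():
    """Report whether ffmpeg and the torch package appear to be available."""
    return {
        # shutil.which returns None when the executable is not on PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
        # find_spec returns None when the package cannot be imported.
        "torch": importlib.util.find_spec("torch") is not None,
    }

status = check_environment()
for name, found in status.items():
    print(f"{name}: {'found' if found else 'missing'}")
```

Anything reported as missing would then be installed as the reply describes, for example with the ffmpeg command quoted above.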

  • @Carlon15
    @Carlon15 1 month ago

    Can you make a video about how to train your model in a different language, please?

    • @neuralfalcon
      @neuralfalcon 1 month ago

      github.com/SWivid/F5-TTS/discussions/143
      ruclips.net/video/RQXHKO5F9hg/видео.html

    • @neuralfalcon
      @neuralfalcon 1 month ago

      Watch this video: ruclips.net/video/GmketyZW2c4/видео.html

    • @neuralfalcon
      @neuralfalcon 1 month ago

      Watch this video : ruclips.net/video/UO4usaOojys/видео.html

  • @snakezo4218
    @snakezo4218 2 months ago

    Is there a way to speak in our own voice and transfer it to the cloned voice, so that it reproduces the emotion and tone?
    For example, if I act like an angry person, can the cloned voice reproduce that angry delivery?

    • @neuralfalcon
      @neuralfalcon 1 month ago

      Easy. Record a short, 15-second audio clip in which you speak in a specific tone, such as angry, sad, or happy. Use this clip as the reference audio in F5-TTS, and the output voice will match the emotion you chose.
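To confirm that a recorded reference clip is close to the suggested 15 seconds, its duration can be measured with a minimal sketch like the one below. It assumes the clip is an uncompressed PCM WAV file, since Python's standard `wave` module reads only that format; `wav_duration_seconds` and the demo file name are hypothetical.

```python
import os
import tempfile
import wave

def wav_duration_seconds(path):
    """Return the length of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()

# Demo only: write 15 seconds of silence so the sketch runs without a real recording.
demo_path = os.path.join(tempfile.gettempdir(), "reference_demo.wav")
with wave.open(demo_path, "wb") as wav:
    wav.setnchannels(1)      # mono
    wav.setsampwidth(2)      # 16-bit samples
    wav.setframerate(8000)   # 8 kHz
    wav.writeframes(b"\x00\x00" * 8000 * 15)

print(wav_duration_seconds(demo_path))
```

For a real recording, pass the path of the reference file instead of the demo file.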

  • @hiepinh5599
    @hiepinh5599 22 days ago

    Can I train it with my own voice, for example an Optimus voice?

    • @neuralfalcon
      @neuralfalcon 22 days ago

      Yes, 100%. Copy this notebook and use F5-TTS: colab.research.google.com/github/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb

    • @hiepinh5599
      @hiepinh5599 21 days ago

      I checked your Colab, and it doesn't work.

    • @neuralfalcon
      @neuralfalcon 21 days ago

      It's working

    • @hiepinh5599
      @hiepinh5599 14 days ago

      @neuralfalcon Thank you, it worked. Now I have a checkpoint file trained with F5-TTS, but I don't know how to run inference with it. Can you help me? I need a Python script.

  • @pneuma23093
    @pneuma23093 2 months ago +1

    2:57 That's Dva right?

  • @Deewayne94
    @Deewayne94 15 days ago

    Hello, can I also clone a voice in French? 😊

    • @neuralfalcon
      @neuralfalcon 15 days ago

      Yes, but you either need to train the model in French yourself or wait for someone else to do it.
      The best option right now is to pay for a service like ElevenLabs.io to clone your voice.

    • @neuralfalcon
      @neuralfalcon 12 days ago

      huggingface.co/RASPIAUDIO/F5-French-MixedSpeakers-reduced

  • @asfandsherazkhan9135
    @asfandsherazkhan9135 1 month ago

    Can we dub into another language, for example from English to Hindi?

    • @neuralfalcon
      @neuralfalcon 1 month ago

      It only supports English and Chinese, but you can train it in other languages. Watch this video to learn how:
      ruclips.net/video/GmketyZW2c4/видео.html

    • @neuralfalcon
      @neuralfalcon 12 days ago +1

      huggingface.co/SPRINGLab/F5-Hindi-24KHz

    • @neuralfalcon
      @neuralfalcon 11 days ago

      F5-TTS Hindi: ruclips.net/video/Pb3Zx562Juw/видео.html

  • @kanavwastaken
    @kanavwastaken 2 months ago

    Can you please make it work on LightningAI bro?

    • @neuralfalcon
      @neuralfalcon 2 months ago

      github.com/NeuralFalconYT/F5-TTS-Demo/blob/main/F5-TTS-lightning-ai.ipynb
      Download this notebook and upload it to lightning.ai/. Make sure to switch to GPU.

  • @411KJB
    @411KJB 1 month ago

    Link no longer works. Any new links?

    • @neuralfalcon
      @neuralfalcon 1 month ago

      colab.research.google.com/github/NeuralFalconYT/F5-TTS-Demo/blob/main/F5_TTS_Latest.ipynb
      Or follow official instructions:
      github.com/SWivid/F5-TTS

    • @411KJB
      @411KJB 1 month ago

      It was PERFECT for that window though and I thank you so much.

  • @PratikshaPatil-r9o
    @PratikshaPatil-r9o 2 months ago

    Hey, is the process for E2 the same?

    • @neuralfalcon
      @neuralfalcon 2 months ago

      Yes, it's the same. Just choose the E2-TTS button.

  • @snakezo4218
    @snakezo4218 1 month ago

    I tried it. Is it possible to make it speak with a French accent? It still has difficulty with that. Or can I contact the creator to ask him directly?

    • @neuralfalcon
      @neuralfalcon 1 month ago

      ruclips.net/video/RQXHKO5F9hg/видео.html

    • @neuralfalcon
      @neuralfalcon 1 month ago

      Watch this video : ruclips.net/video/UO4usaOojys/видео.html

    • @neuralfalcon
      @neuralfalcon 12 days ago

      huggingface.co/RASPIAUDIO/F5-French-MixedSpeakers-reduced

  • @priyakumari-ky4nn
    @priyakumari-ky4nn 1 month ago

    Can F5-TTS support Hindi voices? Please answer.

    • @neuralfalcon
      @neuralfalcon 1 month ago +1

      It only supports English and Chinese, but you can train it in other languages. Watch this video to learn how:
      ruclips.net/video/GmketyZW2c4/видео.html

    • @priyakumari-ky4nn
      @priyakumari-ky4nn 1 month ago

      @neuralfalcon Could you please make a video on realistic Hindi TTS voices?

    • @neuralfalcon
      @neuralfalcon 1 month ago

      Watch this video : ruclips.net/video/UO4usaOojys/видео.html

    • @neuralfalcon
      @neuralfalcon 12 days ago

      huggingface.co/SPRINGLab/F5-Hindi-24KHz

    • @neuralfalcon
      @neuralfalcon 11 days ago

      F5-TTS Hindi: ruclips.net/video/Pb3Zx562Juw/видео.html

  • @RostinSino
    @RostinSino 2 months ago

    Does it work with the Indonesian language? 🙏

    • @neuralfalcon
      @neuralfalcon 2 months ago

      For now it's a big "no". You need to train it on Indonesian from scratch. You can use ElevenLabs instead, but it's a paid service.

    • @neuralfalcon
      @neuralfalcon 1 month ago +1

      Watch this video: ruclips.net/video/GmketyZW2c4/видео.html

    • @neuralfalcon
      @neuralfalcon 1 month ago

      Watch this video : ruclips.net/video/UO4usaOojys/видео.html

  • @abhishekkumar-bz1ql
    @abhishekkumar-bz1ql 2 months ago

    Will it work with the Hindi language?

    • @neuralfalcon
      @neuralfalcon 2 months ago +1

      For now it's a big "no". You need to train it from scratch for other languages.

    • @abhishekkumar-bz1ql
      @abhishekkumar-bz1ql 2 months ago

      @neuralfalcon Do you know how to train it? Or is there a reference video for it?

    • @neuralfalcon
      @neuralfalcon 2 months ago

      @abhishekkumar-bz1ql github.com/SWivid/F5-TTS/issues/87

    • @neuralfalcon
      @neuralfalcon 1 month ago

      Watch this video : ruclips.net/video/UO4usaOojys/видео.html

    • @neuralfalcon
      @neuralfalcon 11 days ago

      F5-TTS Hindi: ruclips.net/video/Pb3Zx562Juw/видео.html

  • @QHawk7
    @QHawk7 1 month ago +1

    *Is it Multi-language?*

    • @neuralfalcon
      @neuralfalcon 1 month ago +1

      Only English and Chinese

    • @neuralfalcon
      @neuralfalcon 1 month ago

      Watch this video : ruclips.net/video/UO4usaOojys/видео.html

  • @weini-sf3pu
    @weini-sf3pu 2 months ago

    When I use Generate TTS, I get the error "FileNotFoundError: [Errno 2] No such file or directory: 'nvidia-smi'". Can you help me?

    • @neuralfalcon
      @neuralfalcon 2 months ago

      @weini-sf3pu Yes, send a screenshot to NeuralFalcon@proton.me

    • @neuralfalcon
      @neuralfalcon 2 months ago

      @weini-sf3pu First, you need a GPU to use 'nvidia-smi'. If you are running in a Jupyter notebook, use '!nvidia-smi'.
      If you are running in a terminal, use just 'nvidia-smi'. Otherwise, you can skip this step.
      Find the CUDA version another way so that you can install the matching PyTorch build.
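The FileNotFoundError above can be avoided by checking for the tool before calling it. The following is a minimal sketch under that assumption; `gpu_summary` and `torch_cuda_version` are hypothetical helper names. As one alternative way to find the CUDA version, the sketch reads `torch.version.cuda`, which only helps when PyTorch is already installed.

```python
import shutil
import subprocess

def gpu_summary():
    """Return the nvidia-smi report as text, or None when it is unavailable."""
    if shutil.which("nvidia-smi") is None:
        # The tool is not on PATH, so calling it would raise FileNotFoundError.
        return None
    result = subprocess.run(["nvidia-smi"], capture_output=True, text=True)
    return result.stdout if result.returncode == 0 else None

def torch_cuda_version():
    """Return the CUDA version PyTorch was built with, or None."""
    try:
        import torch
    except ImportError:
        return None  # PyTorch is not installed
    return torch.version.cuda  # None on CPU-only builds

summary = gpu_summary()
print(summary if summary is not None else "nvidia-smi not available")
print("CUDA version reported by PyTorch:", torch_cuda_version())
```

When both helpers return None, the runtime most likely has no usable NVIDIA GPU, which matches the advice to switch to a GPU runtime first.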

  • @Dex383-d8d
    @Dex383-d8d 2 months ago

    Why did the page ask me for permission to use my microphone? Do not enter the pinned link, you will probably be hacked... The video seemed useful but better not risk it

    • @neuralfalcon
      @neuralfalcon 2 months ago +1

      Thank you for your comment! It sounds like you might not be familiar with how Gradio applications work. The page requests microphone permission because the app needs recorded or uploaded audio in order to clone a voice. Our code prioritizes recording audio before launching the app, which is why microphone access is required. If you're interested, you can learn more about this in the Gradio documentation here: www.gradio.app/docs/gradio/audio

    • @Dex383-d8d
      @Dex383-d8d 2 months ago

      @neuralfalcon Thank you very much for replying to my comment. I will read the documentation; it is true that I am not familiar with the application.

  • @Ice_camp
    @Ice_camp 2 months ago

    uncheck remove silence

    • @neuralfalcon
      @neuralfalcon 2 months ago

      You can uncheck the "remove silence" option, but that may leave silence in the generated audio.