Introducing Speech To Speech: Elevenlabs Unveils Mind-blowing New Feature!

Поделиться
HTML-код
  • Опубликовано: 3 янв 2025

Комментарии • 105

  • @BobDoyleMedia
    @BobDoyleMedia  11 месяцев назад +1

    For a free alternative for speech to speech, check out this video: ruclips.net/video/Usua2LnnX4g/видео.html

  • @SuperEliasTM
    @SuperEliasTM 9 месяцев назад +7

    I'm editing a Wedding recap video and theres a section where during the bridal party speeches the microphone kinda cut out and ruined the flow of what they were saying. I used this to recreate their voice on the sections that cut out and my goodness, it's wonderful. Truly a gem for issues like this.

  • @marcdevinci893
    @marcdevinci893 Год назад +12

    Game changer for sure. No more cryptic use of punctuation to try to get the right flow and inflection on words and multiple re-rolls of lines. Brilliant

  • @PhilAndersonOutside
    @PhilAndersonOutside Год назад +6

    I've tried over a dozen different AI voice labs, several are good, some are not. The one I decided to go with was Eleven Labs.

  • @joshstone5227
    @joshstone5227 Год назад +7

    I don't see nothing wrong with using it to speak with passed family members if there voices are saved, it can help people who are grieving

    • @Fivemacs
      @Fivemacs 8 месяцев назад +2

      That's not letting go, not grieving. That seems unhealthy.

  • @TheBlueRage
    @TheBlueRage Год назад

    5:29 what generator did you use. I have a face swap software one with no sound and D-ID didn't allow me to use a famous person although the image was ai generated.

  • @wasthataflute
    @wasthataflute Год назад +1

    Good, fun demonstration. Just the tool I've been looking for. Thanks.

  • @komakaze1
    @komakaze1 Год назад +9

    I'd love something that can read e-books to me with emotion. Some Text To Speech voices are good, but completely robotic in their emotional emphasis.
    I've encountered audio books where i like the story but the voice who is reading it is not to my taste, especially during dialogue of the opposite gender to the reader.
    It would be great if there were easy AI solutions to both of these.

    • @moltenpros
      @moltenpros Год назад

      It's going to cost too much. If you have lots of money then go ahead.

  • @MrVapi23
    @MrVapi23 Год назад +1

    I appreciate your time ✌️

  • @alia8766
    @alia8766 Год назад

    What microphone are you using? The quality is great

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад

      It's a Blue Yeti Pro. I have it as close to me as I can without it being in the frame. And I also may be running some compression on it, depending on the video.

  • @bobhawkey3783
    @bobhawkey3783 Год назад +7

    Nice. I've used some Replica voices because they have good emotional weight but poor voice clarity. Thanks for this.

    • @DawnPeacockOwens
      @DawnPeacockOwens Год назад

      Should we try it?

    • @saintfame23
      @saintfame23 Год назад

      @@DawnPeacockOwens just try respeecher

    • @DawnPeacockOwens
      @DawnPeacockOwens Год назад

      @@saintfame23 we already have the subscription to Eleven labs , so may as well use that one first

  • @luminrabbit9488
    @luminrabbit9488 Год назад +2

    Whoa, this is awesome! Quick question, what was used for the Liam Neeson headshot movement (face over), I’m hoping there’s an API out there somewhere..
    Thank you keep up the great work!

    • @MickPerezRealEstate
      @MickPerezRealEstate 8 месяцев назад

      That's what I want to know...did you ever find out?

  • @DoctorKusanagi
    @DoctorKusanagi Год назад

    I extensively use Eleven Labs and I love it

  • @RexSmithII
    @RexSmithII 6 месяцев назад

    Can you do speech to speech with output voice a cloned voiced?

  • @jonathanrice2568
    @jonathanrice2568 8 месяцев назад

    Hey Bob, do you mind if I ask you achieve such perfect background substitution?? Thanks.

  • @TheBlueRage
    @TheBlueRage Год назад

    A workaround could also be Creative Commons impersonators.

  • @hecaz7052
    @hecaz7052 5 месяцев назад

    It's possible to change the voice and change some text from the audio too? To change for example the speech of a film

    • @BobDoyleMedia
      @BobDoyleMedia  5 месяцев назад

      @@hecaz7052 you could certainly use this tool in conjunction with lip syncing software to do something like that.

    • @hecaz7052
      @hecaz7052 5 месяцев назад

      @@BobDoyleMedia ok because I'm watching a lot of videos and I can see you can change the voice, but not the text... I wanted to be sure I can before paying for it :)

  • @saintfame23
    @saintfame23 Год назад

    Respeecher have this feature and specialized on speech to speech technology. Try them as well

  • @gadgetgrader
    @gadgetgrader Год назад

    Hey Gen will do this also

  • @Vtrest
    @Vtrest Год назад

    Brilliant! How can I do a clone of a singing voice?

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад

      There are definitely solutions for that which I would love to cover eventually here on the channel. Do a search for RVC voice clone and you’ll find your answer.

  • @MeAndMyRoyalEnfield
    @MeAndMyRoyalEnfield Год назад

    Just today tried Descript for the first time for a lot of text I have to read. It, or I, sound like I want to take a long walk off a short pier and when I'm half joking I can't get that light hearted flavor tone to come out. I may try ElevenLabs tomorrow?

  • @IdeasThatHeal
    @IdeasThatHeal Год назад

    Fun stuff! Thanks

  • @douglaskastle
    @douglaskastle Год назад

    What is it like re rendering your own voice. I am thinking of a bad recording, like in a noisy cafe, or echo-y room and redoing it so it sounds studio quality. Bonus points for 2 people talking a spearting them out into different tracks.

  • @thethoughtfield
    @thethoughtfield Год назад

    you've an amazing voice, what do you need this for?

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад +1

      As I say in the video, now I can use whatever acting skill I use with my own voice and then apply it to others, thus being able to create a stable of characters that I can offer or use for my own projects.

  • @Molandria
    @Molandria 9 месяцев назад

    I'm just so lost. I'm trying to start streaming online using a voice changer, speaking live and having the voice changed live, and I'm trying to clone a voice for this purpose. Do you know of anything? I can't find anything. Every time I search for ANYTHING on this, I keep getting "text to speech" options, or cloning voices that ultimately result in text to speech only options.
    Is what I'm looking for a thing? I don't know what to actually search for. ;(

  • @Daemon1995_
    @Daemon1995_ Год назад

    hmm so how do they calculate the limit using speech to speech?

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад

      My guess is that it creates a transcript of what is being, said, and counts the characters.

  • @ReadyToFly24
    @ReadyToFly24 11 месяцев назад

    I use it to clone my own voice for my videos, as i tend to slur some words and not keep tempo.

    • @BobDoyleMedia
      @BobDoyleMedia  11 месяцев назад

      It definitely isn't perfect, and I've had to do some re-recordings to work around that very issue. How long was the sample that you sent it, and does it have any examples of the word that is slurring?

    • @ReadyToFly24
      @ReadyToFly24 11 месяцев назад

      4 clips 30 seconds long seemed to work. read from a script I found online. @@BobDoyleMedia

  • @ddrci88
    @ddrci88 Год назад

    What is the open source voice cloning best app then ?

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад

      Personally I like RVC. With my 3090 I can clone a voice with about 20 minutes of audio in around 30 minutes that sounds pretty good, and can then convert recordings like this, or do "real time" conversion, with a delay depending on your GPU.

  • @IamAaliJah
    @IamAaliJah Год назад

    Well, I know the feature but I like your way of presenting it, jsut love your videos. I am also Liam Neson's big fan.

  • @markmatthews1972
    @markmatthews1972 Год назад

    how much text/time can you upload at one time?

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад

      They've changed it since I did the video, and it looks like they'll take up to 50MB audio files. That's a lot!

  • @kevnar
    @kevnar Год назад

    I would love to use this technology, if they didn't nickle and dime you for every little character you use.

  • @Vifer09
    @Vifer09 Год назад

    This is awesome I’m hoping to use it to answer customers that ask the same question over and over on the phone so I don’t gotta sit there for 10 minutes the turbo feature will make it seem real I hope

  • @aldiergreen
    @aldiergreen Год назад +1

    what if you sing it?

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад +1

      Unfortunately, it does not work, at least in any of the tests I did. Generally, the models require a slightly different type of training if you’re going to use them for singing, but I’m only speaking about training approaches that I know. I really don’t know what ElevenLabs is doing.

  • @thewebstylist
    @thewebstylist Год назад +1

    So grateful for 11 and my subscription to!

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад

      Yeah, it's getting to be a better and better value!

  • @AT-os6nb
    @AT-os6nb Год назад +1

    so where does this leave security? voice print identification etc..... crazy. whatcha out you don't get cloned!

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад

      It's a likely thing, no doubt. But I think we probably all already are...

  • @vivektyagi6848
    @vivektyagi6848 Год назад +2

    Awesome. 🎉 Many Thanks for sharing face fusion and 11 labs 🎉

  • @NeuroGlob
    @NeuroGlob Год назад

    You can really test the voice AI with something like a "love note" reading. They sound like a business transcript... funny though.

  • @nigeldogg
    @nigeldogg Год назад

    Please make videos about open source solutions for this 🎉

  • @Morrisseys7thFriend
    @Morrisseys7thFriend Год назад

    I tried it and it makes the result all jumbled up

  • @richardsaddress580
    @richardsaddress580 Год назад

    Does anyone know if there is a service like this where you can purchase or download your Voice and add it to your Apple or Windows computer?
    I want to do an audiobook of my late great father, reading a public domain translation of the Bible. If I’m limited to an amount of words or minutes, I’m gonna spend 1 trillion billion dollars getting that done…

  • @quizwell
    @quizwell Год назад +1

    great vid - we live in exciting times

  • @plushtownevents
    @plushtownevents 10 месяцев назад

    Hey Bob, nice vid. You want to know how I use it? I’m one of the pre-made Australian voices on ElevenLabs - Friends send me all sorts of crazy stuff people just e used my voice for everyday! As a producer I use it for allowing me to perform reads in voices I don’t have - I even did a read in a 30yo Aussie female voice recently :)

    • @BobDoyleMedia
      @BobDoyleMedia  10 месяцев назад

      Precisely! That's just the kind of use case I'm talking about. I have another video that addresses this specifically for VO professionals: ruclips.net/video/edNQd2LgBrw/видео.htmlsi=PbK0i5jN_BeylcB7

  • @IndePro-z1y
    @IndePro-z1y 9 месяцев назад

    No hate for 11labs but doesnt Vocs AI & Kits already do this?

    • @BobDoyleMedia
      @BobDoyleMedia  9 месяцев назад +1

      I’m not familiar with these as you’ve listed them. Do you have a link? I’d love to check them out!

  • @xXWillyxWonkaXx
    @xXWillyxWonkaXx Год назад +3

    Here's a question, do you think Eleven Labs can get to a point where voices are synthesized in real-time instead of submitting an audio sample file and it churning through it and then spitting out the end result like it has (which is impressive still to say the least)

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад +1

      Whether they do it or not, it's hard to say - but will they be ABLE to? No doubt.

    • @stedbenj
      @stedbenj Год назад +1

      I've seen videos of a man who is using AI on his PC to change his voice to a female anime character in almost real time. The technology is pretty much there.

    • @danielle78730
      @danielle78730 Год назад

      what engine is he using to do this…

    • @sirdrak
      @sirdrak Год назад

      @@danielle78730 It's w-okada AI voice changer, opensource, free, local and easy to use... And you can use it in realtime in games,Discord, etc... and every app with microphone support...

  • @dantestaccato
    @dantestaccato 11 месяцев назад

    I tried Elevenlabs but had better results with Vocs AI speech to speech

  • @mrhoneystinger3676
    @mrhoneystinger3676 Год назад

    The thing that concerns me is if you upload your voice to ElevenLabs are you giving them permission to use your voice somewhere else without compensation

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад

      I’m not sure that’s 100% true but I will certainly look into it. I think you have to give them permission to use your voice in their marketplace.

  • @ThyLegohood
    @ThyLegohood Год назад +1

    Emily definitely sounds like she isn't thrilled about what you're wearing.

  • @aliruane
    @aliruane Год назад

    It still sounds generated to me. Delivery is flat and unnaturally inflected

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад

      Well, I think some voices are better than others, and I believe that like with most AI things, a lot of it has to do with the data going into the model. If the read going in is flat, that's all you're going to get out. That's why I have several models of my voice with a range of modulation.

  • @StevenWebb
    @StevenWebb Год назад +1

    You could have picked me!

  • @Kelvinapplegate
    @Kelvinapplegate 2 месяца назад

    Emily is kind of an Emily Downer is wild, or melancholy. 😂😂😂

  • @olexiisokolov
    @olexiisokolov Год назад +1

    only english((

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад +1

      Yes, good point. Forgot to mention that! Obviously, that will change any moment. :)

    • @olexiisokolov
      @olexiisokolov Год назад

      @@BobDoyleMedia hope so)

    • @xHeadcleanerx
      @xHeadcleanerx Год назад

      Elevenlabs has multilingual model too.

    • @olexiisokolov
      @olexiisokolov Год назад

      ​@@xHeadcleanerx Not for Speech to Speech yet

  • @rhondahoward8025
    @rhondahoward8025 Год назад

    It's still a little wonky. The voices can still sound slurred and drunk with the replica feature.

  • @vijayrana9134
    @vijayrana9134 Месяц назад

    i am vijay 0:43 0:45

  • @SynthwaveDuck
    @SynthwaveDuck Год назад +3

    Loved the Deepfake ending

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад +3

      :) Thanks. That's Facefusion. Going to do another video on that.

  • @sorijin
    @sorijin Год назад +1

    This is dope, also grapes are toxic to dogs big FYI for those who don't know

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад +1

      I actually do know from first hand experience. Lost my chihuahua after only 2 years. We had no idea, and he loved them as treats. Hard lesson.

    • @Wasaia
      @Wasaia Год назад

      ​@@BobDoyleMedia😢❤

  • @Gray-Today
    @Gray-Today Год назад +1

    "Cloning" is, making an identical copy. Cloning is NOT voice-to-voice. Voice-to-voice is a conversion. The two are very different. I'm having a tough time finding out what this program is capable of, thanks to illiterate use of terminology. Many are.

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад +1

      Well, I feel like the word "cloning" describes enough for the general public what the result will be. And I guess the term "conversion" isn't as sexy. I get your point, but is it really unclear what the program does? From your viewpoint, I'd say it does exactly what you said: converts. It converts text to speech, and it converts voice to another voice, clearly based on some kind of AI model that is created amazingly fast. "Cloning" or not, I'd say it's amazing.

    • @Gray-Today
      @Gray-Today Год назад

      The program itself is written using non-standard terminology. Most are today. You are obliged, I think, to use the same terminology as the product, right or wrong. What you think is a "sexy" word is irrelevant.
      Yes, it is unclear to those who may have a technical vocabulary. Another example is "AI." It has no definition at all. It means "really cool," right? There are quite a few examples. It's a sign of the sorry times we live in.
      If you must use undefined terms, consider adding a link to a glossary.
      @@BobDoyleMedia

  • @Anarchy-Is-Liberty
    @Anarchy-Is-Liberty Год назад +1

    ROFLMFAO!!! That was great!! ha ha ha!!!

  • @southcoastinventors6583
    @southcoastinventors6583 Год назад

    What the legality of dead actors I wonder

    • @TXanders
      @TXanders Год назад

      It's can be illegal /harassment/defiling and infringement, unless you get permission, for the dead it's considered false light. The bottom line is, you don't own it. Regardless of legislation not updated fully yet anywhere, there's the morality of it too.
      Even the end of this video is infringement on the look, even though it's a satire and genuinely means no harm. Pay for actor release forms, even yourself.

    • @southcoastinventors6583
      @southcoastinventors6583 Год назад

      @@TXanders There no morality for the dead they do not need to worry about that. It probably more a public domain thing so maybe after 50 years or less depending on living heirs trying to cash in for work they didn't do.

    • @BobDoyleMedia
      @BobDoyleMedia  Год назад

      Yes, your point is totally valid. I guess I'm just "going with it while I can" until firmer rules are in place - but it's going to be hard to backpaddle on this tech, so it will be interesting to see what kind of legislation is created. In my case, I'm always making it evident that it's AI, so I believe that this is currently acceptable to RUclips, which suits me just fine.@@TXanders

  • @bravo1oh1
    @bravo1oh1 Год назад +1

    Changing the name Emily to George is transphobic

  • @nattsurfaren
    @nattsurfaren Год назад +1

    The value of putting your face on youtube will not have any value at all because people will think it is all fake. Well I think it is kind of good because then it boils down to the value of the content. But I predict people will upload hundreds of automated content every week all AI generated so it is all going to be BS. Maybe AI will generate uniqness as well so it will all go to BS anyway. Welcome to this BS future.

    • @dinoscheidt
      @dinoscheidt Год назад

      It all has been fake for years. The focal length of the camera is already changing how you look, video is color graded, using a green screen, high lumen lighting,... Does it matter? No. People look at faces, eyes, mouth, gesticulation, mimics,… because it’s a strong part of the humans multi-sensory toolset for communication and understanding.

  • @brytonkalyi277
    @brytonkalyi277 Год назад

    \>