I Built a Personal Speech Recognition System for my AI Assistant

Поделиться
HTML-код
  • Опубликовано: 3 окт 2024

Комментарии • 306

  • @zacknawrocki
    @zacknawrocki 4 года назад +48

    I've been looking forward to this part of the series the most! I've been trying to create/run a voice assistant locally, and could not figure out how to apply speech recognition without relying on Google's Python module (which i was trying to avoid for privacy reasons, defeating the purpose of making one) and the HMM basics in my Intro to AI course weren't enough to implement it. This is fantastic.

    • @justinross2664
      @justinross2664 2 года назад

      ruclips.net/video/iyl53zyz5zk/видео.html

  • @theroyal1914
    @theroyal1914 4 года назад +30

    we need programmers like you. For advance learning.

    • @justinross2664
      @justinross2664 2 года назад

      ruclips.net/video/iyl53zyz5zk/видео.html

  • @totoma3297
    @totoma3297 3 года назад +255

    this is michael reeves from the universe where he decided to do something useful with his life

    • @isawcornflakes6201
      @isawcornflakes6201 3 года назад +9

      LMFAOOOO DIDNT HAVE TO DO HIM LIKE THAT 😭☝️

    • @aliveandwellinisrael2507
      @aliveandwellinisrael2507 3 года назад +4

      6:57 yep

    • @UmbraAtrox_
      @UmbraAtrox_ 3 года назад +3

      Wow, that's mean bro.

    • @justinross2664
      @justinross2664 2 года назад

      ruclips.net/video/iyl53zyz5zk/видео.html

    • @DayoBrandon
      @DayoBrandon 2 года назад +1

      Imagine the greatest Michael colab. The two of them plus Michael Stevens (vsauce)

  • @smeagol92055
    @smeagol92055 3 года назад +8

    I'm building my own wearable AI assistant and this series is **exactly** what I was looking for! Great stuff!

    • @kiss-bws
      @kiss-bws 3 года назад

      Can you make tutorial

    • @justinross2664
      @justinross2664 2 года назад

      ruclips.net/video/iyl53zyz5zk/видео.html

    • @PonchoManOG
      @PonchoManOG Год назад

      dude no way same

    • @morraza3307
      @morraza3307 Год назад

      @@PonchoManOG does this tutorial still work?

    • @PonchoManOG
      @PonchoManOG Год назад

      @@morraza3307 yes

  • @thiscrow
    @thiscrow 3 года назад +33

    at the beginning of the video: Oh I see !
    6:57 : Oh I ... oh ...

  • @akulgoel9259
    @akulgoel9259 Год назад +3

    This is so good, I remember seeing this video a year ago and wishing he'd continued the series.

  • @joeyrivenbark5056
    @joeyrivenbark5056 2 года назад +1

    Hey man, I really like how you have written definitions in addition to your speaking, helps a lot.

  • @fteoOpty64
    @fteoOpty64 4 года назад +4

    Loved the high speed speech part!. Well done. Excellent production Mike!. TQ

    • @justinross2664
      @justinross2664 2 года назад

      ruclips.net/video/iyl53zyz5zk/видео.html

  • @chrisw1462
    @chrisw1462 4 года назад +1

    A Cue Stick - used for playing billiards. Acoustic (a-COO-stick) - dealing with sound or audio energy.

  • @vicehaiti914
    @vicehaiti914 4 года назад +37

    Keep going bro.full support

  • @victor7ultimate
    @victor7ultimate 3 года назад

    After watching this video, I literally took off my hat as a mark of respect to this.
    Cant thank you enough.
    Thanks a million

  • @OtRatsaphong
    @OtRatsaphong 2 года назад +3

    Wow, just discovered your channel. Great work. I'm just starting my journey into Deep learning and speech recognition. Will be following your progress.

  • @PritishMishra
    @PritishMishra 3 года назад +11

    Why aren't you uploading more videos? I have already seen this video just came here to say... plzz upload it's been 7 months now!

  • @CreateYourWorld1
    @CreateYourWorld1 3 года назад +1

    Planning on creating my own Jarvis, this video has given me an insight.

    • @s1krrpilot
      @s1krrpilot 3 года назад

      Same, I'm going to call mind Alfred and integrate it into my helmet

  • @davidkim2389
    @davidkim2389 4 года назад +4

    When next?? Best Series ever!! Please post next!!

  • @michealhall7776
    @michealhall7776 3 года назад +1

    I'm enjoying discovering all these smaller ai channels

  • @PaulClifford
    @PaulClifford 3 года назад +3

    Parts 3 & 4 haven't materialized in a year. I'd love to see the rest.

  • @briankim49
    @briankim49 4 года назад +2

    Loved the video. You really showed me the tools I could use to build my own speech recognition model!

    • @justinross2664
      @justinross2664 2 года назад

      ruclips.net/video/iyl53zyz5zk/видео.html

  • @rahulkumarm1446
    @rahulkumarm1446 4 года назад

    Brooo.I really dont know whether u coded this or just took reference from something....idrc u are AMMMMMAAAAZZZZIIINGGGGG.Hats off 2 u.U have a great talent man.......u could be the next ceo of any big fours too....

  • @Alex.In_Wonderland
    @Alex.In_Wonderland Год назад

    omg, thank you! every other video I look up on this subject is just an ad for a text-speech readers! thanks for going into such detail about your thought process, buut after looking at the rig you have vs the one I've got ... well. . . if it took you a handful of days, it'd take me a week or two LOL great video! thanks a lot!

  • @罗杰瑞-p7g
    @罗杰瑞-p7g 2 года назад

    i think this is a very good video for me ,It can not only let me learn some knowledge, but also make me feel relaxed.thank you

  • @seannam1218
    @seannam1218 4 года назад +10

    This is incredibly educational. Thx for sharing ur knowledge for free!

    • @justinross2664
      @justinross2664 2 года назад

      ruclips.net/video/iyl53zyz5zk/видео.html

  • @muhammadrezahaghiri
    @muhammadrezahaghiri 3 года назад +7

    Can you make a TTS using deep learning? :) I really want to see that.

  • @swarajshinde3950
    @swarajshinde3950 4 года назад +3

    Loved it Man , Great Video !

  • @jumbejolly3129
    @jumbejolly3129 2 года назад

    Man your a genius man. I wish I could do this. I have some many ideas but dont know where to start.

  • @alexkonopatski429
    @alexkonopatski429 3 года назад

    this series is so cool! keep it up bro

  • @sirlightshadowslayer473
    @sirlightshadowslayer473 10 месяцев назад

    This was insane, gonna try to do similar now, thank you for the informations

  • @JoshuaHerath
    @JoshuaHerath 3 года назад

    This video is so high quality wish you uploaded more

  • @CraftClone1
    @CraftClone1 3 года назад +2

    This is awesome! I wish there was more content from you
    One Ai hacker to another, keep on going!

  • @kadaliakshay6770
    @kadaliakshay6770 7 месяцев назад +1

    bro amazing wrapping on 6:54

  • @kevinrtres
    @kevinrtres 2 года назад

    Thanks for the information. Just goes to show that the idea that we evolved is just sheer madness.

  • @mtaneesh1411
    @mtaneesh1411 4 года назад +6

    This was a really good video dude. Can you tell me how to make the soundwave display that you had while testing the model

  • @neilosborne8682
    @neilosborne8682 2 года назад

    This is excellent! (subscribed!)
    I had to quickly brush up my skills for a project I'm working on (will be open sourcing it soon!) - and this video was short, sweet and to the point! Thanks

  • @gauravshipurkar1570
    @gauravshipurkar1570 2 года назад

    Bro you are freaking awesome!!! i love your content, helps a lot.

  • @kimkubik7547
    @kimkubik7547 2 года назад +1

    You Michael Rock!!!! Way to teach!!!

  • @chenjus
    @chenjus 4 года назад +1

    Really dope video. Can't wait to see your next one.

  • @jtlunsford780
    @jtlunsford780 Год назад +1

    Totally awesome. Understood about .5% (that's point 5%). Just got my headset set up in Win 10 and am loving it. You're awesome and I bow to your knowledge and expertise....thanks for the cool vid. It was not wasted on my limited knowledge, but it peaked my interest...thanks again...JT

  • @JasonTRogers
    @JasonTRogers 2 года назад +1

    Hey Michele, your videos on AI is fantastic! I haven’t seen any videos lately and I am course what you are doing these days?

  • @benceelmokovacs1422
    @benceelmokovacs1422 3 года назад

    Wo hooo!
    This thing for FREE?! And help for us how to make it ours?!
    This data worth a HUGE amount of money, but you shared it! I'm so much surprised, in the good term!
    Thanks, thanks, thanks for it!!
    I really want to make an own Virtual Assistant, so big thanks for this video, for the data and for the help!
    Be blessed!

  • @nikhilhukkerikar6753
    @nikhilhukkerikar6753 4 года назад +11

    Wow that was a pretty neat video but as someone who’s aspiring to be a AI dev can you make a video explains the code in detail like a stepper! Loved it awesome work!

    • @troopekyt
      @troopekyt 3 года назад +2

      Yes pls

    • @justinross2664
      @justinross2664 2 года назад

      ruclips.net/video/iyl53zyz5zk/видео.html

  • @r7rahuls
    @r7rahuls 3 года назад +19

    This guy is capable of making a real life JARVIS

    • @dabomb3864
      @dabomb3864 3 года назад +2

      There is already. Theres even a python module called JarvisAI and does exactly that.

    • @yentarachangethelife3897
      @yentarachangethelife3897 3 года назад

      @@dabomb3864 and how exactly do you know that??

  • @w3w3w3
    @w3w3w3 Год назад

    your videos are great bro! 🤝

  • @kalyanstock8058
    @kalyanstock8058 Год назад

    Wow...who knew you can make AI teaching so much fun....You should make more videos

  • @UttamDas-ub5ow
    @UttamDas-ub5ow 3 года назад +1

    This man is really a hero 👍💓

  • @SivaShankarsss
    @SivaShankarsss 3 года назад

    Eagerly waiting

  • @jairojosy5985
    @jairojosy5985 4 года назад

    Keep going on and finish the project fast. I'm looking ahead for the project to be finished

  • @alexandergrayson9856
    @alexandergrayson9856 3 года назад +1

    Hey pal, your work's great I love it 🙌🙌

  • @sreerajsathish3635
    @sreerajsathish3635 4 года назад

    Omg the video i was looking for thank for making one..... Full support❤

  • @fteoOpty64
    @fteoOpty64 4 года назад

    Love your War Machine!. I build my first Pentium Pro Dual Proc decades ago. It had a special powersupply and I had to rig my Generic case to fit the Tyan motherboard!. It ran Linux then.

  • @itumelengmothapo2456
    @itumelengmothapo2456 3 года назад

    thank you man... this was fun to watch

  • @DavidAlvesWeb
    @DavidAlvesWeb 3 года назад

    what a great video man, really inspiring! keep up the good work!
    PS: you deserve a better t-shirt bro 😅

  • @jaskeeratsingh9929
    @jaskeeratsingh9929 10 месяцев назад +1

    why are the three micheals i know all so smart : Micheal Phi, Micheal Reeves, Micheal from VSauce

  • @hemanth8195
    @hemanth8195 3 года назад

    This is really nice work dude

  • @emrehankaraoglu4122
    @emrehankaraoglu4122 2 года назад +4

    This is such a amazing video. Congrats! I am wondering about model deployment part. Are you going to share the coding part of ıweb interface? The sound wave and the text that occurs below the sound wave are awesome.

    • @adibakhan2865
      @adibakhan2865 Год назад

      Hey did you found the code for deployment

  • @zikpin
    @zikpin 3 года назад

    This is what i was looking for, thanks

  • @vladiklass1890
    @vladiklass1890 3 года назад +1

    Cool video!!! This will help me a lot with my first NLP project. I wanted to get radio voice data and transcribe it. Any tips on that?
    Btw you should come up with a more memorable outro! :D

  • @redtako.
    @redtako. 10 месяцев назад

    THIS ONE WAS REALLY FUNNY gj love keep up the uploads :)

  • @adeniyiadeboye3300
    @adeniyiadeboye3300 4 года назад +1

    Thanks for this..I am going to thoroughly go through the speech recognition your code on Github

  • @aneekeshkumar8199
    @aneekeshkumar8199 4 месяца назад

    The audio kept buggin me, I'd heard it somewhere, then I remembered the Iconic Outros of the Channel Veritasium !!!!!

  • @MrDonald911
    @MrDonald911 3 года назад +4

    Hey ! I just discovered your channel, nice content ! Your model seems overfitting, I think you should evaluate it on a test data (and not the validation). I would be curious to know how it would perform if you do hyperparameter tuning.

    • @justinross2664
      @justinross2664 2 года назад

      ruclips.net/video/iyl53zyz5zk/видео.html

  • @nathancook8452
    @nathancook8452 2 года назад

    Excellent video, you helped me out tremendously

  • @itsjustsam04
    @itsjustsam04 3 года назад

    Wow I love it! I do have two questions tho. 1 how did you run it in ur Chrome browser. 2 how did u get the cool visual effects for while u were speaking?

  • @NathanaelNewton
    @NathanaelNewton 2 года назад

    This looks like exactly what I need!
    Thanks for posting, I'm gunna follow along and watch tonight.
    One question.. Why are you using the auto generated subs on this video 😂😁

  • @DrewNewmanEngineer
    @DrewNewmanEngineer 3 года назад +1

    Fantastic tutorial! The detail of explanation is remarkable. Nice code editor, which editor is that?

  • @diegomartin6332
    @diegomartin6332 4 года назад

    Please post more videos about this!

  • @rodios-md5du
    @rodios-md5du Год назад

    You are gold💛

  • @yashrajhawle4
    @yashrajhawle4 3 года назад

    Thank you for sharing your knowledge !

  • @tripathi26
    @tripathi26 4 года назад

    This is awesome! thanks man.

  • @angelgabrielortiz-rodrigue2937
    @angelgabrielortiz-rodrigue2937 3 года назад

    Wao, great video man. Really awesome stuff

  • @soonapaana24
    @soonapaana24 3 года назад

    You are totally awesome bro...👏👏👏

  • @scarlett_j
    @scarlett_j Год назад

    Sorry to inform you, but you pretty much rock, at the same time solved this so I don't have to.

  • @rachitahuja1257
    @rachitahuja1257 4 года назад +3

    Hi it's a pretty neat tutorial!! thanks a lot for the insight. I just have two questions :-
    1. Why did u transpose your data before entering it into LSTM layers?
    2. Why have you used MelSpectrogram instead of MFCC coefficients? ( I mean is there some specific reason of doing so?)

    • @oguzynx
      @oguzynx 4 года назад +2

      because sonopy's function mfcc_spec gives the data reverse. so x axis is the frequency and the y axis is time. But we need the reverse. that's why. Pytorch's MFCC or MelSpecgram automatically gives the data in a way we want but he chose that sonopy because it is really fast. Check this out github.com/MycroftAI/sonopy

    • @justinross2664
      @justinross2664 2 года назад

      ruclips.net/video/iyl53zyz5zk/видео.html

  • @shannonsteward4034
    @shannonsteward4034 Год назад

    hi great work I just found your channel great job

  • @numbah16
    @numbah16 2 года назад +1

    where tf are you Michael! Need some new videos already! Let's see what you have been up to with your monster rig

  • @waisyousofi9139
    @waisyousofi9139 2 года назад

    Thanks ,
    Can you make a tutorial on code implementation of speech recognition.
    that would be great.

  • @microgamawave
    @microgamawave 2 года назад

    You can make a video about gait recognition biometrics in python
    recognized you from your walk model

  • @rangefreewords
    @rangefreewords 2 года назад +4

    Can you have this A.I. system set up to make a journal or blog to provide links in the speech to find materials that you had previously recorded? If so, I would be interested to an adaptive blog that can provide updates to previously mentioned material without hand-stakingly rewriting everything.
    I really would like to know what materials you used in your audio recordings with this A.I. that made it all the more concise with your objective. I would also like to see how small you might be able to have this computer system since your previous video with Pi.

    • @Prometheus720
      @Prometheus720 Год назад

      This seems like something to be done by integrating an assistant like this into another application like Obsidian.
      They need to be separate. There are lots of ways of passing external data into Obsidian and vice versa.

  • @stereopsych6381
    @stereopsych6381 2 года назад

    Please upload more!

  • @hg4lyfe
    @hg4lyfe 4 года назад

    Bro this is perfect wow thanks

  • @tilahunanagaw6175
    @tilahunanagaw6175 Год назад

    what interesting presentation it is!!!

  • @aviavinav7208
    @aviavinav7208 4 года назад +1

    Great Video!

  • @Tera2Space
    @Tera2Space Год назад

    Hello, when will there be a guide to creating your own speech synthesis?
    (TTS)

  • @vincebelansky425
    @vincebelansky425 11 месяцев назад

    Thank you for this video and the insight of how to design a voice recognition system independently from the ground up by an newly to AI. Most videos tell you to connect the internet and to a big server by google or someone else. The only question that I have is why use python and not C or C++, especially since you are running a raspberry pi with limited memory and slower CPU and the natural time restraints of real-time speech recognition?

  • @letsplat__
    @letsplat__ Месяц назад

    Hi Michael!
    I am a student and highly interested in AI for building things.
    It would be great help if you could make a video or share some resources on how to get started.

  • @pranavthakur6744
    @pranavthakur6744 2 года назад

    Can you make a detailed video how did you manage to make it. I want to learn it.

  • @ZpErMy
    @ZpErMy Год назад

    Hello, I was fascinated with your Speech Recognition System. I wonder, could your system recognize sung musical notes?
    that is, instead of words, musical notation.

  • @SpeechProductivity
    @SpeechProductivity 3 года назад

    Very informative!

  • @deelordthegreat
    @deelordthegreat 2 года назад

    THANK YOU!! ✌

  • @Pixel_Recap
    @Pixel_Recap 2 года назад

    Can you put a video about offline speech recognition and text to speech

  • @AlanJames1987
    @AlanJames1987 Год назад

    Good video but are you using Linix at 9:31 and Windows at 9:34? I haven't used Windows in a few years so I didn't know you could do this.

  • @aakaashshroff1672
    @aakaashshroff1672 3 года назад

    Can you please make a video on making your own speech synthesizer

  • @MineInjected
    @MineInjected 3 года назад

    Your voice is perfect for lend to a robot, don't worry, Im not offending you, its an awesome voice.

  • @ritikasingh2846
    @ritikasingh2846 3 года назад

    Thank you so much.I got a lot of help with this video. Sir, can you upload a video about calculating "Persentage of voice similarly score?”
    I will be very grateful to you.🙏

    • @justinross2664
      @justinross2664 2 года назад

      ruclips.net/video/iyl53zyz5zk/видео.html

  • @brendanguhle5761
    @brendanguhle5761 Год назад

    Yo , im a little confused, what do i have to do to get exactly what you did, except i want more hours specifically for me ? Thanks man ! Great vid! I subbed

  • @alexzab7653
    @alexzab7653 4 года назад +2

    Did you start a rap in 6:58? 😂😂😂

  • @danieleangelini6238
    @danieleangelini6238 3 года назад

    Fantastic Bro 💞

  • @itzfin433
    @itzfin433 4 года назад

    I am late I know but you can make it search the output on the google.com it will be most accurate

  • @peacekeepermoe
    @peacekeepermoe 3 года назад

    Great content dude. I haven't seen anything new for the last 7 months though. Hope you're well :)

  • @superaluis
    @superaluis 4 года назад

    Awesome content!

  • @Menuseto
    @Menuseto 3 года назад +1

    Any plans on continuing this project?