I Built a Personal Speech Recognition System for my AI Assistant

  • Published: Dec 3, 2024

Comments • 313

  • @theroyal1914
    @theroyal1914 4 years ago +31

    We need programmers like you, for advanced learning.

    • @justinross2664
      @justinross2664 2 years ago

      ruclips.net/video/iyl53zyz5zk/видео.html

  • @zacknawrocki
    @zacknawrocki 4 years ago +50

    I've been looking forward to this part of the series the most! I've been trying to create/run a voice assistant locally, and could not figure out how to apply speech recognition without relying on Google's Python module (which I was trying to avoid for privacy reasons, since it defeats the purpose of making one), and the HMM basics in my Intro to AI course weren't enough to implement it. This is fantastic.

  • @thiscrow
    @thiscrow 4 years ago +34

    at the beginning of the video: Oh I see !
    6:57 : Oh I ... oh ...

  • @totoma3297
    @totoma3297 4 years ago +259

    this is michael reeves from the universe where he decided to do something useful with his life

    • @isawcornflakes6201
      @isawcornflakes6201 3 years ago +9

      LMFAOOOO DIDNT HAVE TO DO HIM LIKE THAT 😭☝️

    • @aliveandwellinisrael2507
      @aliveandwellinisrael2507 3 years ago +4

      6:57 yep

    • @UmbraAtrox_
      @UmbraAtrox_ 3 years ago +3

      Wow, that's mean bro.

    • @DayoBrandon
      @DayoBrandon 2 years ago +1

      Imagine the greatest Michael collab. The two of them plus Michael Stevens (Vsauce).

  • @akulgoel9259
    @akulgoel9259 2 years ago +3

    This is so good, I remember seeing this video a year ago and wishing he'd continued the series.

  • @smeagol92055
    @smeagol92055 3 years ago +8

    I'm building my own wearable AI assistant and this series is **exactly** what I was looking for! Great stuff!

    • @kiss-bws
      @kiss-bws 3 years ago

      Can you make a tutorial?

    • @PonchoManOG
      @PonchoManOG 1 year ago

      dude no way same

    • @morraza3307
      @morraza3307 1 year ago

      @@PonchoManOG does this tutorial still work?

    • @PonchoManOG
      @PonchoManOG 1 year ago

      @@morraza3307 yes

  • @victor7ultimate
    @victor7ultimate 3 years ago

    After watching this video, I literally took off my hat as a mark of respect.
    Can't thank you enough.
    Thanks a million!

  • @fteoOpty64
    @fteoOpty64 4 years ago +4

    Loved the high-speed speech part! Well done. Excellent production, Mike! TQ

  • @kevinrtres
    @kevinrtres 3 years ago

    Thanks for the information. Just goes to show that the idea that we evolved is just sheer madness.

  • @joeyrivenbark5056
    @joeyrivenbark5056 3 years ago +1

    Hey man, I really like how you have written definitions in addition to your speaking, helps a lot.

  • @chrisw1462
    @chrisw1462 4 years ago +1

    A Cue Stick - used for playing billiards. Acoustic (a-COO-stick) - dealing with sound or audio energy.

  • @OtRatsaphong
    @OtRatsaphong 2 years ago +3

    Wow, just discovered your channel. Great work. I'm just starting my journey into Deep learning and speech recognition. Will be following your progress.

  • @michealhall7776
    @michealhall7776 3 years ago +1

    I'm enjoying discovering all these smaller AI channels.

  • @Alex.In_Wonderland
    @Alex.In_Wonderland 2 years ago

    OMG, thank you! Every other video I look up on this subject is just an ad for text-to-speech readers! Thanks for going into such detail about your thought process, but after looking at the rig you have vs. the one I've got... well... if it took you a handful of days, it'd take me a week or two LOL. Great video! Thanks a lot!

  • @rahulkumarm1446
    @rahulkumarm1446 4 years ago

    Brooo. I really don't know whether you coded this or just took reference from something... IDRC, you are AMMMMMAAAAZZZZIIINGGGGG. Hats off to you. You have a great talent, man... you could be the next CEO of any of the Big Four too...

  • @davidkim2389
    @davidkim2389 4 years ago +4

    When next?? Best Series ever!! Please post next!!

  • @jaskeeratsingh9929
    @jaskeeratsingh9929 1 year ago +1

    Why are the three Michaels I know all so smart: Michael Phi, Michael Reeves, Michael from Vsauce

  • @CreateYourWorld1
    @CreateYourWorld1 3 years ago +1

    Planning on creating my own Jarvis, this video has given me an insight.

    • @s1krrpilot
      @s1krrpilot 3 years ago

      Same, I'm going to call mine Alfred and integrate it into my helmet.

    • @madhu_mohanreddy
      @madhu_mohanreddy 1 month ago

      @@s1krrpilot no way

  • @vicehaiti914
    @vicehaiti914 4 years ago +37

    Keep going, bro. Full support.

  • @swarajshinde3950
    @swarajshinde3950 4 years ago +3

    Loved it, man. Great video!

  • @benceelmokovacs1422
    @benceelmokovacs1422 3 years ago

    Woo hoo!
    This thing for FREE?! And help for us on how to make it ours?!
    This data is worth a HUGE amount of money, but you shared it! I'm so surprised, in a good way!
    Thanks, thanks, thanks for it!!
    I really want to make my own virtual assistant, so big thanks for this video, for the data and for the help!
    Be blessed!

  • @sirlightshadowslayer473
    @sirlightshadowslayer473 1 year ago

    This was insane, gonna try to do something similar now. Thank you for the information.

  • @jumbejolly3129
    @jumbejolly3129 2 years ago

    Man, you're a genius. I wish I could do this. I have so many ideas but don't know where to start.

  • @kadaliakshay6770
    @kadaliakshay6770 9 months ago +1

    Bro, amazing rapping at 6:54

  • @chenjus
    @chenjus 4 years ago +1

    Really dope video. Can't wait to see your next one.

  • @kimkubik7547
    @kimkubik7547 2 years ago +1

    You Michael Rock!!!! Way to teach!!!

  • @CraftClone1
    @CraftClone1 3 years ago +2

    This is awesome! I wish there was more content from you.
    One AI hacker to another, keep on going!

  • @PritishMishra
    @PritishMishra 3 years ago +11

    Why aren't you uploading more videos? I have already seen this video just came here to say... plzz upload it's been 7 months now!

  • @alexkonopatski429
    @alexkonopatski429 3 years ago

    this series is so cool! keep it up bro

  • @罗杰瑞-p7g
    @罗杰瑞-p7g 2 years ago

    I think this is a very good video for me. It not only lets me learn something, but also makes me feel relaxed. Thank you.

  • @gauravshipurkar1570
    @gauravshipurkar1570 2 years ago

    Bro, you are freaking awesome!!! I love your content, it helps a lot.

  • @briankim49
    @briankim49 4 years ago +2

    Loved the video. You really showed me the tools I could use to build my own speech recognition model!

  • @r7rahuls
    @r7rahuls 4 years ago +19

    This guy is capable of making a real life JARVIS

    • @dabomb3864
      @dabomb3864 3 years ago +2

      There already is. There's even a Python module called JarvisAI that does exactly that.

    • @yentarachangethelife3897
      @yentarachangethelife3897 3 years ago

      @@dabomb3864 and how exactly do you know that??

  • @abramtaylor7575
    @abramtaylor7575 2 years ago

    Speech Morphing Inc has the BEST Voice Technology.

  • @mtaneesh1411
    @mtaneesh1411 4 years ago +6

    This was a really good video, dude. Can you tell me how to make the soundwave display that you had while testing the model?

  • @jtlunsford780
    @jtlunsford780 1 year ago +1

    Totally awesome. Understood about .5% (that's point 5%). Just got my headset set up in Win 10 and am loving it. You're awesome and I bow to your knowledge and expertise... thanks for the cool vid. It was not wasted on my limited knowledge, but it piqued my interest... thanks again... JT

  • @JoshuaHerath
    @JoshuaHerath 3 years ago

    This video is so high quality, wish you uploaded more.

  • @seannam1218
    @seannam1218 4 years ago +10

    This is incredibly educational. Thx for sharing ur knowledge for free!

  • @ZetaReticulli
    @ZetaReticulli 3 years ago

    This is excellent! (subscribed!)
    I had to quickly brush up my skills for a project I'm working on (will be open sourcing it soon!) - and this video was short, sweet and to the point! Thanks

  • @kalyanstock8058
    @kalyanstock8058 1 year ago

    Wow...who knew you can make AI teaching so much fun....You should make more videos

  • @muhammadrezahaghiri
    @muhammadrezahaghiri 3 years ago +7

    Can you make a TTS using deep learning? :) I really want to see that.

  • @itumelengmothapo2456
    @itumelengmothapo2456 3 years ago

    thank you man... this was fun to watch

  • @redtako.
    @redtako. 1 year ago

    THIS ONE WAS REALLY FUNNY gj love keep up the uploads :)

  • @scarlett_j
    @scarlett_j 2 years ago

    Sorry to inform you, but you pretty much rock, at the same time solved this so I don't have to.

  • @alexandergrayson9856
    @alexandergrayson9856 3 years ago +1

    Hey pal, your work's great I love it 🙌🙌

  • @fteoOpty64
    @fteoOpty64 4 years ago

    Love your War Machine! I built my first Pentium Pro dual-processor machine decades ago. It had a special power supply and I had to rig my generic case to fit the Tyan motherboard! It ran Linux then.

  • @sreerajsathish3635
    @sreerajsathish3635 4 years ago

    OMG, the video I was looking for. Thanks for making one... Full support❤

  • @MineInjected
    @MineInjected 3 years ago

    Your voice is perfect to lend to a robot. Don't worry, I'm not insulting you, it's an awesome voice.

  • @jairojosy5985
    @jairojosy5985 4 years ago

    Keep going and finish the project fast. I'm looking forward to the project being finished.

  • @UttamDas-ub5ow
    @UttamDas-ub5ow 4 years ago +1

    This man is really a hero 👍💓

  • @angelgabrielortiz-rodrigue2937
    @angelgabrielortiz-rodrigue2937 3 years ago

    Wow, great video, man. Really awesome stuff.

  • @PaulClifford
    @PaulClifford 3 years ago +3

    Parts 3 & 4 haven't materialized in a year. I'd love to see the rest.

  • @JasonTRogers
    @JasonTRogers 2 years ago +1

    Hey Michael, your videos on AI are fantastic! I haven't seen any videos lately and I am curious what you are doing these days.

  • @hemanth8195
    @hemanth8195 3 years ago

    This is really nice work dude

  • @nikhilhukkerikar6753
    @nikhilhukkerikar6753 4 years ago +11

    Wow, that was a pretty neat video, but as someone who's aspiring to be an AI dev, can you make a video that explains the code in detail, step by step? Loved it, awesome work!

    • @troopekyt
      @troopekyt 3 years ago +2

      Yes pls

  • @TungjangpoMusic-yq4rf
    @TungjangpoMusic-yq4rf 2 months ago +1

    Can you edit or train the speech recognition library so that it will be able to convert our dialect/unknown language to text?

  • @zikpin
    @zikpin 3 years ago

    This is what i was looking for, thanks

  • @w3w3w3
    @w3w3w3 1 year ago

    your videos are great bro! 🤝

  • @rangefreewords
    @rangefreewords 3 years ago +4

    Can you have this A.I. system set up to make a journal or blog that provides links in the speech to find materials that you had previously recorded? If so, I would be interested in an adaptive blog that can provide updates to previously mentioned material without painstakingly rewriting everything by hand.
    I really would like to know what materials you used in your audio recordings with this A.I. that made it all the more concise with your objective. I would also like to see how small you might be able to make this computer system, given your previous video with the Pi.

    • @Prometheus720
      @Prometheus720 2 years ago

      This seems like something to be done by integrating an assistant like this into another application like Obsidian.
      They need to be separate. There are lots of ways of passing external data into Obsidian and vice versa.

  • @mohammadrezakhalilishoja2701
    @mohammadrezakhalilishoja2701 4 years ago +1

    How did you upsample the data to create 50 hrs from 1 hr?

  • @Menuseto
    @Menuseto 3 years ago +1

    Any plans on continuing this project?

  • @nathancook8452
    @nathancook8452 2 years ago

    Excellent video, you helped me out tremendously

  • @pranavthakur6744
    @pranavthakur6744 3 years ago

    Can you make a detailed video on how you managed to make it? I want to learn it.

  • @microgamawave
    @microgamawave 2 years ago

    Can you make a video about gait recognition biometrics in Python?
    A model that recognizes you from your walk.

  • @vladiklass1890
    @vladiklass1890 4 years ago +1

    Cool video!!! This will help me a lot with my first NLP project. I wanted to get radio voice data and transcribe it. Any tips on that?
    Btw you should come up with a more memorable outro! :D

  • @waisyousofi9139
    @waisyousofi9139 2 years ago

    Thanks.
    Can you make a tutorial on the code implementation of speech recognition?
    That would be great.

  • @Tera2Space
    @Tera2Space 1 year ago

    Hello, when will there be a guide to creating your own speech synthesis?
    (TTS)

  • @adeniyiadeboye3300
    @adeniyiadeboye3300 4 years ago +1

    Thanks for this. I am going to thoroughly go through your speech recognition code on GitHub.

  • @SivaShankarsss
    @SivaShankarsss 4 years ago

    Eagerly waiting

  • @diegomartin6332
    @diegomartin6332 4 years ago

    Please post more videos about this!

  • @guidoscalise
    @guidoscalise 2 years ago

    What books/material would you recommend to someone wanting to learn to design models like the one you’re detailing around 7:36?

  • @numbah16
    @numbah16 2 years ago +1

    where tf are you Michael! Need some new videos already! Let's see what you have been up to with your monster rig

  • @tilahunanagaw6175
    @tilahunanagaw6175 2 years ago

    What an interesting presentation it is!!!

  • @ZpErMy
    @ZpErMy 1 year ago

    Hello, I was fascinated with your Speech Recognition System. I wonder, could your system recognize sung musical notes?
    that is, instead of words, musical notation.

  • @hg4lyfe
    @hg4lyfe 4 years ago

    Bro this is perfect wow thanks

  • @rachitahuja1257
    @rachitahuja1257 4 years ago +3

    Hi, it's a pretty neat tutorial!! Thanks a lot for the insight. I just have two questions:
    1. Why did you transpose your data before entering it into the LSTM layers?
    2. Why have you used MelSpectrogram instead of MFCC coefficients? (I mean, is there some specific reason for doing so?)

    • @oguzynx
      @oguzynx 4 years ago +2

      Because sonopy's mfcc_spec function gives the data reversed: the x axis is frequency and the y axis is time, but we need the reverse. That's why. PyTorch's MFCC or MelSpectrogram automatically gives the data in the way we want, but he chose sonopy because it is really fast. Check this out: github.com/MycroftAI/sonopy
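
A minimal sketch of the transpose discussed in this reply, using plain Python lists rather than sonopy or PyTorch. The helper name `to_time_major` and the 3 x 4 toy matrix are hypothetical; the only assumption is that the features arrive as one row per frequency bin and must become one row per time frame before going into a recurrent layer.

```python
def to_time_major(features):
    """Swap a (n_features, n_frames) matrix to (n_frames, n_features).

    `features` is a list of rows, one row per frequency bin, each row
    holding one value per time frame. The result has one row per frame.
    """
    if any(len(row) != len(features[0]) for row in features):
        raise ValueError("all rows must have the same number of frames")
    # zip(*rows) pairs up the i-th value of every row, i.e. one frame.
    return [list(frame) for frame in zip(*features)]


# Hypothetical toy input: 3 frequency bins, 4 time frames.
spec = [
    [0.1, 0.2, 0.3, 0.4],
    [1.1, 1.2, 1.3, 1.4],
    [2.1, 2.2, 2.3, 2.4],
]
frames = to_time_major(spec)
print(len(frames), len(frames[0]))  # 4 3
```

Each element of `frames` is now the feature vector for one time step, which is the layout a sequence model iterates over.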

  • @shannonsteward4034
    @shannonsteward4034 1 year ago

    Hi, great work! I just found your channel. Great job.

  • @yashrajhawle4
    @yashrajhawle4 4 years ago

    Thank you for sharing your knowledge !

  • @soonapaana24
    @soonapaana24 4 years ago

    You are totally awesome bro...👏👏👏

  • @tomhamser7216
    @tomhamser7216 3 years ago

    Could you show the code in detail, or how I can use it with another model? Could I use a DeepSpeech model for testing it, too?

  • @MrDonald911
    @MrDonald911 3 years ago +4

    Hey! I just discovered your channel, nice content! Your model seems to be overfitting; I think you should evaluate it on test data (and not the validation set). I would be curious to know how it would perform if you did hyperparameter tuning.
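
A rough sketch of the held-out test set this comment suggests, using only the Python standard library. The function name `split_dataset`, the 80/10/10 proportions, and the clip file names are assumptions for illustration and are not taken from the video's code.

```python
import random


def split_dataset(items, val_frac=0.1, test_frac=0.1, seed=0):
    """Shuffle once, then cut into disjoint train/validation/test lists.

    The test list is set aside and only used for the final evaluation,
    so tuning against the validation list cannot leak into that score.
    """
    if not 0 <= val_frac + test_frac < 1:
        raise ValueError("val_frac + test_frac must be in [0, 1)")
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # fixed seed: repeatable split
    n_test = int(len(shuffled) * test_frac)
    n_val = int(len(shuffled) * val_frac)
    test = shuffled[:n_test]
    val = shuffled[n_test:n_test + n_val]
    train = shuffled[n_test + n_val:]
    return train, val, test


# Hypothetical list of 100 audio clips.
clips = [f"clip_{i:03d}.wav" for i in range(100)]
train, val, test = split_dataset(clips)
print(len(train), len(val), len(test))  # 80 10 10
```

With a single speaker, it may also help to split by recording session rather than by clip, so that near-identical utterances do not land on both sides of the split.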

  • @DavidAlvesWeb
    @DavidAlvesWeb 3 years ago

    what a great video man, really inspiring! keep up the good work!
    PS: you deserve a better t-shirt bro 😅

  • @AlanJames1987
    @AlanJames1987 1 year ago

    Good video, but are you using Linux at 9:31 and Windows at 9:34? I haven't used Windows in a few years so I didn't know you could do this.

  • @tripathi26
    @tripathi26 4 years ago

    This is awesome! thanks man.

  • @rodios-md5du
    @rodios-md5du 1 year ago

    You are gold💛

  • @emrehankaraoglu4122
    @emrehankaraoglu4122 2 years ago +4

    This is such an amazing video. Congrats! I am wondering about the model deployment part. Are you going to share the code for the web interface? The sound wave and the text that appears below it are awesome.

    • @adibakhan2865
      @adibakhan2865 1 year ago

      Hey, did you find the code for deployment?

  • @yashkumar2716
    @yashkumar2716 4 years ago +2

    I just have one question... Why build a speech recognition model if we already have speech recognition libraries?

    • @orlando_kawaii
      @orlando_kawaii 9 months ago

      They are pre-trained on billions of data samples. Here, we're trying to build our own with less data and training time.

  • @itsjustsam04
    @itsjustsam04 3 years ago

    Wow I love it! I do have two questions tho. 1 how did you run it in ur Chrome browser. 2 how did u get the cool visual effects for while u were speaking?

  • @DrewNewmanEngineer
    @DrewNewmanEngineer 3 years ago +1

    Fantastic tutorial! The detail of explanation is remarkable. Nice code editor, which editor is that?

  • @maryamnazari1281
    @maryamnazari1281 1 year ago

    great job! i want to train a speaker identification project..any ideas where to start?

  • @notgegulclearly
    @notgegulclearly 3 years ago

    So is it possible to make the virtual assistant write to another command prompt instead of talking, to use it with a custom text-to-speech AI? Would love to see that.

  • @stereopsych6381
    @stereopsych6381 2 years ago

    Please upload more!

  • @aviavinav7208
    @aviavinav7208 4 years ago +1

    Great Video!

  • @justinfuruness7954
    @justinfuruness7954 3 years ago

    Do you have any recommendations for how to learn AI? How long did this take to train?

  • @Pixel_Recap
    @Pixel_Recap 2 years ago

    Can you put up a video about offline speech recognition and text to speech?

  • @itzfin433
    @itzfin433 4 years ago

    I am late, I know, but you can make it search the output on google.com; that will be the most accurate.

  • @vincebelansky425
    @vincebelansky425 1 year ago

    Thank you for this video and the insight into how to design a voice recognition system independently, from the ground up, as someone new to AI. Most videos tell you to connect to the internet and to a big server run by Google or someone else. The only question that I have is why use Python and not C or C++, especially since you are running on a Raspberry Pi with limited memory and a slower CPU, under the natural time constraints of real-time speech recognition?

  • @peacekeepermoe
    @peacekeepermoe 3 years ago

    Great content dude. I haven't seen anything new for the last 7 months though. Hope you're well :)

  • @letsplat__
    @letsplat__ 3 months ago

    Hi Michael!
    I am a student and highly interested in AI for building things.
    It would be great help if you could make a video or share some resources on how to get started.