I've been looking forward to this part of the series the most! I've been trying to create/run a voice assistant locally, and could not figure out how to apply speech recognition without relying on Google's Python module (which i was trying to avoid for privacy reasons, defeating the purpose of making one) and the HMM basics in my Intro to AI course weren't enough to implement it. This is fantastic.
Wow, just discovered your channel. Great work. I'm just starting my journey into Deep learning and speech recognition. Will be following your progress.
omg, thank you! every other video I look up on this subject is just an ad for a text-speech readers! thanks for going into such detail about your thought process, buut after looking at the rig you have vs the one I've got ... well. . . if it took you a handful of days, it'd take me a week or two LOL great video! thanks a lot!
Brooo.I really dont know whether u coded this or just took reference from something....idrc u are AMMMMMAAAAZZZZIIINGGGGG.Hats off 2 u.U have a great talent man.......u could be the next ceo of any big fours too....
Wo hooo! This thing for FREE?! And help for us how to make it ours?! This data worth a HUGE amount of money, but you shared it! I'm so much surprised, in the good term! Thanks, thanks, thanks for it!! I really want to make an own Virtual Assistant, so big thanks for this video, for the data and for the help! Be blessed!
Totally awesome. Understood about .5% (that's point 5%). Just got my headset set up in Win 10 and am loving it. You're awesome and I bow to your knowledge and expertise....thanks for the cool vid. It was not wasted on my limited knowledge, but it peaked my interest...thanks again...JT
This is excellent! (subscribed!) I had to quickly brush up my skills for a project I'm working on (will be open sourcing it soon!) - and this video was short, sweet and to the point! Thanks
Love your War Machine!. I build my first Pentium Pro Dual Proc decades ago. It had a special powersupply and I had to rig my Generic case to fit the Tyan motherboard!. It ran Linux then.
Wow that was a pretty neat video but as someone who’s aspiring to be a AI dev can you make a video explains the code in detail like a stepper! Loved it awesome work!
Can you have this A.I. system set up to make a journal or blog to provide links in the speech to find materials that you had previously recorded? If so, I would be interested to an adaptive blog that can provide updates to previously mentioned material without hand-stakingly rewriting everything. I really would like to know what materials you used in your audio recordings with this A.I. that made it all the more concise with your objective. I would also like to see how small you might be able to have this computer system since your previous video with Pi.
This seems like something to be done by integrating an assistant like this into another application like Obsidian. They need to be separate. There are lots of ways of passing external data into Obsidian and vice versa.
Cool video!!! This will help me a lot with my first NLP project. I wanted to get radio voice data and transcribe it. Any tips on that? Btw you should come up with a more memorable outro! :D
Hello, I was fascinated with your Speech Recognition System. I wonder, could your system recognize sung musical notes? that is, instead of words, musical notation.
Hi it's a pretty neat tutorial!! thanks a lot for the insight. I just have two questions :- 1. Why did u transpose your data before entering it into LSTM layers? 2. Why have you used MelSpectrogram instead of MFCC coefficients? ( I mean is there some specific reason of doing so?)
because sonopy's function mfcc_spec gives the data reverse. so x axis is the frequency and the y axis is time. But we need the reverse. that's why. Pytorch's MFCC or MelSpecgram automatically gives the data in a way we want but he chose that sonopy because it is really fast. Check this out github.com/MycroftAI/sonopy
Hey ! I just discovered your channel, nice content ! Your model seems overfitting, I think you should evaluate it on a test data (and not the validation). I would be curious to know how it would perform if you do hyperparameter tuning.
This is such a amazing video. Congrats! I am wondering about model deployment part. Are you going to share the coding part of ıweb interface? The sound wave and the text that occurs below the sound wave are awesome.
Wow I love it! I do have two questions tho. 1 how did you run it in ur Chrome browser. 2 how did u get the cool visual effects for while u were speaking?
so it is possyble to make the virtual assistant write on another command prompt instead of talking, to use it with a custom text to speech AI? would love to see that
Thank you for this video and the insight of how to design a voice recognition system independently from the ground up by an newly to AI. Most videos tell you to connect the internet and to a big server by google or someone else. The only question that I have is why use python and not C or C++, especially since you are running a raspberry pi with limited memory and slower CPU and the natural time restraints of real-time speech recognition?
Hi Michael! I am a student and highly interested in AI for building things. It would be great help if you could make a video or share some resources on how to get started.
we need programmers like you. For advance learning.
ruclips.net/video/iyl53zyz5zk/видео.html
I've been looking forward to this part of the series the most! I've been trying to create/run a voice assistant locally, and could not figure out how to apply speech recognition without relying on Google's Python module (which i was trying to avoid for privacy reasons, defeating the purpose of making one) and the HMM basics in my Intro to AI course weren't enough to implement it. This is fantastic.
ruclips.net/video/iyl53zyz5zk/видео.html
at the beginning of the video: Oh I see !
6:57 : Oh I ... oh ...
this is michael reeves from the universe where he decided to do something useful with his life
LMFAOOOO DIDNT HAVE TO DO HIM LIKE THAT 😭☝️
6:57 yep
Wow, that's mean bro.
ruclips.net/video/iyl53zyz5zk/видео.html
Imagine the greatest Michael colab. The two of them plus Michael Stevens (vsauce)
This is so good, I remember seeing this video a year ago and wishing he'd continued the series.
I'm building my own wearable AI assistant and this series is **exactly** what I was looking for! Great stuff!
Can you make tutorial
ruclips.net/video/iyl53zyz5zk/видео.html
dude no way same
@@PonchoManOG does this tutorial still work?
@@morraza3307 yes
After watching this video, I literally took off my hat as a mark of respect to this.
Cant thank you enough.
Thanks a million
Loved the high speed speech part!. Well done. Excellent production Mike!. TQ
ruclips.net/video/iyl53zyz5zk/видео.html
Thanks for the information. Just goes to show that the idea that we evolved is just sheer madness.
Hey man, I really like how you have written definitions in addition to your speaking, helps a lot.
A Cue Stick - used for playing billiards. Acoustic (a-COO-stick) - dealing with sound or audio energy.
Wow, just discovered your channel. Great work. I'm just starting my journey into Deep learning and speech recognition. Will be following your progress.
I'm enjoying discovering all these smaller ai channels
omg, thank you! every other video I look up on this subject is just an ad for a text-speech readers! thanks for going into such detail about your thought process, buut after looking at the rig you have vs the one I've got ... well. . . if it took you a handful of days, it'd take me a week or two LOL great video! thanks a lot!
Brooo.I really dont know whether u coded this or just took reference from something....idrc u are AMMMMMAAAAZZZZIIINGGGGG.Hats off 2 u.U have a great talent man.......u could be the next ceo of any big fours too....
When next?? Best Series ever!! Please post next!!
why are the three micheals i know all so smart : Micheal Phi, Micheal Reeves, Micheal from VSauce
Planning on creating my own Jarvis, this video has given me an insight.
Same, I'm going to call mind Alfred and integrate it into my helmet
@@s1krrpilot no way
Keep going bro.full support
Loved it Man , Great Video !
Wo hooo!
This thing for FREE?! And help for us how to make it ours?!
This data worth a HUGE amount of money, but you shared it! I'm so much surprised, in the good term!
Thanks, thanks, thanks for it!!
I really want to make an own Virtual Assistant, so big thanks for this video, for the data and for the help!
Be blessed!
7:05 7:18 7:26 11:09 12:09-12:39 13:15 14:14 14:29 14:59
This was insane, gonna try to do similar now, thank you for the informations
Man your a genius man. I wish I could do this. I have some many ideas but dont know where to start.
bro amazing wrapping on 6:54
Really dope video. Can't wait to see your next one.
You Michael Rock!!!! Way to teach!!!
This is awesome! I wish there was more content from you
One Ai hacker to another, keep on going!
Why aren't you uploading more videos? I have already seen this video just came here to say... plzz upload it's been 7 months now!
this series is so cool! keep it up bro
i think this is a very good video for me ,It can not only let me learn some knowledge, but also make me feel relaxed.thank you
Bro you are freaking awesome!!! i love your content, helps a lot.
Loved the video. You really showed me the tools I could use to build my own speech recognition model!
ruclips.net/video/iyl53zyz5zk/видео.html
This guy is capable of making a real life JARVIS
There is already. Theres even a python module called JarvisAI and does exactly that.
@@dabomb3864 and how exactly do you know that??
Speech Morphing Inc has the BEST Voice Technology.
This was a really good video dude. Can you tell me how to make the soundwave display that you had while testing the model
Totally awesome. Understood about .5% (that's point 5%). Just got my headset set up in Win 10 and am loving it. You're awesome and I bow to your knowledge and expertise....thanks for the cool vid. It was not wasted on my limited knowledge, but it peaked my interest...thanks again...JT
This video is so high quality wish you uploaded more
This is incredibly educational. Thx for sharing ur knowledge for free!
ruclips.net/video/iyl53zyz5zk/видео.html
This is excellent! (subscribed!)
I had to quickly brush up my skills for a project I'm working on (will be open sourcing it soon!) - and this video was short, sweet and to the point! Thanks
Wow...who knew you can make AI teaching so much fun....You should make more videos
Can you make a TTS using deep learning? :) I really want to see that.
thank you man... this was fun to watch
THIS ONE WAS REALLY FUNNY gj love keep up the uploads :)
Sorry to inform you, but you pretty much rock, at the same time solved this so I don't have to.
Hey pal, your work's great I love it 🙌🙌
Love your War Machine!. I build my first Pentium Pro Dual Proc decades ago. It had a special powersupply and I had to rig my Generic case to fit the Tyan motherboard!. It ran Linux then.
Omg the video i was looking for thank for making one..... Full support❤
Your voice is perfect for lend to a robot, don't worry, Im not offending you, its an awesome voice.
Keep going on and finish the project fast. I'm looking ahead for the project to be finished
This man is really a hero 👍💓
Wao, great video man. Really awesome stuff
Parts 3 & 4 haven't materialized in a year. I'd love to see the rest.
yes
Hey Michele, your videos on AI is fantastic! I haven’t seen any videos lately and I am course what you are doing these days?
This is really nice work dude
Wow that was a pretty neat video but as someone who’s aspiring to be a AI dev can you make a video explains the code in detail like a stepper! Loved it awesome work!
Yes pls
ruclips.net/video/iyl53zyz5zk/видео.html
Can you edit or train speedchrecognition library so that it will able to convert our dialect/unknown language to text
This is what i was looking for, thanks
your videos are great bro! 🤝
Can you have this A.I. system set up to make a journal or blog to provide links in the speech to find materials that you had previously recorded? If so, I would be interested to an adaptive blog that can provide updates to previously mentioned material without hand-stakingly rewriting everything.
I really would like to know what materials you used in your audio recordings with this A.I. that made it all the more concise with your objective. I would also like to see how small you might be able to have this computer system since your previous video with Pi.
This seems like something to be done by integrating an assistant like this into another application like Obsidian.
They need to be separate. There are lots of ways of passing external data into Obsidian and vice versa.
how did you up-sampled data to create 50 hrs from 1 hr?
Any plans on continuing this project?
Excellent video, you helped me out tremendously
Can you make a detailed video how did you manage to make it. I want to learn it.
You can make a video about gait recognition biometrics in python
recognized you from your walk model
Cool video!!! This will help me a lot with my first NLP project. I wanted to get radio voice data and transcribe it. Any tips on that?
Btw you should come up with a more memorable outro! :D
Thanks ,
Can you make a tutorial on code implementation of speech recognition.
that would be great.
Hello, when will there be a guide to creating your own speech synthesis?
(TTS)
Thanks for this..I am going to thoroughly go through the speech recognition your code on Github
Eagerly waiting
Please post more videos about this!
What books/material would you recommend to someone wanting to learn to design models like the one you’re detailing around 7:36?
where tf are you Michael! Need some new videos already! Let's see what you have been up to with your monster rig
what interesting presentation it is!!!
Hello, I was fascinated with your Speech Recognition System. I wonder, could your system recognize sung musical notes?
that is, instead of words, musical notation.
Bro this is perfect wow thanks
Hi it's a pretty neat tutorial!! thanks a lot for the insight. I just have two questions :-
1. Why did u transpose your data before entering it into LSTM layers?
2. Why have you used MelSpectrogram instead of MFCC coefficients? ( I mean is there some specific reason of doing so?)
because sonopy's function mfcc_spec gives the data reverse. so x axis is the frequency and the y axis is time. But we need the reverse. that's why. Pytorch's MFCC or MelSpecgram automatically gives the data in a way we want but he chose that sonopy because it is really fast. Check this out github.com/MycroftAI/sonopy
ruclips.net/video/iyl53zyz5zk/видео.html
hi great work I just found your channel great job
Thank you for sharing your knowledge !
You are totally awesome bro...👏👏👏
Could you show the code in detail or how I can use it with another model? Could I use a deepspeech model for testingit, too?
Hey ! I just discovered your channel, nice content ! Your model seems overfitting, I think you should evaluate it on a test data (and not the validation). I would be curious to know how it would perform if you do hyperparameter tuning.
ruclips.net/video/iyl53zyz5zk/видео.html
what a great video man, really inspiring! keep up the good work!
PS: you deserve a better t-shirt bro 😅
Good video but are you using Linix at 9:31 and Windows at 9:34? I haven't used Windows in a few years so I didn't know you could do this.
This is awesome! thanks man.
You are gold💛
This is such a amazing video. Congrats! I am wondering about model deployment part. Are you going to share the coding part of ıweb interface? The sound wave and the text that occurs below the sound wave are awesome.
Hey did you found the code for deployment
I just have one question... Why we build one speech recognition model if we already have some speech recognition library
They are pre-trained on billions of data samples. Here, we're trying to build our own with less data and training time.
Wow I love it! I do have two questions tho. 1 how did you run it in ur Chrome browser. 2 how did u get the cool visual effects for while u were speaking?
Fantastic tutorial! The detail of explanation is remarkable. Nice code editor, which editor is that?
Looks like vs code
ruclips.net/video/iyl53zyz5zk/видео.html
great job! i want to train a speaker identification project..any ideas where to start?
so it is possyble to make the virtual assistant write on another command prompt instead of talking, to use it with a custom text to speech AI? would love to see that
Please upload more!
Great Video!
Do you have any recommendations for how to learn AI? How long did this take to train?
Can you put a video about offline speech recognition and text to speech
I am late I know but you can make it search the output on the google.com it will be most accurate
Thank you for this video and the insight of how to design a voice recognition system independently from the ground up by an newly to AI. Most videos tell you to connect the internet and to a big server by google or someone else. The only question that I have is why use python and not C or C++, especially since you are running a raspberry pi with limited memory and slower CPU and the natural time restraints of real-time speech recognition?
Great content dude. I haven't seen anything new for the last 7 months though. Hope you're well :)
Hi Michael!
I am a student and highly interested in AI for building things.
It would be great help if you could make a video or share some resources on how to get started.