How to Code a Machine Learning Lip Reading App with Python Tensorflow and Streamlit

Nicholas Renotte

37K views

Published: 24 Feb 2023
Get notified of the free Python course on the home page at www.coursesfromnick.com
Sign up for the Full Stack course here and use YOUTUBE50 to get 50% off:
www.coursesfromnick.com/bundl...
Hopefully you enjoyed this video.
💼 Find AWESOME ML Jobs: www.jobsfromnick.com
Get the Code: github.com/nicknochnack/LipNet
Links:
CTC Blog Post: distill.pub/2017/ctc
Oh, and don't forget to connect with me!
LinkedIn: bit.ly/324Epgo
Facebook: bit.ly/3mB1sZD
GitHub: bit.ly/3mDJllD
Patreon: bit.ly/2OCn3UW
Join the Discussion on Discord: bit.ly/3dQiZsV
Happy coding!
Nick
Science

Comments • 146

@Jaybaas 1 year ago ⁺⁷
Thank you so much Nic. This was mind blowing!! 🔥🔥
Phenomenal job and we appreciate the consistency of high quality tutorials. 🙏🏾
@NicholasRenotte 1 year ago
Thanks a mil Jabulani!! 🙏
@gavision97 6 months ago ⁺¹
Videos like these, where you transform a model into an app, are 🔥! First, we learn how to build the model, and secondly, we get to take that model and use it to create an application. Please keep those ideas coming!
@harshag8980 1 month ago
Do you have a running model?
@noobicorn_gamer 1 year ago ⁺³
I know 0 about coding but man this man is INFECTIOUS. 🎉🎉🎉🎉 I keep coming back for more lol
@ziiiiiiiiiii7325 1 year ago ⁺³
don't know if u still remember me, but two years ago I got an offer majoring in DS, was very unsure if I should pursue, I asked you, you replied, told me definitely, and gave me some advice. And I have been watching your channel throughout my school time.
Graduated in Dec. Today I just received my degree on paper. Thanks.
@NicholasRenotte 1 year ago ⁺²
CONGRATULATIONS!!!!!! All that hard work paid off 🙌🏼
@flychuban9896 1 year ago
Maybe the best ML video on YouTube
@BOGABOOfull 1 year ago ⁺¹
My guy Nich keeps elevating the level, amazing job. I see the new edits also 😅👌👌
@NicholasRenotte 1 year ago
😅 phew, editing took like three days but I dropped so many new things in it!! Thanks a mil @Miguel!!
@shakeershakeer885 1 year ago ⁺³
Well done Nic, we're waiting for another project
@NicholasRenotte 1 year ago
🙏🙏🙏
@birdropping 1 year ago ⁺¹
Thanks for the amazing video Nic! I've primarily been using Tensorflow in my work but am curious to learn PyTorch too. Would it be possible for a video implementing a model in both libraries side by side so that we can visualise at a glance the differences between the implementations?
@tobiasm161 7 months ago
thank you so much! high quality next level teaching and awesome prep for my TensorFlow Certificate exam which I will take. As well for my resume! thx Nic 🤩
@yashchoudhary2196 6 months ago
Hi tobiasm,
I want to enquire about this project.
@priyam66 1 year ago ⁺¹
Thanks Nic for creating such a wonderful video. Look forward to the video on the LLama model.
@NicholasRenotte 1 year ago
🙏🙏 cheers Priyam!
@Kevgas 1 year ago ⁺⁶
Awesome Nic! Can't wait to complete this along with you!
Also, not sure if you saw that Meta just released their LLaMA paper and is releasing the model soon. I know it's not feasible to recreate that in code, but would it be possible to pull the pretrained model and run it locally (similar to Stable Diffusion or BLOOM)? Also, apparently the model is small enough to run on a single GPU, well, at least that's what the paper claims. I still haven't finished reading through it but it seems like they might open source it...
@NicholasRenotte 1 year ago ⁺³
Got it on my list to look at this week. I’m getting up to speed on LLMs as we speak so should have something in that space soon!!
@Kevgas 1 year ago ⁺²
@@NicholasRenotte Thanks! Yea a LLM project would be pretty fun to follow along with you!!
@fishnchips6627 1 year ago ⁺¹
You're insanely brilliant!
@NicholasRenotte 1 year ago
🙏🙏🙏
@katu406 1 year ago ⁺¹
Waiting for this!!!🔥🔥🔥
@NicholasRenotte 1 year ago
Ayyy, thanks so much for checking it out!
@katu406 1 year ago
@@NicholasRenotte could you make a tutorial for a live feed from a camera? I'm waiting for it.
@Villagelife.stories 1 year ago
Hi again Nicholas. Thank you so much for the split payment method. Is it on purpose that the 3-month split adds up to $800 instead of the original one-time price of $697?
@leartmorina3011 1 year ago
Hi Nicholas! I really love your videos, you have something in you that makes me feel comfortable and friendly. Could you please do a video of an AI detecting floor and changing it to different products or can you just give me some advice on this. BTW I got a client for this so if you could help me I would really appreciate it.
@JonCookeBridge 1 year ago ⁺⁴
Fantastic as ever.
Are you using the streamlit defaults for styling?
@NicholasRenotte 1 year ago
Cheers Jon! Yep, all defaults are the same bar changing the layout to wide.
@AnilSingh-dy2yd 8 months ago ⁺¹
Hi Nicholas, is there a way this can be changed so it can work on live videos? If you can do a video on that, it would be great.
@sumukharaghavanm6466 4 months ago ⁺¹
Thank you so much sir.
Can you tell me how you calculated the alignments for the model?
Also, if we want to use our own video, what should we do?
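On the alignment question: the training data (the `s1` speaker folder mentioned further down the thread) appears to come from the GRID corpus, which ships ready-made `.align` files, so the alignments were most likely not calculated in the tutorial itself. A minimal sketch of reading such a file, assuming each line has the form `<start> <end> <token>` and that `sil` marks silence; the sample timings below are illustrative, not copied from the dataset:

```python
def parse_alignment(text, skip=("sil",)):
    """Return the spoken tokens from GRID-style alignment text.

    Assumes every useful line is "<start> <end> <token>".
    """
    words = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 3:
            continue  # ignore blank or malformed lines
        _start, _end, token = parts
        if token not in skip:
            words.append(token)
    return words


# Illustrative sample only (made-up timings)
sample = """0 23750 sil
23750 29500 bin
29500 34000 blue
34000 35500 at
35500 41000 f
41000 47250 two
47250 53000 now
53000 74500 sil"""

print(" ".join(parse_alignment(sample)))
```

For your own recordings you would need to produce files in the same shape yourself, for example by hand or with a forced-alignment tool; the thread does not say which tool, if any, was used.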
@sebasgutierrezjuansgutierr3666 1 year ago
Can you do a video about the laptop you use? What are your laptop's specifications?
@yogeshkendre5054 1 year ago ⁺¹
This is great. Can this work on a random video or a webcam?
@kevynkrancenblum5350 1 year ago ⁺¹
Thanks Nic wonderful tutorial again ! 🎉🎉🎉
Is there a way of doing this with our own video input, by extracting the face with mediapipe for example?
@NicholasRenotte 1 year ago ⁺¹
Yep!! Could use mediapipe to mask out the mouth and make detections from that. Once I get some time I want to get that done!
@mr.sweaty7862 1 year ago
I have a challenge for you Nic my man (Btw, thanks a ton for your great videos, I'm learning a lot from you) :)
Make a CNN classifier that specializes in 2 classes of grayscale images. But not just any grayscale images, they have to be a perfect circle of white with black around (good objects), and the other class is the same, but it has small imperfections in the white circle, like a small line of gray, or maybe the spraycan tool from paint.
@user-kk5oz3qn6y 1 year ago
Can you please upload a tutorial on how to fine-tune an existing model with some random dataset so that it performs well irrespective of the video we use for prediction? Maybe real-time prediction from the web camera. That would be helpful, thanks.
@HarshilDangar-tc3ns 1 year ago
Hey man, I wanted to know: can I use this model to make predictions on other videos which are around 60 seconds long?
@user-ru2pg4hq2i 9 months ago ⁺¹
Will it output anything if the video has no sound but only movement of the lips?
@surajmenon7631 5 months ago ⁺¹
Hey Nick, can you consider making a real-time lip reading model?
@wisdom_Lane 1 year ago ⁺¹
Thank you very much very much 🥰
@NicholasRenotte 1 year ago
Anytime!! Glad you liked it!
@roshanisilva7932 1 year ago
I want to make a face image dataset for my final-year research. I downloaded faces from Google Images using an image-downloading Chrome extension. I want to know whether it is OK to use that dataset or whether I need ethical clearance for this. Can you please share your thoughts?
@dipankarnandi7708 1 year ago
Hey Nick, I was wondering, how can the distance of an object be measured simply with the use of object detection?
Do we have to use the rescaling of the bounding box around that object to measure the distance? I don't want to use any sensors for distance measurement, simply a camera.
Any other suggestions would be great too.
@sourcex-yl5kk 1 month ago ⁺¹
It is difficult, brother. I don't know exactly if any high-level libraries are available to approximate the distance, but sensors are the way to go for accurate measurements. LIDAR or ultrasonic sensors work perfectly. I have to mention, though, that Elon actually worked on this. He negated the entire sensor-based approach and built image-based measurement systems from scratch. Do look into it for any ideas...
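One camera-only approximation that fits the bounding-box idea in the question is the pinhole-camera relation: distance ≈ focal length (in pixels) × real object height ÷ bounding-box height (in pixels). This is a rough sketch, not something from the video; it assumes you already know the object's real height and have calibrated the camera's focal length, and the function name and numbers are hypothetical:

```python
def estimate_distance(focal_length_px, real_height_m, box_height_px):
    """Pinhole-camera estimate of the distance to an object, in metres.

    focal_length_px: camera focal length in pixels (from calibration)
    real_height_m:   known real-world height of the object in metres
    box_height_px:   height of the detected bounding box in pixels
    """
    if box_height_px <= 0:
        raise ValueError("bounding box height must be positive")
    return focal_length_px * real_height_m / box_height_px


# Hypothetical numbers: 800 px focal length, 1.7 m tall person, 340 px tall box
print(round(estimate_distance(800, 1.7, 340), 2))
```

The estimate degrades when the object is partly occluded, tilted, or of unknown size, which is why the reply above points to dedicated sensors for accurate measurements.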
@learnaiwithjoelbunyan4764 1 year ago
You are a legend!!!!!
@muhammadanas7698 1 year ago ⁺²
Nan Nic! Awesome!
I just want to know something: I'm 14, and finding project ideas is super hard, so seeing such amazing projects is amazing.
And second, how do you work through a project... meaning from idea to deployment? Would highly appreciate your response.
@NicholasRenotte 1 year ago ⁺³
Damn you’re 14 and already getting into this. Amazing work!! Ideas sort of come from everywhere: colleagues, YouTube comments, research papers and clients.
For the second q: one step at a time! The CRISP-DM framework helps clarify the steps required :)
@muhammadanas7698 1 year ago
@@NicholasRenotte PS. Looks good. Sent you a DM on Linkedin. Do have a look!
@atulanand4824 1 year ago
Hi Nick, can we have some projects related to NLP too?
@jayasuryak9551 2 months ago
Hi sir, I would like to add option-based pop-up windows, so that a window shows options like "I want to eat" or "I want to go out for a walk". Is it possible to add that to this? If so, please help me out.
@collegemenduksai8995 1 year ago
Hey Nick,
Could you put custom videos to check the model's accuracy?
Also, these are the errors when I put a custom video :
1. For one of the videos I got this -> ValueError: Attempt to convert a value (None) with an unsupported type (<class 'NoneType'>) to a Tensor.
2. For the other one, this -> in assert_input_compatibility raise ValueError( ValueError: Input 0 of layer "sequential" is incompatible with the layer: expected shape=(None, 75, 46, 140, 1), found shape=(None, 61, 46, 140, 1)
@derilraju2106 1 year ago
It means that the video had only 61 frames, while the model was expecting 75.
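One way to get past the shape mismatch in the second error is to force every clip to exactly 75 frames before it reaches the model. The sketch below trims long clips and pads short ones by repeating the last frame; that padding strategy is an assumption (zero frames would be another option), the helper name is hypothetical, and predictions on padded clips may still be poor:

```python
def fit_frame_count(frames, target=75):
    """Trim or pad a sequence of frames so it has exactly `target` items.

    Short clips are padded by repeating the final frame (an assumption,
    not the tutorial's method); long clips are cut at `target` frames.
    """
    frames = list(frames)
    if not frames:
        raise ValueError("no frames were decoded from the video")
    if len(frames) >= target:
        return frames[:target]
    padding = [frames[-1]] * (target - len(frames))
    return frames + padding


# Stand-in "frames": integers instead of images, just to show the lengths
clip = list(range(61))
fixed = fit_frame_count(clip)
print(len(clip), "->", len(fixed))
```

The same idea applies to real image arrays: the list then holds 46x140 frames, and stacking the 75 items gives the `(75, 46, 140, 1)` input the error message asks for.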
@rishavchandra3026 1 year ago ⁺¹
Can someone pls give an idea on how to take inputs from our webcam and feed that to the model to get the lip reading output?
@danielsilva3383 1 year ago ⁺¹
Hi, great video!
Can you build a SaaS Startup with Streamlit? Or do you need JS? Thanks!
@NicholasRenotte 1 year ago ⁺²
Would need to get into JS eventually for things like auth, teams and payments.
@danielsilva3383 1 year ago
@@NicholasRenotte Thanks! Keep up the great content 💪
@rishichowdhury4296 1 year ago
Hey Nick, how do you think we can use the webcam for taking video input?
Also, can you make a tutorial on multiple-activity recognition? Thank you.
@derilraju2106 1 year ago
You could use OpenCV for the webcam.
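A rough sketch of that suggestion: grab a fixed number of grayscale frames from the default camera with `cv2.VideoCapture`. The helper names and the 75-frame default are assumptions taken from the expected input shape quoted earlier in the thread; cropping to the mouth region and normalising the frames are deliberately left out:

```python
def collect_frames(read_frame, n_frames=75):
    """Call read_frame() until n_frames are collected or it returns None."""
    frames = []
    while len(frames) < n_frames:
        frame = read_frame()
        if frame is None:
            break  # the source ran dry (camera unplugged, end of file, ...)
        frames.append(frame)
    return frames


def webcam_reader(device_index=0):
    """Return (read_frame, release) backed by an OpenCV webcam capture."""
    import cv2  # imported lazily so collect_frames works without OpenCV

    cap = cv2.VideoCapture(device_index)
    if not cap.isOpened():
        raise RuntimeError("could not open the webcam")

    def read_frame():
        ok, frame = cap.read()
        if not ok:
            return None
        # The model input quoted in this thread has one channel, so go gray
        return cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)

    return read_frame, cap.release


# Usage with a real camera (not executed here):
#   read_frame, release = webcam_reader()
#   frames = collect_frames(read_frame)
#   release()
```

Keeping the capture loop separate from the OpenCV code makes it easy to try the loop with a fake frame source before plugging in a camera.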
@parth.mandaliya 1 year ago
Can you please make a video on multiple object detection (like SSD or YOLO) but from scratch, not with transfer learning? (You've made a similar video, but with transfer learning using the TensorFlow model zoo.) There is not much out there on how to build it except for a book (Dive into Deep Learning), which is very difficult to follow.
@DGHere12 1 year ago ⁺¹
Hello Nic! I am 16 years old, and I have a question that I was not able to find an answer to anywhere. I have trained an MNIST model as well as a Fashion-MNIST model. But while working with YOLOv5, I got curious about how to create models like YOLOv5 from scratch in PyTorch. They do bounding boxes as well as classification.
@NicholasRenotte 1 year ago
I did a basic implementation with Tensorflow, I think it was around Face Detection. Should be up on the channel!!
@laetitia6720 4 months ago ⁺⁴
I have tried to run the code; I want to try it with a video, but it isn't working. It isn't compatible... a problem with the image format or something like that... And I don't understand how to get the alignments (??)
@abdallaabduwahab3036 2 months ago
Bro where can I contact you
@syit_417_vinamradholam3 11 months ago ⁺¹
If there is no sound, can it still predict the sentence?
@SaurabhSingh-oi5ev 1 year ago
Can we try this with any other video without having a video alignment file?
@Grounded441 1 year ago
Does a TensorFlow certificate help newbies get a job?
@nicolassuarez2933 1 month ago
Outstanding! But what happened with custom videos? Can we at least record a video saying the same sentences as the training videos and get a good result? I tried, but the code asks me for the align file... Thanks!
@vasanthkumar_24 1 year ago
Next project idea ! Noise cancellation??
@narsimhareddy4742 1 year ago
While accessing the dataset I am getting this error: Cannot retrieve the public link of the file. You may need to change the permission to 'Anyone with the link', or have had many accesses
@pema8230 3 months ago
Can someone give me an idea of which tool to use to align the video, so I get the same kind of data as the video.align file in the video?
@SaravanaKumar-ci1zk 1 year ago ⁺⁶
I am sure that at least a big tech will watch this ....
@NicholasRenotte 1 year ago ⁺¹
👀
@vishnusandeep1774 1 year ago
Can you make a tutorial on how to take user input directly from a webcam?
@Maddy_akil 9 months ago
great and it works fine 😇
@ENGCS_chetanr 1 year ago ⁺¹
Nick love u man how do u even do this please tell us the way u understand the paper and implement it ❤️❤️
Love from india ❤️❤️
@NicholasRenotte 1 year ago
Love that you enjoyed it 🙏
@yogeshkendre5054 1 year ago
How do we create a checkpoint, or where can we get this checkpoint file?
@munagalasairaghava3029 1 year ago ⁺¹
Nice video bro. Can you develop an ML model that takes a voice as input, determines whether the person is under pressure or not, and also gives suggestions? 😊
@narsimhareddy4742 1 year ago
I am unable to access the dataset, can you help me with it?
@karthikeyans7332 1 month ago
Hey Nick, can you tell me its algorithm and architecture?
@lokeshart3340 9 months ago ⁺¹
Can you combine lip reading and object detection in one app for deaf and blind people?
@arshas786 4 months ago ⁺¹
Data set?
@user-wr4yl7tx3w 1 year ago
Do software engineers report to product managers?
@doomatyourservice2218 3 months ago
How did you find the mouth coordinates? Can anyone explain?
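The expected input shape quoted earlier in the thread, `(None, 75, 46, 140, 1)`, suggests the mouth region is a fixed 46x140 crop of each frame rather than something detected per video. The sketch below shows such a static crop; the exact offsets (rows 190 to 236, columns 80 to 220) and the 288x360 frame size are assumptions that should be checked against the linked repository, and the helper name is hypothetical:

```python
def crop_region(frame, top=190, bottom=236, left=80, right=220):
    """Cut a fixed rectangle out of a frame stored as a list of pixel rows.

    The default offsets are assumed values that happen to give a 46x140
    crop; they only make sense for videos framed like the training data.
    """
    return [row[left:right] for row in frame[top:bottom]]


# Dummy 288x360 grayscale frame filled with zeros (assumed frame size)
frame = [[0] * 360 for _ in range(288)]
mouth = crop_region(frame)
print(len(mouth), len(mouth[0]))  # 46 140
```

With NumPy arrays the same crop is just `frame[190:236, 80:220]`. A hard-coded crop also helps explain why custom videos tend to fail: if the speaker's mouth is not in that rectangle, the model never sees it, which is where the mediapipe idea mentioned earlier comes in.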
@uveshsalmani6128 9 months ago
I need to add an option for translating it into 2 different languages. How can I do that? I need help.
@amanwithasleepyhead4006 1 year ago
Does it work on languages other than English?
@saumyavartak9167 4 months ago
Hey, can you make a video on real-time prediction, or on swapping the dataset for our own videos? I would be very grateful.
@happy-mo1qc 8 months ago ⁺¹
Can we make it real-time with a webcam?
Can you please make a project and a video for that?
@solomon_leo_27 1 year ago
Nic! Can I use this model to predict a video without alignments?
@joaopereira2751 11 months ago
I tried that and it didn't work.
@ngtankha 1 year ago
Please work on the topic of MTCNN, FaceNet
@ghostplays5291 4 months ago
For some videos I get a decompression bomb error, while for other videos I get correct predictions. Can anyone help?
@SaurabhSingh-oi5ev 1 year ago ⁺²
Hey, how does the video change when choosing another video? Isn't it hard-coded for test_video4.mp4 only? Please tell me.
@NicholasRenotte 1 year ago
We pick up the video from the s1 folder then convert it to test_video.mp4. So we're actually converting on the fly and only reading the converted video!!
@SaurabhSingh-oi5ev 1 year ago
@@NicholasRenotte thanks Nic, I don't know why it's not working for me. Maybe the reason is that ffmpeg is not recognized as an internal or external command... everything is working fine, just the video is not updating.
@NicholasRenotte 1 year ago
@@SaurabhSingh-oi5ev might need to install ffmpeg. Check this out: phoenixnap.com/kb/ffmpeg-windows
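The conversion step described in this exchange can be sketched as below: check that `ffmpeg` is on the PATH before calling it, then re-encode the selected clip to `test_video.mp4`. The helper names are hypothetical and the `libx264` codec choice is an assumption; the app in the repository may invoke ffmpeg differently:

```python
import shutil
import subprocess


def build_ffmpeg_command(src, dst="test_video.mp4"):
    """Build the ffmpeg argument list that re-encodes src to an H.264 mp4."""
    # -y overwrites dst, so the app always reads the freshly converted file
    return ["ffmpeg", "-y", "-i", src, "-vcodec", "libx264", dst]


def convert_video(src, dst="test_video.mp4"):
    """Convert src to dst with ffmpeg, failing loudly if ffmpeg is missing."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg was not found on PATH; install it first")
    subprocess.run(build_ffmpeg_command(src, dst), check=True)
    return dst


print(" ".join(build_ffmpeg_command("clip.mpg")))
```

If the conversion fails silently, the app keeps showing the previously converted file, which matches the "video is not updating" symptom; raising an error when ffmpeg is missing makes that failure visible.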
@giuseppedimaria6253 1 year ago ⁺¹
would it be better to retrain the model on the other datasets?
@NicholasRenotte 1 year ago ⁺¹
Yep! I didn't bother because I eventually want to create my own dataset and train it on other words and people!
@UnKnown-lp9gl 1 year ago
What's your pc mate?
@anjanikumarbose5489 1 year ago
How can a robotic arm grip a real-world object based on its image, using TensorFlow?
@manav-khadka 1 year ago
Hey Nicholas, please make a video on Kaggle.
@mohammedthanseer146 4 months ago
Can you make a tutorial on real-time lip reading?
@soumikrakshit8452 1 year ago ⁺¹
I have a doubt: all the videos that you are using to test the model were already used to train it, so that's why you get good predictions. What if we test the model on a video it has never seen before? Will we still get good accuracy?
@NicholasRenotte 1 year ago
You are spot on. It makes sense that they would perform well on data similar to mine as that's the sample that the model has seen during training. For this to perform well in another use case you would need to fine tune on that data.
@soumikrakshit8452 1 year ago ⁺¹
@@NicholasRenotte Can you please upload a tutorial on that like how to fine tune it so that it performs well irrespective of the video we use for prediction. Maybe real time prediction from the web camera. That would be helpful, Thanks.
@user-di5zq3yv1e 1 year ago
Can we upload our own video to get the text? Please, can someone help me?
@erickmacias5153 1 year ago
Create a video about fine-tuning chat neo plzzz
@picklenickil 1 year ago ⁺²
It's a complete pain in the ass loading the 'latest' TensorFlow GPU version on Windows 11.
And if we use containers as a solution, OpenCV starts throwing tantrums over video files.
So yes, please do everything in PyTorch.
@NicholasRenotte 1 year ago ⁺¹
Oh, I didn't know. Don't know why they keep making it so hard. Anyway, I think that'll be the gameplan soon!
@gameguy7348 9 months ago
yo bro but what if i want it to be in real time by using my webcam
@pranav-patil 1 year ago
Will this model work for a custom video?
@prajwal_said 10 months ago
Have you tried the model on a custom video?
@GundamExia88 1 year ago
Can it do lip reading for other languages, like Spanish?
@sourcex-yl5kk 1 month ago
Guys, has anyone tried running it locally? How does it perform on other videos?
@Nearnface 2 months ago
wowowooww
@febinrajan1637 8 months ago
Can I upload my own video? Will it read my lips?
@makhmud_jumanazarov 1 year ago
coooooooool
@tarthechef 1 month ago
Can I do this when I know absolutely nothing about coding?
@bobdillon1138 1 year ago
Please finish Forza
@MattSullivn 1 year ago ⁺¹
Dave, I’m afraid I can’t do that
@NicholasRenotte 1 year ago
🤖🤖🤖
@sauravyadav2018 2 months ago
plz do it for webcam too..
@arjunramesh8469 2 months ago
Is the project working for you?
@bossgd100 1 year ago ⁺¹
Great Now I have the same abilities as a Russian spy who can read lips
@NicholasRenotte 1 year ago
LOL, skillset enhanced!
@marymaegregore5209 1 year ago
Android App Tensorflow Object Detection Pleaseeee 😭😭😭😭😭😭😭
@deadlox.d 1 year ago
But is this more accurate and efficient than speech-to-text 🙃
@gauravboob8737 3 months ago
raise ValueError(
ValueError: File format not supported: filepath=E:/BackUp/Desktop/LipNet_Final/LipNet/models - checkpoint 50/models/checkpoint. Keras 3 only supports V3 `.keras` and `.weights.h5` files, or legacy V1/V2 `.h5` files.
That's what I get when I load the checkpoints.
@gauravboob8737 3 months ago
Do we have checkpoints in .h5 format uploaded?
@gauravboob8737 3 months ago
Any alternative to this?
@OfficialGamer-kc4dz 2 months ago
Hey, did you find a solution to this?
@arjunramesh8469 2 months ago
Has anyone found the solution?
@vareable646 2 months ago
SOLUTION
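The reply above is cut off in this capture, so the following is not that commenter's fix. Going only by the error text, Keras 3 refuses the checkpoint because the file name is not one of the formats it lists. A small sketch of that check, plus one commonly suggested workaround (falling back to legacy Keras through the `tf-keras` package), which is untested against this repository; the helper name is hypothetical:

```python
import os

# File suffixes that the quoted error message says Keras 3 accepts
KERAS3_SUFFIXES = (".keras", ".weights.h5", ".h5")


def keras3_can_load(path):
    """Return True if the file name has a suffix Keras 3 says it supports."""
    return str(path).lower().endswith(KERAS3_SUFFIXES)


print(keras3_can_load("models/checkpoint"))        # False
print(keras3_can_load("models/lipnet.weights.h5"))  # True

# Possible workaround (assumption): with the tf-keras package installed,
# setting this before importing tensorflow makes tf.keras use Keras 2,
# which can still read TensorFlow-format checkpoints.
os.environ.setdefault("TF_USE_LEGACY_KERAS", "1")
```

Another route is to run the project on an older TensorFlow release that still bundles Keras 2 (2.15 or earlier), then optionally re-save the weights as `.weights.h5` for newer versions.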
