If you are watching this during lockdown, you are one of the rare species on this earth. Many students are wasting their time on Facebook, YouTube, Twitter, Netflix, watching movies and playing PUBG, but you are working hard to achieve something. All the best! ...NITJ student here
Self-love is important... what's NITJ?
@@aasavravi5919 it's NIT Jodhpur
Superb... 100% true, well said.
@@aasavravi5919 NIT Jalandhar.
@@techtrader8434 Jamshedpur/Jaipur are also options
Ravi, for the first time in this series I felt lost. I loved your whiteboard presentations.
You really are lost; his name is Krish.
I agree... This format is harder to follow.
@@shubhamsongire6712 lmao
@@shubhamsongire6712 🤣🤣
I have recently been thinking of Data Science and Machine Learning, Krishna Naik's videos were very helpful in framing my decision. Thank you Krishna Naik.
That was an awesome journey.Now I have finished all the videos in the deep learning playlist. If you notice I have written a comment on each of the videos which was unnecessary.Now I will commence my journey to the ineuron course of Deep Learning with NLP which has commenced on the 18th of April.
Oh Krish I wonder should review all the videos once again before commencing the journey of ineuron .Not a bad thought indeed.
Ha!Ha!.Bye Krish .Stay blessed . Keep contributing.
I also see your comments on every video, ha ha.
Hello sir, is the concept of the video clear to you? If yes, please help me with the same. Please reply at ritish_m@outlook.com
I was really struggling to understand the core concept of LSTM. This really helped me. Thank you very much. Also, the blog is really awesome.
@Krish Naik great video! The first video that gets to the point and explains the concepts in detail.
Sir, I've fallen in love with your teaching. I was trying to understand NLP for the first time, because I chose it as my research work in my final year, and sir, your videos helped me a lot. Love you so much, sir.
Hi, thanks for your wonderful explanation.
In my opinion, this detailed video is more valuable for researchers than for programmers who just want to use LSTMs or RNNs.
Thank you, sir! It's great content, and I'm almost through your NLP playlist.
Me watching other YT videos: Watch then like/dislike/do nothing
Me watching Krish sir's videos: First like then watch
Thank you so much for explaining so many things. I learned practical ML/DL end to end from your videos. A big thumbs up from my side. I will definitely share your channel with anyone who wants to dive into ML/DL/DS.
Man, you explain really well. I was confused between GRU and LSTM, and your explanation was wonderful. Your skill earned your channel one more subscriber. Thank you for such videos.
At 20:27, when the context is similar, sigmoid(y) is the vector [1 1 1 1], so why would sigmoid(y)*tanh(y) give me the vector [0 0 0 0]? Looking at the sigmoid and tanh graphs, when sigmoid(y) tends to 1, tanh(y) also tends to 1, so sigmoid(y)*tanh(y) should give [1 1 1 1] as well.
I have the same doubt, please reply.
same doubt
Same doubt
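For everyone sharing this doubt: one possible resolution is that the input gate's sigmoid and the candidate's tanh are applied to *different* linear transforms of [h_{t-1}, x_t], each with its own learned weight matrix, so their outputs need not move together. A small NumPy sketch (all weight values below are made up purely for illustration):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# If sigmoid and tanh saw the SAME large pre-activation y, the product
# would indeed be near 1, exactly as the comment argues:
y = np.array([4.0, 4.0])
near_one = sigmoid(y) * np.tanh(y)

# But in an LSTM the input gate and the candidate use SEPARATE weights
# on the same concatenated vector (made-up values for illustration):
concat = np.array([1.0, -1.0])
W_i = np.array([[2.0, 0.0], [0.0, 2.0]])    # hypothetical input-gate weights
W_c = np.array([[-2.0, 0.0], [0.0, -2.0]])  # hypothetical candidate weights
i_t = sigmoid(W_i @ concat)    # roughly [0.88, 0.12]
c_tilde = np.tanh(W_c @ concat)  # roughly [-0.96, 0.96]
print(i_t * c_tilde)           # entries can be near 0 or even negative
```

So the product going to [0 0 0 0] is something the network can *learn* to produce for a given context; it is not forced by the shapes of the two activation curves alone.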
Amazing explanation, you made it very simple and clear
This is the best explanation on LSTM.. really thanks
Wonderful video. Again great explanation. I think I might run out of words after a few more videos.
Nice lecture, sir. Please try to work through just one numerical example manually, for at least one epoch. It would really help us understand LSTM in depth. Thank you.
The best explanation, as usual. Thank you so much for your effort.
Watching this in 2024 from Pakistan... he saved me from failing my NLP course... thank you!
Thanks so much, my brother. Great explanation. Allah bless you.
There's something I don't get. We know the vanishing gradient problem happens because the derivative of the sigmoid is at most 0.25 (and of tanh at most 1), so after many layers the gradient can no longer meaningfully update the weights. However, here we are using sigmoid again. Aren't we going to have the same problem?
Thank you, sir, for such videos. Please arrange them in a playlist or on your website so they are easy to access. Thank you so much.
Thank you for your unconditional service.
Hi sir, I have a serious doubt. At 20:31 you say tanh will give an output of 0 0 0 0... if the context has not changed. How does this happen? Please elaborate. I have spent a lot of time thinking about it but still couldn't find the answer.
Did you find an answer to this, bro? I came across the same doubt. It would be great if Krish could explain it.
@krish naik wonderful explanation
Finest explanation of such a difficult topic, hats off!! 🫡
I have been waiting for this video so long.
The best explanation I have ever seen.
Sir, please upload videos on Boltzmann machines... the math equations behind them feel very complicated to understand. Your videos have helped me a lot in learning ML/DL concepts.
Love your videos ♥️♥️
Hi, can you please tell me which concepts in ML and DL you feel are mathematically complicated to understand?
Hey Krish, this was a very informative video on the subject. Thanks for the lovely work. I'm not sure if I can request a topic, but it's one that I and many others would be interested in: since you come from the industrial side of AI, it would be nice to see some future content about ML model encryption and resources for production. Great job on the YouTube playlists.
Thank you so much sir, for such a great explanation
What happens to the -1 values of the tanh and sigmoid elementwise product when the information is added to the cell state in an LSTM?
Nice! Simple explanations... much appreciated, sir.
I have a feeling the equation mentioned at 10:40 isn't right...
For F_t = sigmoid(W_f · [h_{t-1}, x_t] + b_f),
h_{t-1} should already have its weights associated, i.e., h_{t-1} = sigmoid(W_{t-1} · x_{t-1} + b_{t-1}), correct?
Which means that in W_f we shouldn't be factoring W_{t-1} in again, but only using the current weights.
Can someone comment on this and correct me if I'm wrong, please?
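For what it's worth, in the standard LSTM formulation W_f is its own learned parameter matrix applied to the concatenated vector; it is separate from whatever weights produced h_{t-1}, which arrives at the gate as an already-computed value. A minimal NumPy sketch with made-up sizes:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

hidden, n_in = 3, 2                   # hypothetical sizes for illustration
rng = np.random.default_rng(0)

h_prev = rng.standard_normal(hidden)  # h_{t-1}: already computed upstream
x_t = rng.standard_normal(n_in)       # current input

# W_f is a separate, learned parameter of the forget gate; it does not
# "contain" the weights that produced h_{t-1}.
W_f = rng.standard_normal((hidden, hidden + n_in))
b_f = np.zeros(hidden)

f_t = sigmoid(W_f @ np.concatenate([h_prev, x_t]) + b_f)
print(f_t.shape)   # (3,) -- one forget value per cell-state dimension
```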
Hey Krish, could you explain how each of the input features is mapped to the RNN units and how the outputs are then formed? I'm really having a hard time picturing how the input features get mapped at each time step. Could you explain with this text-sequence example itself, where each word has n features, i.e., is a vector of size n, and show how those features are mapped? Thanks!!!
Sigmoid doesn't inherently convert real values to binary labels (0 or 1); instead it outputs real values strictly between 0 and 1. The vectors at the output of the gates need NOT be something like [0 0 1 1]; they can be, and most probably will be, something like [0.122, 0.23, 0.001, 0.983].
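A quick numeric check of this point (plain NumPy, nothing LSTM-specific):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

y = np.array([-2.0, -0.5, 0.0, 3.0])
gate = sigmoid(y)
print(gate)   # approx [0.119, 0.378, 0.5, 0.953] -- real values, not 0/1
```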
Sir, why are we applying the sigmoid function again in the input gate when we have already done so in the forget gate? What is the necessity of calculating i(t) separately? Isn't f(t) = i(t)?
amazing explanation sir..many thanks
Wonderful Explanation!
So is it fair to say that the forget gate decides "where the new word fits in the context", hence the forgetting in the context, and the input gate decides how the new word "changes" the context, thereby altering the influence of the new word on the context?
Excellent sir
Hi, I actually don't understand why we need to apply the sigmoid twice, once for the input gate and once for the forget gate. Isn't it doing the same thing?
Bro, I have the same doubt. The weights may differ, but doesn't that impact the model? Please let me know if you found an answer.
Sir, is it possible to classify images and sort them into folders automatically? I'm a data operator for the forest department at Kaziranga National Park. We have many camera-trap photos, and segregating them manually is very hard. Please help.
At timestamp 7:00, I think this matrix multiplication is not possible. In matrix multiplication, the number of columns in the first matrix must equal the number of rows in the second for the product to be valid.
Excellent..
Does LSTM accept input of variable size, or is padding required to make all inputs the same size?
Hello Krish, can you explain Conv-LSTM with one sample dataset, how it differs from LSTM, and the TimeDistributed concept with LSTM?
Thanks Krish
How does the long-term dependency problem relate to the vanishing gradient problem? Can anyone please explain?
So the input gate consists of a sigmoid and a multiplication operation, and the same sigmoid-plus-multiplication structure appears in the forget gate too. So the forget gate resembles the input gate, and so does the output gate, except the output gate is a bit different: a tanh is applied first, then the gating multiplication. Am I right? Is anything wrong?
Hi Krish,
Don't we have backpropagation and weight updates in LSTM? If yes, how do they work?
Great explanation.
Krish sir, how are the weights different at every gate? Since we send the same concatenated vector [h_{t-1}, x_t] to every gate, how can the weights be different?
Finally I've seen a detailed explanation. Thank you.
Bro, because of you I understood deep learning very well. I need a small favor: can you send some resources for learning deep learning with TensorFlow, please?
A small confusion about C_{t-1}: how does C_{t-1} differ from h_{t-1}, if both come from the previous step?
Thank you sir❤
Could you please make the video in seq2seq architecture for the Conversational Modeling?
Thanks for the video, Krish. One doubt: how do word vectors change to 0s and 1s when we pass them through the sigmoid function? Values greater than 0.5 might be marked as 1, but how is this probability determined? Based on what value?
The sigmoid function is f(x) = 1/(1 + e^-x). After calculating W·x + b, the result passes through the sigmoid, which outputs a value between 0 and 1. If the output is greater than 0.5 it is assigned 1; otherwise 0.
There's a mistake: the output of a gate is a vector of real values strictly between 0 and 1, not binary (not just 0 or 1).
The network learns the best way to project: first a linear transformation (W times the input), then a non-linear transformation (applying the sigmoid).
To answer your "how": the network learns the best way to do this transformation (by learning the weights) so as to optimize the objective function.
How can we do extractive summarization with BERT?
Hi Krish, I've finished the LSTM forecasting, but I'm facing a mismatch: predictions were made for the test data, but the predicted values come out lower than the actual test data.
Is the video on the different types of LSTM skipped?
@KrishNaik could you please tell us what the math behind this concatenation operation [h_{t-1}, x_t] is? What is the ","? Is it addition or multiplication?
It is actually concatenation. Say h_{t-1} is a vector of size m and x_t is a vector of size n; the result is a vector of size m + n.
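A one-liner in NumPy makes the sizes concrete (m = 4 and n = 3 are arbitrary here):

```python
import numpy as np

h_prev = np.zeros(4)   # h_{t-1}: hidden state of size m = 4
x_t = np.zeros(3)      # x_t: input vector of size n = 3

concat = np.concatenate([h_prev, x_t])   # [h_{t-1}, x_t]
print(concat.shape)    # (7,) -- i.e. m + n
```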
At 6:41, this violates the matrix multiplication rule. I was also working through the input layer manually and was stuck for hours wondering why I couldn't add the output to the memory state; then I found out I was applying the matrix multiplication rules incorrectly. Anyway, great explanation.
It is a Hadamard (elementwise) product.
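Right: the products in the cell-state update C_t = f_t * C_{t-1} + i_t * Ctilde_t are elementwise (Hadamard), not matrix products. A tiny sketch with made-up values:

```python
import numpy as np

# Illustrative (made-up) values for one time step:
f_t = np.array([0.9, 0.1, 0.5])       # forget gate output
c_prev = np.array([2.0, 2.0, 2.0])    # previous cell state C_{t-1}
i_t = np.array([0.2, 0.8, 0.5])       # input gate output
c_tilde = np.array([1.0, -1.0, 0.0])  # candidate values (tanh output)

# Elementwise products: each cell-state dimension is gated independently.
c_t = f_t * c_prev + i_t * c_tilde
print(c_t)   # [2.0, -0.6, 1.0]
```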
Love your video, but I have a question: how do we update the weights, i.e., backpropagate through the LSTM?
I think the backpropagation process for an LSTM is the same as for a simple RNN.
Buddy, even MIT didn't go this deep into it. Understanding the math behind complex deep learning networks is really hard.
I was wondering: as the context changes, how does the sigmoid function drive the values to 0, or near zero, to forget the past memory? Because the input is changing, right? Then it must not proceed further... isn't it?
Can you please make a video on how to combine two deep learning model which are trained on different dataset
please upload video any real time project in deep learning using like lstm algotihm
A few suggestions: please reduce how often you use the words "particular" and "over". Since you are already talking about something specific, "particular" isn't needed every time, and likewise with "over": you are referring to "here", so simply "here" would sound better than "over here".
Thanks Sir.
LSTM is kind of crappy when it comes to predicting coronavirus cases.
Krish, in your opinion, which algorithm would be best for predicting the world's COVID-19 cases?
Sir, how can we use time-series data as input to a CNN? Please guide me.
Please upload time-series analysis using RNN ASAP...
Yes coming up
Please help me work with time-series data.
Could you please make a programming tutorial for LSTM and GRU?
Please upload further videos....
Is accuracy meaningless in Keras models?
Thanks. Please upload a practical LSTM video.
Can anyone provide a reference link to learn word-to-vector conversion topics?
Go to the deep learning playlist.
Is there anything left in the deep learning tutorial, or is it completed?
Can you please make a video on GAN as well?
Krish, please upload more on LSTM.
Please go back to your whiteboard. You're amazing with whiteboard and marker!
I'm waiting for your next video.
Which book are you teaching from, Krish?
The link is given in the references, haha.
Great
Will you be uploading videos on transfer learning?
Transfer learning is a very broad topic, bro; every day a new algorithm comes out that uses transfer learning.
Please upload video about autoencoder
so complicated ...
hehe😂
I think you need to clear up your basics first.
21:48 yes, very confusing.
This video was recorded on the day the LOCKDOWN started!!!
please upload more videos
Mar-24-2021
Are you 29 years old?
The whiteboard is better.
Pasha Tekhnik has really gone downhill: became an Indian and took up neural networks.
confusing
You disappointed us
😂
Too many advertisements 😒😔
Hello all, my name is Krish Naik... 🤣😁😝