Reinforcement Learning Tutorial | Reinforcement Learning Example Using Python | Edureka

  • Published: 26 Dec 2024

Comments

  • @edurekaIN
    @edurekaIN 6 years ago +9

    Got a question on the topic? Please share it in the comment section below and our experts will answer it for you. For Edureka Python Course curriculum, Visit our Website: bit.ly/2OpzQWw

  • @wolfisraging
    @wolfisraging 6 years ago +21

    That's why I love scratch implementation rather than using high end library, good job

  • @murtuza.chawala
    @murtuza.chawala 4 years ago +5

    Edureka is the modern education system! We love you, keep up the great work, especially the free content!

  • @prasadgvs4639
    @prasadgvs4639 5 years ago +6

    brilliant!! A perfect intro to ML. Well done Edureka!!

  • @wolfisraging
    @wolfisraging 5 years ago +6

    Best tutorial for reinforcement learning, well done. Thank u so much

  • @arnavverma8622
    @arnavverma8622 2 years ago +1

    Very good explanation

  • @TheStrelok7
    @TheStrelok7 3 years ago +1

    You are legend!!
    Thank you!

  • @ratangles820
    @ratangles820 4 years ago +4

    This is the most beautiful thing I've seen today :)

  • @bilalsadiq45
    @bilalsadiq45 4 years ago +1

    This is one of the best lectures I have found for understanding the crux of Q-learning. Hats off to you, ma'am.

    • @edurekaIN
      @edurekaIN 4 years ago

      Thanks for the compliment! We are glad we could help. Do subscribe to our channel to stay posted on upcoming tutorials.

  • @sureshnambiar8566
    @sureshnambiar8566 2 years ago +1

    Excellent

  • @kanchan6731
    @kanchan6731 5 years ago +4

    One of the best learning sources I have ever seen 🙄

    • @edurekaIN
      @edurekaIN 5 years ago +1

      Thank you for appreciating our efforts, Kan. Do subscribe, like and share to stay connected with us. Cheers :)

  • @anintrovert4128
    @anintrovert4128 5 years ago +6

    This video is better than Udacity nano degree ml program class on Reinforcement learning

  • @hidayatzeb1463
    @hidayatzeb1463 4 years ago

    I have never seen a lecture like this in my entire life. Expecting more videos like this, thank you.

  • @dr.savitasheoran473
    @dr.savitasheoran473 3 years ago

    very well explained

  • @UlrichArmel
    @UlrichArmel 2 years ago

    Well done. I really understood this in 30 minutes, after going through a bunch of notes and maths without really understanding what was happening. Thanks very much.

    • @edurekaIN
      @edurekaIN 2 years ago +1

      Hey :) Thank you so much for your sweet words :) It really means a lot! Glad to know that our content and courses are helping you learn better :) Our team is striving hard to give the best content. Keep learning with us - Team Edureka :) Don't forget to like the video and share it with as many people as possible :) Do subscribe to the channel :)

  • @guillaumenelson6996
    @guillaumenelson6996 2 years ago

    You explained it all in 46 minutes. Thanks a lot!

    • @edurekaIN
      @edurekaIN 2 years ago

      You're welcome 😊 Stay connected with our channel and team :) Do subscribe to the channel for more updates :) Hit the bell icon to never miss an update from our channel :)

    • @guillaumenelson6996
      @guillaumenelson6996 2 years ago

      @@edurekaIN it was already done ✅
      I did subscribe and hit the bell button 😊

  • @ANIMESHKUMARPGP-
    @ANIMESHKUMARPGP- 4 years ago

    Very good lecture, whoever was playing CS is a very good awper.....

  • @hhhgdgb5205
    @hhhgdgb5205 5 years ago +4

    Thank you I like it, happy day .

  • @raginisharma9302
    @raginisharma9302 2 years ago

    Very Useful and easy to understand - brilliant teacher , thank you !!

    • @edurekaIN
      @edurekaIN 2 years ago

      Hi :) We are really glad to hear this! It truly feels good that our team is delivering and making your learning easier :) Keep learning with us. Stay connected with our channel and team :) Do subscribe to the channel for more updates :) Hit the bell icon to never miss an update from our channel :)

  • @chaitanyakaushik6772
    @chaitanyakaushik6772 3 years ago

    Excellent explanation, really helpful.

    • @edurekaIN
      @edurekaIN 3 years ago

      Hi :) We are really glad to hear this! It truly feels good that our team is delivering and making your learning easier :) Keep learning with us. Stay connected with our channel and team :) Do subscribe to the channel for more updates :) Hit the bell icon to never miss an update from our channel :)

  • @ntsikelelonelsonmbekwa3231
    @ntsikelelonelsonmbekwa3231 5 years ago +1

    Wow :) Thanks edureka!

  • @moriumakter9429
    @moriumakter9429 5 years ago +1

    good explanation. thank you

  • @fathialwosaibi4024
    @fathialwosaibi4024 5 years ago +4

    Amazing video. Very well done, you managed to put a very technical matter into simple words. Thanks for sharing.

    • @edurekaIN
      @edurekaIN 5 years ago

      Thanks for the compliment, Fathi! We are glad you loved the video. Do subscribe to the channel and hit the bell icon to never miss an update from us in the future. Cheers!

  • @rohitshaw3922
    @rohitshaw3922 5 years ago +3

    it was really a great explanation . Thank you so much

  • @santoshkumarsahu8482
    @santoshkumarsahu8482 2 years ago +1

    At 19:00 in the video:
    Policy (A->C->D) = 15 + 50 = 65
    Policy (A->B->C->D) = 30 + (-10) + 50 = 70
    Is it correct? Please clarify.
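The arithmetic in the comment above can be checked with a short sketch. The edge rewards are taken only from the numbers quoted in the comment and have not been checked against the video; the `rewards` dictionary and the `path_reward` helper are hypothetical names introduced for illustration.

```python
# Edge rewards exactly as quoted in the comment (assumption: these match the video).
rewards = {
    ("A", "C"): 15,
    ("C", "D"): 50,
    ("A", "B"): 30,
    ("B", "C"): -10,
}


def path_reward(path):
    """Sum the rewards of consecutive edges along a path of node names."""
    return sum(rewards[(src, dst)] for src, dst in zip(path, path[1:]))


print(path_reward(["A", "C", "D"]))       # 15 + 50
print(path_reward(["A", "B", "C", "D"]))  # 30 + (-10) + 50
```

With these numbers the two sums come out as 65 and 70, so the totals in the comment are consistent with the rewards it quotes.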

  • @engineered.mechanized
    @engineered.mechanized 5 years ago +3

    This was a great lecture.

  • @bhargavamahesh
    @bhargavamahesh 4 years ago

    Excellent and this is amazing to go through your video good job

  • @ipdevelopment1357
    @ipdevelopment1357 5 years ago +1

    What a fantastic video! Great work!!

  • @farenhite4329
    @farenhite4329 4 years ago

    Amazing!! It’s so clear now!

  • @ranam
    @ranam 5 years ago

    Simple but powerful explanation

  • @maheshvangala8472
    @maheshvangala8472 5 years ago +1

    Good explanation thank you 😘

  • @anandsankar4014
    @anandsankar4014 5 years ago

    awesome explanation

  • @41abhishek
    @41abhishek 4 years ago

    Superb tutorial

  • @rekhars1396
    @rekhars1396 5 years ago

    Happy with the explanation. Thank you so much .😊

  • @Jeevankumar-ju2nt
    @Jeevankumar-ju2nt 4 years ago

    amazing session

  • @mattcoakes5682
    @mattcoakes5682 5 years ago

    Very informative video,, thank you!

  • @syrymzhakypbekov1949
    @syrymzhakypbekov1949 4 years ago

    I like it! Super! Keep Going!

  • @adanesh
    @adanesh 4 years ago

    what a simple and wonderful lecture

  • @paichethan
    @paichethan 4 years ago

    Nice explanation. Short , accurate and practical.

  • @Janani.G
    @Janani.G 4 years ago

    Fantastic

  • @mdmamun-vp9xj
    @mdmamun-vp9xj 3 years ago

    please make a video of kalman filter with python.

    • @edurekaIN
      @edurekaIN 3 years ago

      Hi Mamun, thank you for your suggestion. We will definitely come up with an exclusive tutorial for the same. Meanwhile, do subscribe to our channel and stay tuned. Cheers :)

  • @Asmutiwari
    @Asmutiwari 4 years ago

    well explained !! thanks

  • @jim78able
    @jim78able 4 years ago

    Very nicely explained, best tutorial. I'll also show my university how Edureka teaches.

  • @yunusemredarici7284
    @yunusemredarici7284 4 years ago

    It was so helpful. Thanks a lot :)

  • @mergenlideki4055
    @mergenlideki4055 5 years ago +3

    if there is a R(5,5) even though the end goal (room 5) is already reached, why is there no R(4,4), R(3,3), R(2,2) and R(1,1) ?

    • @edurekaIN
      @edurekaIN 5 years ago +6

      Hey, there is (1,1), (2,2), (3,3), (4,4) connectivity, but the reward for traversing from a node to itself, for example from node 4 to 4, is zero, because nodes 1, 2, 3 and 4 are not goal nodes. Hope this helps. Cheers!

  • @rifanaaa2692
    @rifanaaa2692 4 years ago

    What are the applications of reinforcement learning?

    • @edurekaIN
      @edurekaIN 4 years ago

      Here are some of the applications of Reinforcement Learning:
      1. Robotics for industrial automation.
      2. Business strategy planning.
      3. Machine learning and data processing.
      4. It helps you to create training systems that provide custom instruction and materials according to the requirement of students.
      5. Aircraft control and robot motion control.

  • @raedm9244
    @raedm9244 4 years ago

    That was a very good video. I am still learning. Thank you.

  • @surbhigupta1419
    @surbhigupta1419 5 years ago

    nice video

  • @muhammadusmanakram406
    @muhammadusmanakram406 6 years ago

    excellent

  • @jeromystewart
    @jeromystewart 5 years ago +1

    I liked the explanation and the flow of concepts, but there are moments in this talk where the user (me/us) must ask: is the speaker instructing us based on an industry practice or on how this specific model is configured? For example, when you say the reward for an action that doesn't take you directly to the goal is zero, do you mean that the reward is zero in this specific implementation, or that this is universally the case? My brain gets hung up when the exact context isn't defined.

    • @edurekaIN
      @edurekaIN 5 years ago

      Hi Jeromy, thanks for watching the video. For each problem statement a different approach or a different model is built. So to answer your question, the instructor was referring to that particular problem statement. Hope this helps!

  • @venkystellar1877
    @venkystellar1877 5 years ago

    Lucid explanation. I have a doubt: how can we decide the number of iterations? Is the machine intended to explore during those iterations?

    • @edurekaIN
      @edurekaIN 5 years ago

      Hi Venky, thanks for the compliment! The number of iterations depends on the type of problem you're solving. Since this is a reinforcement learning problem, the agent requires more training because it must learn everything from scratch.

  • @ragulsithuraj9929
    @ragulsithuraj9929 5 years ago

    Hats off

  • @akshaybhosale1100
    @akshaybhosale1100 4 years ago

    Nicely explained. But still I am getting an error in the code. Please guide me.

    • @edurekaIN
      @edurekaIN 4 years ago

      Hi Akshay, we regret the error in your code. However, you can drop your email id in the comments and we shall assist you with the source codes. Hope this might be helpful, cheers :)

  • @sgt.mcgragon359
    @sgt.mcgragon359 5 years ago

    Halo,
    Great explanation, but one doubt: I saw the code at the end. Are you using the same code to show the final Q matrix and path? I am not getting the correct Q matrix, and the results are wrong!

    • @edurekaIN
      @edurekaIN 5 years ago

      Hey, The code creates and updates the Q matrix based on the movements of the agent. Can you please mention the error you are facing?

  • @chintandd
    @chintandd 5 years ago

    Wow. Nicely explained by the instructor. I thought Python had an inbuilt algorithm for calculating the Q matrix. But looking at the Python code, I realized that we need to code it. Am I right?

    • @edurekaIN
      @edurekaIN 5 years ago +1

      Hi Chintan, thanks for watching the video. Yes, you need to write the code for Q Matrix.

  • @Joseroberto-rr2wp
    @Joseroberto-rr2wp 5 years ago +1

    Why is Q(5,5) at minute 33:41 not 100?

    • @edurekaIN
      @edurekaIN 5 years ago +5

      Hi Jose, Q(5,5) is zero initially because it represents the memory of the agent. On the other hand, R(5,5) is 100 because it represents the reward the agent receives on reaching the goal state (5).

    • @ankitbrijwasi9902
      @ankitbrijwasi9902 4 years ago

      @@edurekaIN okay, thank you
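The distinction drawn in the reply above, between R as a table of fixed rewards and Q as the agent's memory that starts at zero, can be sketched as follows. This is a minimal illustration under stated assumptions, not the code from the video: the discount factor of 0.8, the six states numbered 0 to 5, and the simplified update rule (learning rate 1) are all assumptions, and `update` is a hypothetical helper.

```python
GAMMA = 0.8   # assumed discount factor
GOAL = 5      # goal state, as in the thread
N = 6         # assumption: states numbered 0..5

# R holds the fixed rewards; only the goal self-transition is filled in here.
R = [[0.0] * N for _ in range(N)]
R[GOAL][GOAL] = 100.0

# Q is the agent's memory and starts at zero everywhere, including Q(5,5).
Q = [[0.0] * N for _ in range(N)]


def update(state, action):
    """Simplified Q-learning update: Q(s, a) = R(s, a) + gamma * max Q(a, .)."""
    Q[state][action] = R[state][action] + GAMMA * max(Q[action])


before = Q[GOAL][GOAL]      # still zero: nothing has been learned yet
update(GOAL, GOAL)
after_one = Q[GOAL][GOAL]   # R(5,5) plus the discounted best value seen so far
update(GOAL, GOAL)
after_two = Q[GOAL][GOAL]
print(before, after_one, after_two)
```

R never changes during training, while Q(5,5) only moves away from zero once the agent actually takes that transition and applies the update.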

  • @jitendravasava4586
    @jitendravasava4586 6 years ago

    Present sir :)

  • @liaastuti1170
    @liaastuti1170 4 years ago

    Sorry, how do I get the code?

    • @edurekaIN
      @edurekaIN 4 years ago

      Hi Lia, kindly drop in your email id and we will share the code with you :)

  • @annanyamathur8869
    @annanyamathur8869 3 years ago

    please share code

    • @edurekaIN
      @edurekaIN 3 years ago

      Good to know our content and videos are helping you learn better. We are glad to have you with us! Please share your mail id so we can send the data sheets to help you learn better :) Do subscribe to the channel for more updates :) Hit the bell icon to never miss an update from our channel :)

  • @spamspamer3679
    @spamspamer3679 5 years ago +2

    I really appreciated the explanation and that you didn't use any ML libraries. But in my case there are two objects, which spawn randomly on a grid map at the beginning of the "game". One object (the "agent") has to reach the other object (the "goal"). But I can't create a matching matrix for this kind of problem, right? So, how should I deal with it?

    • @edurekaIN
      @edurekaIN 5 years ago +1

      Hey, glad you liked the content. Your 'goal' is not an agent. It can't spawn around in the grid because the goal is fixed. Are you suggesting that you want to create two machine learning agents? Can you please be more specific about it?
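For the grid-map question above, one possible way to keep a tabular Q matrix when the goal position changes between games is to make the goal position part of the state. The sketch below is not from the video or from the reply; the grid size, the four-action set, and the helper names `cell_index` and `state_index` are all hypothetical.

```python
WIDTH, HEIGHT = 4, 3          # hypothetical grid size
N_CELLS = WIDTH * HEIGHT
N_ACTIONS = 4                 # assumed actions: up, down, left, right


def cell_index(x, y):
    """Flatten an (x, y) grid coordinate into a single cell number."""
    return y * WIDTH + x


def state_index(agent_xy, goal_xy):
    """Map an (agent position, goal position) pair to one row of the Q-table."""
    return cell_index(*agent_xy) * N_CELLS + cell_index(*goal_xy)


# One row per (agent cell, goal cell) combination, one column per action.
N_STATES = N_CELLS * N_CELLS
Q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]

print(N_STATES, state_index((0, 0), (3, 2)))
```

The table grows with the square of the number of cells, so this encoding is only practical for small grids; for larger maps, a relative offset between agent and goal or a function approximator would be the usual alternatives.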

  • @sain5275
    @sain5275 2 years ago +1

    Very well explained.. 👍

  • @kusumasriram2016
    @kusumasriram2016 3 years ago

    Very clear explanation

  • @ymgindia
    @ymgindia 2 years ago

    Very good explanation! Thank you.

    • @edurekaIN
      @edurekaIN 2 years ago

      We are super happy that Edureka is helping you learn better. Your support means a lot to us and it motivates us to create even better learning content and course experiences for you. Do subscribe to the channel for more updates :) Hit the bell icon to never miss an update from our channel :)

  • @rachanadesai7984
    @rachanadesai7984 4 years ago

    very helpful!!

  • @systemsoftwareandcompilers3440
    @systemsoftwareandcompilers3440 5 years ago

    Very well explained. Thank you very much