Reinforcement Learning (RL) explained (LLM, Vision, Robot)

Поделиться
HTML-код
  • Опубликовано: 20 окт 2024

Комментарии • 10

  • @samyogdhital
    @samyogdhital 25 дней назад

    loved it brother and keep making videos on robotics.
    i am loving it and I am fully supporting you on these kinds of videos.

  • @Tech_Datasavvy
    @Tech_Datasavvy Год назад +1

    Awesome and really liked the way you explained.

    • @code4AI
      @code4AI  Год назад

      Thanks a lot 😊

  • @manojtiwari7754
    @manojtiwari7754 Год назад +1

    I love your videos.. please keep going and thank you for all these amazing videos

  • @VenkatesanVenkat-fd4hg
    @VenkatesanVenkat-fd4hg Год назад

    Thanks for the efforts....

  • @NeuroScientician
    @NeuroScientician Год назад

    What do you do if the output from Human Feedback bit is not very good? At what point it becomes futile? It's nearly impossible to align a large number of raters for tasks that aren't very black and white and there is no way to standardise the training enough to actually be repeatable across many languages and cultures. Because if you have A vs B type of decision and your group effort ends up with more or less 50:50, you don't really have anything, right?

  • @SaiKiranAdusumilli
    @SaiKiranAdusumilli 8 месяцев назад

    Great and neat explanation ❤

  • @aspelot
    @aspelot Год назад

    Well presented!

  • @brendanbrowne2103
    @brendanbrowne2103 Год назад

    Great channel, currently working on a robot arm with ai