Hybrid Control for Reinforcement Learning: The Half-Cheetah Benchmark

  • Published: 25 Oct 2024
  • Our IJRR paper (with Allison Pinosky, Ian Abraham, Alex Broad, and Brenna Argall) on hybrid control for reinforcement learning is out, available open access.
    journals.sagep...
    This video compares techniques on the half-cheetah benchmark. Rather than treating RL as a purely data-driven approach to control, we use optimal control to improve RL: the method optimally switches between model-based and model-free approaches, via an analytical formula for the control that depends on both (a toy sketch of the switching idea appears below).
    We use soft actor-critic (SAC) as the model-free approach and NN-MPPI as the model-based approach. In our comparisons, both do well on simple problems (e.g., swing-up) but perform worse on the half-cheetah and hopper problems. We trained with small networks (2 layers) and few training steps (50k); both model-based and model-free methods can learn these problems successfully with larger networks and many more training steps.
    Code is available at github.com/Mur....
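
    As a rough illustration only (not the paper's actual derivation), the switching idea can be sketched in a few lines of Python. The names policy, planner, and q_fn below are hypothetical placeholders, and the simple value comparison stands in for the analytical switching condition derived in the paper:

        # Toy sketch of hybrid model-based/model-free switching.
        # `policy`, `planner`, and `q_fn` are hypothetical placeholders,
        # not interfaces from the paper's codebase.

        def hybrid_act(state, policy, planner, q_fn):
            """Return whichever proposed action the critic values more.

            policy(state)  -> model-free action (e.g., from SAC)
            planner(state) -> model-based action (e.g., from NN-MPPI)
            q_fn(state, a) -> scalar value estimate of action a in state
            """
            a_mf = policy(state)   # model-free proposal
            a_mb = planner(state)  # model-based proposal
            return a_mb if q_fn(state, a_mb) > q_fn(state, a_mf) else a_mf

        # Example with trivial stand-ins:
        action = hybrid_act(
            state=0.0,
            policy=lambda s: -1.0,
            planner=lambda s: +1.0,
            q_fn=lambda s, a: a,  # this toy critic simply favors larger actions
        )
        # -> 1.0 (the planner's proposal wins)

    In the paper, the switch follows from an optimality condition rather than a raw Q-value comparison, so the resulting control law depends on both approaches analytically instead of simply toggling between them.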
