Видео 85
Просмотров 38 488

Megha Srivastava's talk on "Generating Language Corrections for Teaching Physical Control Tasks"

4:47

Supplemental video for "In-Mouth Robotic Bite Transfer With Visual and Haptic Sensing"

3:01

Minae Kwon's talk on "Reward Design with Language Models"

4:47

Siddharth Karamcheti's talk on "Online Language Corrections for Robotics via Shared Autonomy"

7:55

Joey Hejna's Talk on "Few-Shot Preference-based RL"

12:51

Priya Sundaresan's talk on "Learning Visuo-Haptic Skewering Strategies for Robot-Assisted Feeding"

13:52

Bidipta Sarkar's talk on "Diverse Conventions for Human-AI Collaboration"

Supplementary Video for the NeurIPS 2023 paper:
Bidipta Sarkar, Andy Shih, and Dorsa Sadigh. Diverse conventions for Human-AI collaboration. In Advances in Neural Information Processing Systems 36 (NeurIPS), 2023
Paper: arxiv.org/abs/2310.15414
Website: iliad.stanford.edu/Diverse-Conventions/
Code: github.com/Stanford-ILIAD/Diverse-Conventions
Abstract: Conventions are crucial for strong performance in cooperative multi-agent games, because they allow players to coordinate on a shared strategy without explicit communication. Unfortunately, standard multi-agent reinforcement learning techniques, such as self-play, converge to conventions that are arbitrary and non-diverse, leading to poor gene...

Видео

Megha Srivastava's talk on "Generating Language Corrections for Teaching Physical Control Tasks"

4:47

Megha Srivastava's talk on "Generating Language Corrections for Teaching Physical Control Tasks"

Просмотров 282Год назад

"Generating Language Corrections for Teaching Physical Control Tasks" Megha Srivastava, Noah Goodman, Dorsa Sadigh International Conference of Machine Learning (ICML) 2023, Honolulu, Hawai'i We design and build CORGI, a model trained to generate language corrections for physical control tasks, such as learning to ride a bike. CORGI takes in as input a pair of student and expert trajectories, an...

Supplemental video for "In-Mouth Robotic Bite Transfer With Visual and Haptic Sensing"

3:01

Supplemental video for "In-Mouth Robotic Bite Transfer With Visual and Haptic Sensing"

Просмотров 359Год назад

Supplemental video for "In-Mouth Robotic Bite Transfer With Visual and Haptic Sensing", ICRA 2023. View more information and the paper on our website: tinyurl.com/btICRA Music: Music from #Uppbeat (free for Creators!): uppbeat.io/t/sky-toes/the-long-ride-home License code: XHROZYRPSAI2DR4B

Minae Kwon's talk on "Reward Design with Language Models"

4:47

Minae Kwon's talk on "Reward Design with Language Models"

Просмотров 646Год назад

"Reward Design with Language Models" Minae Kwon, Sang Michael Xie, Kalesha Bullard, Dorsa Sadigh International Conference on Learning Representations (ICLR), May 2023

Siddharth Karamcheti's talk on "Online Language Corrections for Robotics via Shared Autonomy"

7:55

Siddharth Karamcheti's talk on "Online Language Corrections for Robotics via Shared Autonomy"

Просмотров 300Год назад

Paper: arxiv.org/abs/2301.02555 "No, to the Right" - Online Language Corrections for Robotic Manipulation via Shared Autonomy Yuchen Cui*, Siddharth Karamcheti*, Raj Palleti, Nidhya Shivakumar, Percy Liang, Dorsa Sadigh 18th ACM/IEEE International Conference on Human-Robot Interaction (HRI), March 2023

Joey Hejna's Talk on "Few-Shot Preference-based RL"

12:51

Joey Hejna's Talk on "Few-Shot Preference-based RL"

Просмотров 741Год назад

Joey Hejna's Talk on "Few-Shot Preference-based RL"

Priya Sundaresan's talk on "Learning Visuo-Haptic Skewering Strategies for Robot-Assisted Feeding"

13:52

Priya Sundaresan's talk on "Learning Visuo-Haptic Skewering Strategies for Robot-Assisted Feeding"

Просмотров 276Год назад

Paper Website: sites.google.com/view/hapticvisualnet-corl22/home PDF: openreview.net/pdf?id=lLq09gVoaTE Priya Sundaresan, Suneel Belkhale, Dorsa Sadigh

Megha Srivastava's talk on "Assistive Teaching of Motor Control Tasks to Humans"

19:10

Megha Srivastava's talk on "Assistive Teaching of Motor Control Tasks to Humans"

Просмотров 218Год назад

Paper: iliad.stanford.edu/pdfs/publications/srivastava2022assistive.pdf "Assistive Teaching of Motor Control Tasks to Humans" by Megha Srivastava, Erdem Biyik, Suvir Mirchandani, Noah Goodman, Dorsa Sadigh

Jennifer Grannen's talk on "Learning Bimanual Scooping Policies for Food Acquisition"

16:07

Jennifer Grannen's talk on "Learning Bimanual Scooping Policies for Food Acquisition"

Просмотров 355Год назад

Paper: arxiv.org/abs/2211.14652 Learning Bimanual Scooping Policies for Food Acquisition Jennifer Grannen*, Yilin Wu*, Suneel Belkhale, Dorsa Sadigh

PLATO: Predicting Latent Affordances through Object Centric Play

8:10

PLATO: Predicting Latent Affordances through Object Centric Play

Просмотров 2562 года назад

Paper: arxiv.org/abs/2203.05630 PLATO: Predicting Latent Affordances through Object Centric Play Suneel Belkhale, Dorsa Sadigh Conference on Robot Learning, December 2022

Erik Brockbank's talk on Incorporate Advice from Artificial Agents when Making Physical Judgments

15:03

Erik Brockbank's talk on Incorporate Advice from Artificial Agents when Making Physical Judgments

Просмотров 852 года назад

Paper: arxiv.org/abs/2205.11613 How do People Incorporate Advice from Artificial Agents when Making Physical Judgments? Erik Brockbank*, Haoliang Wang*, Justin Yang, Suvir Mirchandani, Erdem Bıyık, Dorsa Sadigh, Judith Fan Cognitive Science Society Conference (CogSci), July 2022

Andy Shih's talk on "Conditional Imitation Learning for Multi Agent Games"

10:02

Andy Shih's talk on "Conditional Imitation Learning for Multi Agent Games"

Просмотров 702 года назад

Paper: arxiv.org/abs/2201.01448 Conditional Imitation Learning for Multi-Agent Games Andy Shih, Stefano Ermon, Dorsa Sadigh 17th ACM/IEEE International Conference on Human-Robot Interaction (HRI), March 2022

Mark Beliaev's talk at ICML 2022 on "Imitation Learning by Estimating Expertise of Demonstrators"

5:01

Mark Beliaev's talk at ICML 2022 on "Imitation Learning by Estimating Expertise of Demonstrators"

Просмотров 5092 года назад

Paper: arxiv.org/abs/2202.01288 Imitation Learning by Estimating Expertise of Demonstrators Mark Beliaev*, Andy Shih*, Stefano Ermon, Dorsa Sadigh, Ramtin Pedarsani 39th International Conference on Machine Learning (ICML), July 2022

ICRA22 talk for 'Weakly Supervised Correspondence Learning'

4:00

ICRA22 talk for 'Weakly Supervised Correspondence Learning'

Просмотров 1332 года назад

The talk for ''Weakly Supervised Correspondence Learning', which is accepted by ICRA22.

ICRA talk for 'Leveraging Smooth Attention Prior for Multi-Agent Trajectory Prediction'

3:56

ICRA talk for 'Leveraging Smooth Attention Prior for Multi-Agent Trajectory Prediction'

Просмотров 2392 года назад

ICRA talk for 'Leveraging Smooth Attention Prior for Multi-Agent Trajectory Prediction'

Zihan Wang's Talk on "Learning from Imperfect Demonstrationsvia Adversarial Confidence Transfer"

11:09

Zihan Wang's Talk on "Learning from Imperfect Demonstrationsvia Adversarial Confidence Transfer"

Просмотров 1602 года назад

Zihan Wang's Talk on "Learning from Imperfect Demonstrationsvia Adversarial Confidence Transfer"

Erdem Bıyık's Talk on "Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams" at AI-HRI

14:29

Erdem Bıyık's Talk on "Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams" at AI-HRI

Просмотров 3243 года назад

Erdem Bıyık's Talk on "Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams" at AI-HRI

Erdem Bıyık's Talk on "APReL: A Library for Active Preference-based Reward Learning Algorithms"

12:59

Erdem Bıyık's Talk on "APReL: A Library for Active Preference-based Reward Learning Algorithms"

Просмотров 4233 года назад

Erdem Bıyık's Talk on "APReL: A Library for Active Preference-based Reward Learning Algorithms"

Andy Shih's Talk on "HyperSPNs: Compact and Expressive Probabilistic Circuits"

11:02

Andy Shih's Talk on "HyperSPNs: Compact and Expressive Probabilistic Circuits"

Просмотров 3823 года назад

Andy Shih's Talk on "HyperSPNs: Compact and Expressive Probabilistic Circuits"

Songyuan's Talk on "Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality"

13:44

Songyuan's Talk on "Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality"

Просмотров 3513 года назад

Songyuan's Talk on "Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality"

Suvir Mirchandani's Talk on "ELLA: Exploration through Learned Language Abstraction" at NeurIPS 2021

13:58

Suvir Mirchandani's Talk on "ELLA: Exploration through Learned Language Abstraction" at NeurIPS 2021

Просмотров 3493 года назад

Suvir Mirchandani's Talk on "ELLA: Exploration through Learned Language Abstraction" at NeurIPS 2021

Learning Reward Functions from Scale Feedback

3:55

Learning Reward Functions from Scale Feedback

Просмотров 2033 года назад

Learning Reward Functions from Scale Feedback

Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams

3:42

Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams

Просмотров 1833 года назад

Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams

Learning Multimodal Rewards from Rankings

4:17

Learning Multimodal Rewards from Rankings

Просмотров 2183 года назад

Learning Multimodal Rewards from Rankings

PantheonRL: A MARL Library for Dynamic Training Interactions

9:02

PantheonRL: A MARL Library for Dynamic Training Interactions

Просмотров 1,1 тыс.3 года назад

PantheonRL: A MARL Library for Dynamic Training Interactions

APReL: A Library for Active Preference-based Reward Learning Algorithms

2:53

APReL: A Library for Active Preference-based Reward Learning Algorithms

Просмотров 3383 года назад

APReL: A Library for Active Preference-based Reward Learning Algorithms

Woodrow Z. Wang's Talk at IJCAI 2021 on "Emergent Prosociality in Multi-Agent Games Through Gifting"

14:08

Woodrow Z. Wang's Talk at IJCAI 2021 on "Emergent Prosociality in Multi-Agent Games Through Gifting"

Просмотров 1193 года назад

Woodrow Z. Wang's Talk at IJCAI 2021 on "Emergent Prosociality in Multi-Agent Games Through Gifting"

Learning How to Dynamically Route Autonomous Vehicles on Shared Roads

3:37

Learning How to Dynamically Route Autonomous Vehicles on Shared Roads

Просмотров 1843 года назад

Learning How to Dynamically Route Autonomous Vehicles on Shared Roads

Minae Kwon's Talk at ICML 2021 on "Targeted Data Acquisition for Evolving Negotiation Agents"

4:49

Minae Kwon's Talk at ICML 2021 on "Targeted Data Acquisition for Evolving Negotiation Agents"

Просмотров 2183 года назад

Minae Kwon's Talk at ICML 2021 on "Targeted Data Acquisition for Evolving Negotiation Agents"

Kejun Li's Talk on Region of Interest Active Learning for Characterizing Exoskeleton Gait Pref's

11:58

Kejun Li's Talk on Region of Interest Active Learning for Characterizing Exoskeleton Gait Pref's

Просмотров 1233 года назад

Kejun Li's Talk on Region of Interest Active Learning for Characterizing Exoskeleton Gait Pref's

@AliHamadBakar 3 месяца назад
good job
@fmparrott5485 10 месяцев назад
This is excellent. The passive data presentation was very helpful.
@AlgoNudger 11 месяцев назад
Thanks.
@MacGyverKilla Год назад
Seriously? Are you guys out of your minds? That's a robot without any restriction on it's motion with a sharp EOA attachment (your fork) and you're not even using safety glasses to protect your eyes?
@soltani8781 2 года назад
❗ 𝓹𝓻𝓸𝓶𝓸𝓼𝓶
@michaelprzystupa520 3 года назад
I'm confused by how you get h to transition between s to s' before training the aligning model. How exactly does the data collection of getting the human input work? so to go form s to s' using h, is h being passed through the untrained transformation function in the paper, or is h being initially handled by some other teleoperation control scheme which is then mapped to the transformation map?
@michaelprzystupa520 3 года назад
Wait, is the idea that. The user sees the transition from s -> s' and then you ask them "how would this look to you on the joystick?" if that's the case, I think I answer my own question. Thank you!
@mengxili6078 3 года назад
@@michaelprzystupa520 Thanks for the question. Yes exactly, your understanding is correct. We don't assume a default control scheme. In the data collection process, we would show to the user the transition s -> s', and then ask them to give their preferred h for realizing the transition. Please let us know if you have further questions :)
@michaelprzystupa520 3 года назад
First!
@Chrisspru 3 года назад
the main issue taht limits the suit is constant misalignemnet of the joints and the lack of a few major axis of movement (leg rotation and abduction for example). this creates a frame that forces the wearer to move unnaturaly and output that can not match human movement by not being able to move like a human. a more anatomicaly correct design (better ankle and hip joint alignment, double joint knee, leg abductor behiind the back and rotator beside it, lengt adjustable upper and lower leg) would greatly improve movement fluidity and accuracy, allowing for more useful software output. a system of bungee cords to simulate the elastic properties of muscle wihout having it rely on beyond peak motor function might help too.
@sayarerdi 3 года назад
Hi. Where did you publish your codes related to the study?
@liyangku1253 4 года назад
So in the paper it says that only 2DOF joystick control is used, is that per task? It seems that the user still presses with their right hand sometimes. Is that for switching tasks or different tasks have different controls?
@dylanlosey2476 4 года назад
Hello! Within some of the baseline methods the user directly controls the robot's end-effector. This requires 6 DoF, but the joystick only allows 2 DoF at a time. So the user presses a button with their right hand to toggle between different modes.
@liyangku1253 4 года назад
@@dylanlosey2476 Thanks for the reply! Great work! Just to clarify, for the proposed approach LA + SA, is it correct that when switching from say reaching the tofu to cutting it, the user needs to press a button with their right hand to switch between latent spaces?
@dylanlosey2476 3 года назад
@@liyangku1253 With the proposed LA + SA approach, the user never needs to "manually" switch latent spaces. The meaning of the latent space automatically changes as a function of the robot's state and belief. If you see the user pressing a button with our approach, this could just be because they forgot / got confused by the different modes.
@lantaoyu129 4 года назад
great talk, zhangjie !

Stanford ILIAD

Видео

Комментарии