Multi-Robot Cooperative Socially-Aware Navigation Using Multi-Agent Reinforcement Learning

要約

人間と共有する公共スペースでは、特にコミュニケーションが制限されている場合、社会規範を尊重しながら複数のロボットシステムが衝突することなく確実に移動することを保証することは困難です。
現在のロボットのソーシャルナビゲーション技術は強化学習と深層学習の進歩を活用していますが、シミュレーションにおけるロボットのダイナミクスを見落とすことが多く、シミュレーションと現実のギャップにつながります。
この論文では、Dec-POSMDP とマルチエージェント強化学習を使用して構築された新しいマルチロボットソーシャルナビゲーション環境を提示することで、このギャップを埋めます。
さらに、協調的なマルチロボットソーシャルナビゲーションのための新しいベンチマークである SAMARL を紹介します。
SAMARL は、マルチエージェント強化学習と組み合わせた独自の時空間変換機能を採用しています。
このアプローチは、ロボットと人間の間の複雑な相互作用を効果的に捉え、マルチロボットシステムにおける協力傾向を促進します。
当社の広範な実験により、SAMARL が当社の設計環境において既存のベースラインおよびアブレーションモデルよりも優れたパフォーマンスを発揮することが明らかになりました。
この作業のデモビデオは、https://sites.google.com/view/samarl でご覧いただけます。

要約(オリジナル)

In public spaces shared with humans, ensuring multi-robot systems navigate without collisions while respecting social norms is challenging, particularly with limited communication. Although current robot social navigation techniques leverage advances in reinforcement learning and deep learning, they frequently overlook robot dynamics in simulations, leading to a simulation-to-reality gap. In this paper, we bridge this gap by presenting a new multi-robot social navigation environment crafted using Dec-POSMDP and multi-agent reinforcement learning. Furthermore, we introduce SAMARL: a novel benchmark for cooperative multi-robot social navigation. SAMARL employs a unique spatial-temporal transformer combined with multi-agent reinforcement learning. This approach effectively captures the complex interactions between robots and humans, thus promoting cooperative tendencies in multi-robot systems. Our extensive experiments reveal that SAMARL outperforms existing baseline and ablation models in our designed environment. Demo videos for this work can be found at: https://sites.google.com/view/samarl

arxiv情報

著者	Weizheng Wang,Le Mao,Ruiqi Wang,Byung-Cheol Min
発行日	2024-05-15 19:57:59+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Multi-Robot Cooperative Socially-Aware Navigation Using Multi-Agent Reinforcement Learning

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー