LEMURS: Learning Distributed Multi-Robot Interactions

要約

この論文では、協調タスクのデモンストレーションからスケーラブルなマルチロボット制御ポリシーを学習するためのアルゴリズムである LEMURS について説明します。
相互接続されたシステムにおける普遍的な物理的制約を活用し、閉ループの安定性を達成するために、マルチロボットシステムのポートハミルトニアン記述を提案します。
自己注意メカニズムとニューラル常微分方程式を組み合わせたアーキテクチャを使用して、マルチロボット制御ポリシーを表します。
前者はロボットチーム内の時変通信を処理し、後者は連続時間のロボットダイナミクスを尊重します。
私たちの表現は構築によって分散されるため、学習した制御ポリシーをさまざまな規模のロボットチームに展開できます。
マルチエージェントナビゲーションと群れタスクのデモンストレーションから、LEMURSが相互作用と協調行動を学習できることを示します。

要約(オリジナル)

This paper presents LEMURS, an algorithm for learning scalable multi-robot control policies from cooperative task demonstrations. We propose a port-Hamiltonian description of the multi-robot system to exploit universal physical constraints in interconnected systems and achieve closed-loop stability. We represent a multi-robot control policy using an architecture that combines self-attention mechanisms and neural ordinary differential equations. The former handles time-varying communication in the robot team, while the latter respects the continuous-time robot dynamics. Our representation is distributed by construction, enabling the learned control policies to be deployed in robot teams of different sizes. We demonstrate that LEMURS can learn interactions and cooperative behaviors from demonstrations of multi-agent navigation and flocking tasks.

arxiv情報

著者	Eduardo Sebastian,Thai Duong,Nikolay Atanasov,Eduardo Montijano,Carlos Sagues
発行日	2023-02-21 19:04:40+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

LEMURS: Learning Distributed Multi-Robot Interactions

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー