Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding

要約

Lifelong Multi-Agent Path Finding (LMAPF) は MAPF の一種で、エージェントには継続的に新しい目標が割り当てられるため、動的な変化に対応するために頻繁な再計画が必要になります。
最近、この分野では、個々の局所観察に基づいて単一ステップのアクションを反応的に生成する学習ベースの方法が採用されています。
ただし、特に大規模な設定では、最高の検索ベースのアルゴリズムのパフォーマンスに匹敵することは依然として困難です。
この研究では、新しい通信モジュールと体系的なシングルステップ衝突解決およびグローバルガイダンス技術を導入する模倣学習ベースの LMAPF ソルバーを提案します。
私たちが提案するソルバーである Scalable Imitation Learning for LMAPF (SILLM) は、最新の GPU の助けを借りて、学習ベースの手法の高速推論速度と検索ベースの手法の高いソリューション品質を継承しています。
最大 10,000 のエージェントとさまざまな障害物構造を含む 6 つの大規模マップにわたって、SILLM は最高の学習ベースと検索ベースのベースラインを上回り、それぞれ 137.7% と 16.0% の平均スループットの向上を達成しました。
さらに、SILLM は、Amazon Robotics が主催する国際 LMAPF コンテストである 2023 League of Robot Runners の優勝ソリューションにも勝利しました。
最後に、モックアップ倉庫環境で 10 台の実際のロボットと 100 台の仮想ロボットを使用して SILLM を検証しました。

要約(オリジナル)

Lifelong Multi-Agent Path Finding (LMAPF) is a variant of MAPF where agents are continually assigned new goals, necessitating frequent re-planning to accommodate these dynamic changes. Recently, this field has embraced learning-based methods, which reactively generate single-step actions based on individual local observations. However, it is still challenging for them to match the performance of the best search-based algorithms, especially in large-scale settings. This work proposes an imitation-learning-based LMAPF solver that introduces a novel communication module and systematic single-step collision resolution and global guidance techniques. Our proposed solver, Scalable Imitation Learning for LMAPF (SILLM), inherits the fast reasoning speed of learning-based methods and the high solution quality of search-based methods with the help of modern GPUs. Across six large-scale maps with up to 10,000 agents and varying obstacle structures, SILLM surpasses the best learning- and search-based baselines, achieving average throughput improvements of 137.7% and 16.0%, respectively. Furthermore, SILLM also beats the winning solution of the 2023 League of Robot Runners, an international LMAPF competition sponsored by Amazon Robotics. Finally, we validated SILLM with 10 real robots and 100 virtual robots in a mockup warehouse environment.

arxiv情報

著者	He Jiang,Yutong Wang,Rishi Veerapaneni,Tanishq Duhan,Guillaume Sartoretti,Jiaoyang Li
発行日	2024-10-28 18:13:15+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー