LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner

要約

言語モデル (LM) は自然言語を理解する強力な能力を備えており、人間の指示を単純なロボットタスクの詳細な計画に変換するのに効果的です。
それにもかかわらず、特に協力的な異種ロボットチームのサブタスクの特定と割り当てにおいて、長期的なタスクを処理することは依然として大きな課題です。
この問題に対処するために、私たちは、長期タスクで最先端のパフォーマンスを実現する新しいマルチエージェントタスクプランニングフレームワークである、言語モデル駆動型マルチエージェント PDDL プランナー (LaMMA-P) を提案します。
LaMMA-P は、LM の推論能力と従来のヒューリスティック検索プランナーの長所を統合して、タスク全体にわたる強力な一般化を実証しながら、高い成功率と効率を実現します。
さらに、AI2-THOR 環境に基づいて 2 つの異なる複雑さの家事タスクを特徴とする包括的なベンチマークである MAT-THOR を作成します。
実験結果は、LaMMA-P が既存の LM ベースのマルチエージェントプランナーよりも 105% 高い成功率と 36% 高い効率を達成することを示しています。
この作業の実験ビデオ、コード、データセット、および各モジュールで使用される詳細なプロンプトは、https://lamma-p.github.io で入手できます。

要約(オリジナル)

Language models (LMs) possess a strong capability to comprehend natural language, making them effective in translating human instructions into detailed plans for simple robot tasks. Nevertheless, it remains a significant challenge to handle long-horizon tasks, especially in subtask identification and allocation for cooperative heterogeneous robot teams. To address this issue, we propose a Language Model-Driven Multi-Agent PDDL Planner (LaMMA-P), a novel multi-agent task planning framework that achieves state-of-the-art performance on long-horizon tasks. LaMMA-P integrates the strengths of the LMs’ reasoning capability and the traditional heuristic search planner to achieve a high success rate and efficiency while demonstrating strong generalization across tasks. Additionally, we create MAT-THOR, a comprehensive benchmark that features household tasks with two different levels of complexity based on the AI2-THOR environment. The experimental results demonstrate that LaMMA-P achieves a 105% higher success rate and 36% higher efficiency than existing LM-based multi-agent planners. The experimental videos, code, and datasets of this work as well as the detailed prompts used in each module are available at https://lamma-p.github.io.

arxiv情報

著者	Xiaopan Zhang,Hao Qin,Fuquan Wang,Yue Dong,Jiachen Li
発行日	2024-09-30 17:58:18+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー