LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement

要約

実行可能な意味論的オブジェクトの再配置問題に対する新しいアプローチを紹介します。
この課題では、ロボットは、自然言語の記述によって指示されたパターンに従ってシーン内のオブジェクトを再配置する実行可能な計画を作成しようとします。
StructFormer や StructDiffusion などの既存の手法は、最初にポーズを生成し、次にタスクプランナーを利用してアクションプランを策定するという 2 つのステップで問題に取り組みますが、これとは異なり、私たちの手法はポーズの生成とアクションプランニングに同時に対処します。
この統合は、言語ガイド付きモンテカルロツリー検索 (LGMCTS) を使用して実現します。
定量的評価は 2 つのシミュレーションデータセットで提供され、実際のロボットを使用した定性テストによって補完されます。

要約(オリジナル)

We introduce a novel approach to the executable semantic object rearrangement problem. In this challenge, a robot seeks to create an actionable plan that rearranges objects within a scene according to a pattern dictated by a natural language description. Unlike existing methods such as StructFormer and StructDiffusion, which tackle the issue in two steps by first generating poses and then leveraging a task planner for action plan formulation, our method concurrently addresses pose generation and action planning. We achieve this integration using a Language-Guided Monte-Carlo Tree Search (LGMCTS). Quantitative evaluations are provided on two simulation datasets, and complemented by qualitative tests with a real robot.

arxiv情報

著者	Haonan Chang,Kai Gao,Kowndinya Boyalakuntla,Alex Lee,Baichuan Huang,Harish Udhaya Kumar,Jinjin Yu,Abdeslam Boularias
発行日	2024-10-07 14:55:12+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー