STAP: Sequencing Task-Agnostic Policies

要約

ロボットのスキル習得の進歩により、下流の操作タスク用に学習したスキルの汎用ライブラリを構築することが可能になりました。
ただし、長期計画によく見られるアクション間の依存関係を考慮せずに、これらのスキルを単純に次々に実行することは成功する可能性は低いです。
我々は、操作スキルをトレーニングし、トレーニング中にどのスキルも見たことのない長期的なタスクを解決するために、計画時に操作スキルの幾何学的依存関係を調整するためのスケーラブルなフレームワークであるシーケンシングタスク非依存ポリシー (STAP) を紹介します。
Q 関数がスキルの実現可能性の尺度をエンコードしていることを考慮して、計画に順序付けされたすべてのスキルの共同成功を最大化する最適化問題を定式化します。これは、Q 値の積によって推定されます。
私たちの実験では、この目的関数がグラウンドトゥルース計画の実現可能性を近似しており、計画の目的として使用すると近視眼的な行動が軽減され、それによって長期的なタスクの成功が促進されることが示されています。
さらに、タスクプランナーが提供するスキルシーケンスの幾何学的実現可能性を推定することにより、タスクと動作の計画に STAP をどのように使用できるかを実証します。
私たちはシミュレーションと実際のロボットでアプローチを評価します。
定性的な結果とコードは https://sites.google.com/stanford.edu/stap で入手できます。

要約(オリジナル)

Advances in robotic skill acquisition have made it possible to build general-purpose libraries of learned skills for downstream manipulation tasks. However, naively executing these skills one after the other is unlikely to succeed without accounting for dependencies between actions prevalent in long-horizon plans. We present Sequencing Task-Agnostic Policies (STAP), a scalable framework for training manipulation skills and coordinating their geometric dependencies at planning time to solve long-horizon tasks never seen by any skill during training. Given that Q-functions encode a measure of skill feasibility, we formulate an optimization problem to maximize the joint success of all skills sequenced in a plan, which we estimate by the product of their Q-values. Our experiments indicate that this objective function approximates ground truth plan feasibility and, when used as a planning objective, reduces myopic behavior and thereby promotes long-horizon task success. We further demonstrate how STAP can be used for task and motion planning by estimating the geometric feasibility of skill sequences provided by a task planner. We evaluate our approach in simulation and on a real robot. Qualitative results and code are made available at https://sites.google.com/stanford.edu/stap.

arxiv情報

著者	Christopher Agia,Toki Migimatsu,Jiajun Wu,Jeannette Bohg
発行日	2023-05-31 10:53:34+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

STAP: Sequencing Task-Agnostic Policies

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー