Strengthening Generative Robot Policies through Predictive World Modeling

要約

（i）専門家のデモンストレーションから生成的拡散ベースのポリシーをクローン化する学習制御フレームワークである生成予測制御（GPC）を提示します。
さまざまなロボット操作タスクにわたって、GPCは、シミュレーションおよび現実世界で、状態ベースと視覚ベースの設定の両方でクローニングする動作を常に上回ることを実証します。

要約(オリジナル)

We present generative predictive control (GPC), a learning control framework that (i) clones a generative diffusion-based policy from expert demonstrations, (ii) trains a predictive action-conditioned world model from both expert demonstrations and random explorations, and (iii) synthesizes an online planner that ranks and optimizes the action proposals from (i) by looking ahead into the future using the world model from (ii). Across a variety of robotic manipulation tasks, we demonstrate that GPC consistently outperforms behavior cloning in both state-based and vision-based settings, in simulation and in the real world.

arxiv情報

著者	Han Qi,Haocheng Yin,Aris Zhu,Yilun Du,Heng Yang
発行日	2025-05-22 03:40:40+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Strengthening Generative Robot Policies through Predictive World Modeling

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー