jarxiv | Japanese arxiv | ページ 590

Human-Robot Interaction and Perceived Irrationality: A Study of Trust Dynamics and Error Acknowledgment

投稿日: 2025年5月8日作成者: jarxiv

要約

ロボットがさまざまな産業にますます統合されるようになるにつれて、人間がロボットの故障にどのように反応するかを理解することが重要です。
この研究では、ロボット障害に対する人間の反応を分析することにより、信頼のダイナミクスとシステムの設計を体系的に調べます。
4段階の調査を実施して、人間とロボットの相互作用を通じて信頼がどのように進化するかを調査しました。
最初の段階では、人口統計データと初期信頼レベルを収集しました。
第2段階は、ロボット能力の予備的な期待と認識に焦点を当てました。
3番目の段階では、ロボットの精度とエラーの確認など、相互作用の詳細を調べました。
最後に、第4段階では、相互作用後の認識を評価し、信頼のダイナミクス、赦し、ロボットテクノロジーを推奨する意欲を評価しました。
結果は、ロボットがエラーや制限を認めた場合、ロボットシステムへの信頼が大幅に増加したことを示しています。
さらに、参加者は、将来のタスクのためにロボットを提案する意欲を高め、信頼のダイナミクスを形成するための直接的な関与の重要性を強調しました。
これらの調査結果は、より透明性が高く、応答性が高く、信頼できるロボットシステムを設計するための貴重な洞察を提供します。
人間とロボットの相互作用（HRI）の理解を高めることにより、この研究は、より大きな一般の受け入れと採用を促進するロボット技術の開発に貢献しています。

要約(オリジナル)

As robots become increasingly integrated into various industries, understanding how humans respond to robotic failures is critical. This study systematically examines trust dynamics and system design by analyzing human reactions to robot failures. We conducted a four-stage survey to explore how trust evolves throughout human-robot interactions. The first stage collected demographic data and initial trust levels. The second stage focused on preliminary expectations and perceptions of robotic capabilities. The third stage examined interaction details, including robot precision and error acknowledgment. Finally, the fourth stage assessed post-interaction perceptions, evaluating trust dynamics, forgiveness, and willingness to recommend robotic technologies. Results indicate that trust in robotic systems significantly increased when robots acknowledged their errors or limitations. Additionally, participants showed greater willingness to suggest robots for future tasks, highlighting the importance of direct engagement in shaping trust dynamics. These findings provide valuable insights for designing more transparent, responsive, and trustworthy robotic systems. By enhancing our understanding of human-robot interaction (HRI), this study contributes to the development of robotic technologies that foster greater public acceptance and adoption.

arxiv情報

著者	Ponkoj Chandra Shill,Md. Azizul Hakim
発行日	2025-05-06 21:25:02+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.HC, cs.RO, cs.SE | コメントを受け付けていません

PARC: Physics-based Augmentation with Reinforcement Learning for Character Controllers

投稿日: 2025年5月8日作成者: jarxiv

要約

人間は、壁を登ったり、隙間を飛び越えたりするなど、ダイナミックな操作を行うパルクールの開業医によって例示される、アジャイルな運動能力を備えた多様で複雑な環境をナビゲートすることに優れています。
これらのアジャイルな動きをシミュレートされたキャラクターで再現することは、アジャイル地形の横断行動のモーションキャプチャデータが不足していることと、そのようなデータを取得するための高コストのために、挑戦的なままです。
この作業では、機械学習と物理学ベースのシミュレーションを活用してモーションデータセットを繰り返し、地形横断コントローラーの機能を拡張するために機械学習と物理ベースのシミュレーションを活用するフレームワークであるPARC（物理学ベースの増強）を紹介します。
PARCは、コア地形トラバーサルスキルで構成される小さなデータセットでモーションジェネレーターをトレーニングすることから始めます。
その後、モーションジェネレーターを使用して、新しい地形を横断するための合成データを作成します。
ただし、これらの生成された動きは、誤った連絡先や不連続などのアーティファクトをしばしば示します。
これらのアーティファクトを修正するために、物理ベースの追跡コントローラーをトレーニングして、シミュレーションの動きを模倣します。
次に、修正された動きがデータセットに追加され、次の反復でモーションジェネレーターのトレーニングを継続するために使用されます。
PARCの反復プロセスは、モーションジェネレーターとトラッカーの機能を共同で拡張し、複雑な環境と相互作用するためのアジャイルで多用途のモデルを作成します。
PARCは、モーションデータの希少性と多用途のキャラクターコントローラーの必要性とのギャップを橋渡しするアジャイル地形トラバーサルのコントローラーを開発するための効果的なアプローチを提供します。

要約(オリジナル)

Humans excel in navigating diverse, complex environments with agile motor skills, exemplified by parkour practitioners performing dynamic maneuvers, such as climbing up walls and jumping across gaps. Reproducing these agile movements with simulated characters remains challenging, in part due to the scarcity of motion capture data for agile terrain traversal behaviors and the high cost of acquiring such data. In this work, we introduce PARC (Physics-based Augmentation with Reinforcement Learning for Character Controllers), a framework that leverages machine learning and physics-based simulation to iteratively augment motion datasets and expand the capabilities of terrain traversal controllers. PARC begins by training a motion generator on a small dataset consisting of core terrain traversal skills. The motion generator is then used to produce synthetic data for traversing new terrains. However, these generated motions often exhibit artifacts, such as incorrect contacts or discontinuities. To correct these artifacts, we train a physics-based tracking controller to imitate the motions in simulation. The corrected motions are then added to the dataset, which is used to continue training the motion generator in the next iteration. PARC’s iterative process jointly expands the capabilities of the motion generator and tracker, creating agile and versatile models for interacting with complex environments. PARC provides an effective approach to develop controllers for agile terrain traversal, which bridges the gap between the scarcity of motion data and the need for versatile character controllers.

arxiv情報

著者	Michael Xu,Yi Shi,KangKang Yin,Xue Bin Peng
発行日	2025-05-06 22:29:07+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.AI, cs.GR, cs.LG, cs.RO | コメントを受け付けていません

Online Controller Synthesis for Robot Collision Avoidance: A Case Study

投稿日: 2025年5月8日作成者: jarxiv

要約

動的環境の固有の不確実性は、特に衝突回避などのタスクにおいて、ロボットの動作をモデル化するための重要な課題をもたらします。
このペーパーでは、ディストリビューションのシフトに取り組むことに焦点を当てた、ディープラーニングベースの認識コンポーネントを装備したロボットに合わせたオンラインコントローラー合成フレームワークを紹介します。
私たちのアプローチは、深いニューラルネットワーク認識コンポーネントの定期的な監視と修復メカニズムを統合し、その後に不確実性の再評価を統合します。
これらの不確実性評価は、パラメトリック離散時間マルコフチェーンに注入され、確率的モデルチェックを介して堅牢なコントローラーの合成を可能にします。
修理プロセス中に高いシステムの可用性を確保するために、運用状態間でシームレスに遷移するデュアルコンポーネント構成を提案します。
ロボットの衝突回避に関するケーススタディを通じて、私たちの方法の有効性を実証し、ベースラインアプローチよりも実質的なパフォーマンスの改善を示します。
この作業は、不確実な環境で動作する自律システムの安全性と信頼性を高めるための包括的なスケーラブルなソリューションを提供します。

要約(オリジナル)

The inherent uncertainty of dynamic environments poses significant challenges for modeling robot behavior, particularly in tasks such as collision avoidance. This paper presents an online controller synthesis framework tailored for robots equipped with deep learning-based perception components, with a focus on addressing distribution shifts. Our approach integrates periodic monitoring and repair mechanisms for the deep neural network perception component, followed by uncertainty reassessment. These uncertainty evaluations are injected into a parametric discrete-time markov chain, enabling the synthesis of robust controllers via probabilistic model checking. To ensure high system availability during the repair process, we propose a dual-component configuration that seamlessly transitions between operational states. Through a case study on robot collision avoidance, we demonstrate the efficacy of our method, showcasing substantial performance improvements over baseline approaches. This work provides a comprehensive and scalable solution for enhancing the safety and reliability of autonomous systems operating in uncertain environments.

arxiv情報

著者	Yuheng Fan,Wang Lin
発行日	2025-05-06 23:55:42+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO, cs.SY, eess.SY | コメントを受け付けていません

Context-aware LLM-based Safe Control Against Latent Risks

投稿日: 2025年5月8日作成者: jarxiv

要約

自律制御システムは、潜在的なリスクの存在下で複雑なタスクを実行する際に大きな課題に直面しています。
これに対処するために、大規模な言語モデル（LLM）、数値的最適化、最適化ベースの制御を組み合わせた統合フレームワークを提案し、潜在リスクに対する安全性を確保しながら効率的なサブタスク学習を促進します。
このフレームワークは、複雑なタスクを分解して、潜在的なリスクを説明する一連のコンテキスト認識サブタスクに分類されます。
これらのサブタスクとそのパラメーターは、マルチタイムスケールのプロセス、つまり高層マルチターン内学習、中層LLMチェーンの推論と数値的最適化、および低層モデルの予測制御を通じて洗練されます。
このフレームワークは、低層最適化プロセスと物理シミュレーターから定性的フィードバックと最適化された軌道データを活用することにより、意思決定を改善します。
ロボットアームと自律車両シナリオを含むシミュレートされたケーススタディを通じて、提案されたフレームワークを検証します。
実験は、提案されたフレームワークがコンテキストと潜在的なリスクに基づいてアクションを媒介し、複雑な行動を効率的に学ぶことができることを示しています。

要約(オリジナル)

Autonomous control systems face significant challenges in performing complex tasks in the presence of latent risks. To address this, we propose an integrated framework that combines Large Language Models (LLMs), numerical optimization, and optimization-based control to facilitate efficient subtask learning while ensuring safety against latent risks. The framework decomposes complex tasks into a sequence of context-aware subtasks that account for latent risks. These subtasks and their parameters are then refined through a multi-time-scale process: high-layer multi-turn in-context learning, mid-layer LLM Chain-of-Thought reasoning and numerical optimization, and low-layer model predictive control. The framework iteratively improves decisions by leveraging qualitative feedback and optimized trajectory data from lower-layer optimization processes and a physics simulator. We validate the proposed framework through simulated case studies involving robot arm and autonomous vehicle scenarios. The experiments demonstrate that the proposed framework can mediate actions based on the context and latent risks and learn complex behaviors efficiently.

arxiv情報

著者	Xiyu Deng,Quan Khanh Luu,Anh Van Ho,Yorie Nakahira
発行日	2025-05-06 23:56:25+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO, cs.SY, eess.SY | コメントを受け付けていません

LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World

投稿日: 2025年5月8日作成者: jarxiv

要約

モデルベースの強化学習は、サンプルの効率が高いために多くの注目を集めており、実際のロボットアプリケーションに適用されると予想されています。
現実の世界では、観察不可能な障害が予期しない状況につながる可能性があるため、コントロールパフォーマンスだけでなく堅牢性も改善するためにロボットポリシーを取る必要があります。
敵対学習は堅牢性を改善する効果的な方法ですが、過度の敵は誤動作のリスクを高め、制御パフォーマンスを保守的にしすぎます。
したがって、この研究では、強化学習を適度に堅牢にし、保守的ではないようにするための新しい敵対的な学習フレームワークに対処しています。
この目的のために、敵対的な学習は最初に変分推論で再編成されます。
さらに、許容可能なパフォーマンス劣化内で堅牢性を最大化できる\ textit {light robustness}は、制約として利用されます。
その結果、提案されたフレームワークであるいわゆるLIRAは、敵対的レベルを自動的に調整し、堅牢性と保守性のバランスをとることができます。
LIRAの予想される動作は、数値シミュレーションで確認されています。
さらに、Liraは、2時間未満で収集された実際のデータでのみ、四重類のロボットの力反応性歩行制御を学ぶことに成功しています。

要約(オリジナル)

Model-based reinforcement learning has attracted much attention due to its high sample efficiency and is expected to be applied to real-world robotic applications. In the real world, as unobservable disturbances can lead to unexpected situations, robot policies should be taken to improve not only control performance but also robustness. Adversarial learning is an effective way to improve robustness, but excessive adversary would increase the risk of malfunction, and make the control performance too conservative. Therefore, this study addresses a new adversarial learning framework to make reinforcement learning robust moderately and not conservative too much. To this end, the adversarial learning is first rederived with variational inference. In addition, \textit{light robustness}, which allows for maximizing robustness within an acceptable performance degradation, is utilized as a constraint. As a result, the proposed framework, so-called LiRA, can automatically adjust adversary level, balancing robustness and conservativeness. The expected behaviors of LiRA are confirmed in numerical simulations. In addition, LiRA succeeds in learning a force-reactive gait control of a quadrupedal robot only with real-world data collected less than two hours.

arxiv情報

著者	Taisuke Kobayashi
発行日	2025-05-06 23:59:24+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO | コメントを受け付けていません

Scalable Aerial GNSS Localization for Marine Robots

投稿日: 2025年5月8日作成者: jarxiv

要約

水ロボット工学にとって正確なローカリゼーションは重要ですが、水面上の信号反射と水生GNSSレシーバーのコストが高いため、従来のオンボードグローバルナビゲーション衛星システム（GNSS）アプローチは困難または効果がありません。
慣性ナビゲーション、ドップラー速度ロガー（DVL）、スラム、アコースティックベースの方法などの既存のアプローチは、エラーの蓄積や高い計算の複雑さなどの課題に直面しています。
したがって、より効率的でスケーラブルなソリューションが必要です。
このペーパーでは、水面の近くにある後に海洋ロボットを追跡およびローカライズするためにGNSSのローカリゼーションを装備した空中ドローンを活用する代替アプローチを提案します。
私たちの結果は、この新しい適応により、正確な単一およびマルチロボットマリンロボットのローカリゼーションが可能になることを示しています。

要約(オリジナル)

Accurate localization is crucial for water robotics, yet traditional onboard Global Navigation Satellite System (GNSS) approaches are difficult or ineffective due to signal reflection on the water’s surface and its high cost of aquatic GNSS receivers. Existing approaches, such as inertial navigation, Doppler Velocity Loggers (DVL), SLAM, and acoustic-based methods, face challenges like error accumulation and high computational complexity. Therefore, a more efficient and scalable solution remains necessary. This paper proposes an alternative approach that leverages an aerial drone equipped with GNSS localization to track and localize a marine robot once it is near the surface of the water. Our results show that this novel adaptation enables accurate single and multi-robot marine robot localization.

arxiv情報

著者	Shuo Wen,Edwin Meriaux,Mariana Sosa Guzmán,Charlotte Morissette,Chloe Si,Bobak Baghi,Gregory Dudek
発行日	2025-05-07 03:18:59+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

PNE-SGAN: Probabilistic NDT-Enhanced Semantic Graph Attention Network for LiDAR Loop Closure Detection

投稿日: 2025年5月8日作成者: jarxiv

要約

LIDARループ閉鎖検出（LCD）は、一貫した同時ローカリゼーションとマッピング（SLAM）に不可欠ですが、堅牢性と精度の課題に直面しています。
セマンティックグラフアプローチを含む既存の方法は、多くの場合、粗い幾何学的表現に悩まされ、ノイズ、ダイナミクス、視点の変化に対する時間的堅牢性がありません。
これらの制限を克服するために、確率的NDT強化セマンティックグラフ注意ネットワークであるPne-sganを紹介します。
Pne-sganは、グラフ注意ネットワーク（GAT）を介して処理された、豊富で識別幾何学的ノード機能として、通常の分布変換（NDT）共分散行列を使用してセマンティックグラフを強化します。
重要なことに、グラフの類似性スコアを確率論的な時間フィルタリングフレームワーク（HMM/ベイズフィルターとしてモデル化）に統合し、モーションモデリングに不確実な匂い測定を組み込み、前袋のスムージングを利用してあいまいさを効果的に処理します。
挑戦的なキティシーケンス（00および08）の評価は、それぞれ96.2 \％と95.1 \％の平均精度を達成し、それぞれ最先端のパフォーマンスを実証しています。
PNE-sganは、特に他の人がゆるむ困難な双方向ループシナリオで、既存の方法を大幅に上回ります。
原則的な確率的時間的推論で詳細なNDTジオメトリを相乗的にすることにより、PNE-SGANはLIDAR LCDに非常に正確で堅牢なソリューションを提供し、複雑で大規模な環境でのSLAMの信頼性を高めます。

要約(オリジナル)

LiDAR loop closure detection (LCD) is crucial for consistent Simultaneous Localization and Mapping (SLAM) but faces challenges in robustness and accuracy. Existing methods, including semantic graph approaches, often suffer from coarse geometric representations and lack temporal robustness against noise, dynamics, and viewpoint changes. We introduce PNE-SGAN, a Probabilistic NDT-Enhanced Semantic Graph Attention Network, to overcome these limitations. PNE-SGAN enhances semantic graphs by using Normal Distributions Transform (NDT) covariance matrices as rich, discriminative geometric node features, processed via a Graph Attention Network (GAT). Crucially, it integrates graph similarity scores into a probabilistic temporal filtering framework (modeled as an HMM/Bayes filter), incorporating uncertain odometry for motion modeling and utilizing forward-backward smoothing to effectively handle ambiguities. Evaluations on challenging KITTI sequences (00 and 08) demonstrate state-of-the-art performance, achieving Average Precision of 96.2\% and 95.1\%, respectively. PNE-SGAN significantly outperforms existing methods, particularly in difficult bidirectional loop scenarios where others falter. By synergizing detailed NDT geometry with principled probabilistic temporal reasoning, PNE-SGAN offers a highly accurate and robust solution for LiDAR LCD, enhancing SLAM reliability in complex, large-scale environments.

arxiv情報

著者	Xiong Li,Shulei Liu,Xingning Chen,Yisong Wu,Dong Zhu
発行日	2025-05-07 03:30:04+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Opening Articulated Structures in the Real World

投稿日: 2025年5月8日作成者: jarxiv

要約

以前に見えなかった環境で、以前に見えなかったオブジェクトで有能に動作できるモバイル操作システムを構築するには何が必要ですか？
この作業は、モバイル操作テストベッドとして明確な構造を開くことを使用して、この質問に答えます。
具体的には、私たちの焦点は、特権情報なしでこのタスクのエンドツーエンドのパフォーマンスにあります。つまり、ロボットは、新しいターゲットの明確なオブジェクトが表示されている場所から始まり、オブジェクトにアプローチして正常に開く必要があります。
最初にこのタスクのシステムを開発し、次に13の実際のテストサイトで100以上のエンドツーエンドシステムテストを実施します。
私たちの大規模な研究は、多くの驚くべき発見を明らかにしています：a）モジュラーシステムは、エンドツーエンドの学習システムが1000+以上のデモンストレーションでトレーニングされている場合でも、このタスクのエンドツーエンドの学習システムを上回ります。b）知覚、および正確なエンドエフェクターコントロールではなく、タスクの成功への主要なボトルネックであり、c）最先端の芸術的パラメーターの推定モデルが現場で描かれている場合に、最先端のアーチキュレーションパラメーターの推定モデルが開発されました。
全体として、我々の調査結果は、パイプラインの成分の開発の限界を単独で強調し、システムレベルの研究の必要性を強調し、一般化可能なモバイル操作システムを構築するための実用的なロードマップを提供します。
ビデオ、コード、モデルはプロジェクトWebサイト：https：//arjung128.github.io/opening-articulaturet-crutures/で入手できます。

要約(オリジナル)

What does it take to build mobile manipulation systems that can competently operate on previously unseen objects in previously unseen environments? This work answers this question using opening of articulated structures as a mobile manipulation testbed. Specifically, our focus is on the end-to-end performance on this task without any privileged information, i.e. the robot starts at a location with the novel target articulated object in view, and has to approach the object and successfully open it. We first develop a system for this task, and then conduct 100+ end-to-end system tests across 13 real world test sites. Our large-scale study reveals a number of surprising findings: a) modular systems outperform end-to-end learned systems for this task, even when the end-to-end learned systems are trained on 1000+ demonstrations, b) perception, and not precise end-effector control, is the primary bottleneck to task success, and c) state-of-the-art articulation parameter estimation models developed in isolation struggle when faced with robot-centric viewpoints. Overall, our findings highlight the limitations of developing components of the pipeline in isolation and underscore the need for system-level research, providing a pragmatic roadmap for building generalizable mobile manipulation systems. Videos, code, and models are available on the project website: https://arjung128.github.io/opening-articulated-structures/

arxiv情報

著者	Arjun Gupta,Michelle Zhang,Rishik Sathua,Saurabh Gupta
発行日	2025-05-07 03:38:59+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.AI, cs.CV, cs.LG, cs.RO | コメントを受け付けていません

Bimanual Regrasp Planning and Control for Active Reduction of Object Pose Uncertainty

投稿日: 2025年5月8日作成者: jarxiv

要約

オブジェクトを正確に把握することは、不確実性をもたらすため、挑戦的な作業です。
従来の方法では、カメラと備品を使用してオブジェクトの不確実性を軽減しています。
それらは効果的ですが、オブジェクトジオメトリに基づいたジグの設計や、レーザーを使用して製造された高精度ツールを備えたカメラを較正するなど、集中的な準備が必要です。
この研究では、フィクスチャやカメラを使用せずに、把持したオブジェクトの位置と方向の不確実性を減らす方法を提案します。
私たちの方法は、平行グリッパーの平らな指パッドが、平らな表面接触を介して開口/閉じ方向に沿って不確実性を減らすことができるという概念に基づいています。
平らな指のパッドを備えた平行グリッパーによる3つの直交goの握りは、オブジェクトの位置と方向を一意の状態にまとめて制限します。
概念に導かれて、オブジェクトのポーズの不確実性を積極的に減らすために、2つのロボットアームの3つの直交握りを順次見つけてレバレッジする再aspの計画とアドミタンス制御アプローチを開発します。
さまざまな初期オブジェクトの不確実性に関する提案された方法を評価し、それが良好な再現性があることを確認しました。
実験試験の偏差レベルは、光学追跡システムのレベルと同じ桁であり、強力な相対推論のパフォーマンスを示しています。

要約(オリジナル)

Precisely grasping an object is a challenging task due to pose uncertainties. Conventional methods have used cameras and fixtures to reduce object uncertainty. They are effective but require intensive preparation, such as designing jigs based on the object geometry and calibrating cameras with high-precision tools fabricated using lasers. In this study, we propose a method to reduce the uncertainty of the position and orientation of a grasped object without using a fixture or a camera. Our method is based on the concept that the flat finger pads of a parallel gripper can reduce uncertainty along its opening/closing direction through flat surface contact. Three orthogonal grasps by parallel grippers with flat finger pads collectively constrain an object’s position and orientation to a unique state. Guided by the concepts, we develop a regrasp planning and admittance control approach that sequentially finds and leverages three orthogonal grasps of two robotic arms to actively reduce uncertainties in the object pose. We evaluated the proposed method on different initial object uncertainties and verified that it had good repeatability. The deviation levels of the experimental trials were on the same order of magnitude as those of an optical tracking system, demonstrating strong relative inference performance.

arxiv情報

著者	Ryuta Nagahama,Weiwei Wan,Zhengtao Hu,Kensuke Harada
発行日	2025-05-07 04:53:51+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO | コメントを受け付けていません

$\texttt{SPIN}$: distilling $\texttt{Skill-RRT}$ for long-horizon prehensile and non-prehensile manipulation

投稿日: 2025年5月8日作成者: jarxiv

要約

現在のロボットは、プレヘンシルおよび非摂食スキルのシーケンス、接触豊富な相互作用、および長期的な推論を必要とする長老操作タスクと格闘しています。
$ \ texttt {spin} $（$ \ textbf {s} $ kill $ \ textbf {p} $ lanning to $ \ textbf {in} $ ference）を提示します。
スキルの適用可能性チェックとそのような長期の問題を解決するための中間オブジェクトのポーズサンプリングを組み込んだRRTの拡張である$ \ texttt {skill-rrt} $を提案します。
独立して訓練されたスキルをチェーンするために、移行中のオブジェクトの妨害を最小限に抑えるためにトレーニングされた目標調整されたポリシー$ \ textit {connectors} $を紹介します。
高品質のデモンストレーションは、$ \ texttt {Skill-RRT} $で生成され、オンラインの計算時間を短縮するためにノイズベースのリプレイで蒸留されます。
完全にシミュレーションで訓練された結果のポリシーは、現実世界にゼロショットを転送し、3つの挑戦的な長期操作タスクで80％以上の成功を達成し、最先端の階層RLと計画方法を上回ります。

要約(オリジナル)

Current robots struggle with long-horizon manipulation tasks requiring sequences of prehensile and non-prehensile skills, contact-rich interactions, and long-term reasoning. We present $\texttt{SPIN}$ ($\textbf{S}$kill $\textbf{P}$lanning to $\textbf{IN}$ference), a framework that distills a computationally intensive planning algorithm into a policy via imitation learning. We propose $\texttt{Skill-RRT}$, an extension of RRT that incorporates skill applicability checks and intermediate object pose sampling for solving such long-horizon problems. To chain independently trained skills, we introduce $\textit{connectors}$, goal-conditioned policies trained to minimize object disturbance during transitions. High-quality demonstrations are generated with $\texttt{Skill-RRT}$ and distilled through noise-based replay in order to reduce online computation time. The resulting policy, trained entirely in simulation, transfers zero-shot to the real world and achieves over 80% success across three challenging long-horizon manipulation tasks and outperforms state-of-the-art hierarchical RL and planning methods.

arxiv情報

著者	Haewon Jung,Donguk Lee,Haecheol Park,JunHyeop Kim,Beomjoon Kim
発行日	2025-05-07 05:34:19+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO | コメントを受け付けていません

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント