jarxiv | Japanese arxiv | ページ 689

Path Planning and Optimization for Cuspidal 6R Manipulators

投稿日: 2025年5月1日作成者: jarxiv

要約

帯状脈ロボットは、ある逆運動学（IK）ソリューションから、特異点を交差させることなく別の逆に移動できます。
複数の産業用ロボットは帯状疱疹です。
彼らは美しい機械的デザインを持っている傾向がありますが、パス計画の課題を引き起こします。
タスクスペースパスには、パスに沿った各ポイントに対して有効なIKソリューションがある場合がありますが、連続的なジョイントスペースパスはIKソリューションの選択に依存するか、実行不可能である場合があります。
このホワイトペーパーでは、新しい分析、経路計画、および最適化方法を紹介して、帯状回ロボットのユーティリティを強化します。
最初に、帯状回ロボットを特定する効率的な方法を実証し、3つの平行関節軸を持つABB GOFAと特定のロボットが幹部であることを初めて示します。
次に、タスク空間パスに沿って各ポイントのすべてのIKソリューションを見つけ、IKソリューションに対応する各頂点を接続するグラフを構築することにより、帯状筋ロボットの新しいパス計画方法を提案します。
グラフエッジには、関節速度の最小化など、最適化メトリックに基づいた重みがあります。
最適な実行可能パスは、グラフの最短パスです。
この方法では、単一性を通過する滑らかな経路と同様に、非弦楽路を見つけることができます。
最後に、このパス計画方法をパス最適化アルゴリズムに組み込みます。
固定されたワークスペースツールパスを考えると、ロボットベースフレームのツールパスのオフセットを最適化しながら、連続的な関節運動を保証します。
コードの例は、公開可能なリポジトリで入手できます。

要約(オリジナル)

A cuspidal robot can move from one inverse kinematics (IK) solution to another without crossing a singularity. Multiple industrial robots are cuspidal. They tend to have a beautiful mechanical design, but they pose path planning challenges. A task-space path may have a valid IK solution for each point along the path, but a continuous joint-space path may depend on the choice of the IK solution or even be infeasible. This paper presents new analysis, path planning, and optimization methods to enhance the utility of cuspidal robots. We first demonstrate an efficient method to identify cuspidal robots and show, for the first time, that the ABB GoFa and certain robots with three parallel joint axes are cuspidal. We then propose a new path planning method for cuspidal robots by finding all IK solutions for each point along a task-space path and constructing a graph to connect each vertex corresponding to an IK solution. Graph edges have a weight based on the optimization metric, such as minimizing joint velocity. The optimal feasible path is the shortest path in the graph. This method can find non-singular paths as well as smooth paths which pass through singularities. Finally, we incorporate this path planning method into a path optimization algorithm. Given a fixed workspace toolpath, we optimize the offset of the toolpath in the robot base frame while ensuring continuous joint motion. Code examples are available in a publicly accessible repository.

arxiv情報

著者	Alexander J. Elias,John T. Wen
発行日	2025-04-29 19:59:55+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO | コメントを受け付けていません

Safety with Agency: Human-Centered Safety Filter with Application to AI-Assisted Motorsports

投稿日: 2025年5月1日作成者: jarxiv

要約

人間の機関を損なうことなくシステムの安全性を大幅に向上させる共有自律性のために、人間中心の安全フィルター（HCSF）を提案します。
当社のHCSFは、神経安全価値関数に基づいて構築されています。これは、最初にブラックボックスの相互作用を通じて拡張的に学習し、次に展開時に使用して、新しい状態アクション制御バリア機能（Q-CBF）の安全制約を実施します。
このQ-CBF安全フィルターは、合成とランタイムの安全性の監視と介入の両方について、システムのダイナミクスに関する知識を必要としないため、私たちの方法は複雑なブラックボックス共有の自律システムに容易に適用されます。
特に、HCSFのCBFベースの介入は、多くの従来の安全フィルターによって提供される突然の土壇場の修正を回避し、人間の行動を最小限かつスムーズに変更します。
Assetto Corsa-A Highfichity Car Racing Simulatorを使用して、Assetto Corsa-A High-Fidelity Car Racing Simulatorを使用して、「エッジ上での運転」シナリオでの堅牢性を評価する包括的な対面ユーザー調査におけるアプローチを検証します。
軌道データと、HCSF支援に対するドライバーの認識の両方を、非支援運転と従来の安全フィルターと比較します。
実験結果は、1）支援がないのと比較して、HCSFは人間の代理店や快適さを損なうことなく安全性とユーザーの満足度の両方を改善し、2）従来の安全フィルターと比較して、提案されているHCSFは、堅牢性を維持しながら人間の代理店、快適性、満足度を高めます。

要約(オリジナル)

We propose a human-centered safety filter (HCSF) for shared autonomy that significantly enhances system safety without compromising human agency. Our HCSF is built on a neural safety value function, which we first learn scalably through black-box interactions and then use at deployment to enforce a novel state-action control barrier function (Q-CBF) safety constraint. Since this Q-CBF safety filter does not require any knowledge of the system dynamics for both synthesis and runtime safety monitoring and intervention, our method applies readily to complex, black-box shared autonomy systems. Notably, our HCSF’s CBF-based interventions modify the human’s actions minimally and smoothly, avoiding the abrupt, last-moment corrections delivered by many conventional safety filters. We validate our approach in a comprehensive in-person user study using Assetto Corsa-a high-fidelity car racing simulator with black-box dynamics-to assess robustness in ‘driving on the edge’ scenarios. We compare both trajectory data and drivers’ perceptions of our HCSF assistance against unassisted driving and a conventional safety filter. Experimental results show that 1) compared to having no assistance, our HCSF improves both safety and user satisfaction without compromising human agency or comfort, and 2) relative to a conventional safety filter, our proposed HCSF boosts human agency, comfort, and satisfaction while maintaining robustness.

arxiv情報

著者	Donggeon David Oh,Justin Lidard,Haimin Hu,Himani Sinhmar,Elle Lazarski,Deepak Gopinath,Emily S. Sumner,Jonathan A. DeCastro,Guy Rosman,Naomi Ehrich Leonard,Jaime Fernández Fisac
発行日	2025-04-29 20:04:10+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO, cs.SY, eess.SY | コメントを受け付けていません

Composite Safety Potential Field for Highway Driving Risk Assessment

投稿日: 2025年5月1日作成者: jarxiv

要約

車両安全技術の急速な進歩の時代では、リスク評価の促進が注目の焦点になりました。
衝突警告システム、高度なドライバーアシスタンスシステム（ADA）、自律運転などのテクノロジーは、積極的かつリアルタイムで評価するための運転リスクを駆動する必要があります。
効果的であるために、リスク評価の促進指標は、潜在的な衝突を正確に特定するだけでなく、車両間の安全でシームレスな相互作用を可能にするために人間のような推論を示す必要があります。
既存の安全潜在フィールドモデルは、客観的な安全性と主観的な安全性の両方を考慮することにより、駆動リスクを評価します。
ただし、実際のリスク評価タスクにおけるそれらの実際的な適用性は限られています。
これらのモデルは、それらの構造の任意の性質のためにキャリブレーションをするのがしばしば困難であり、事故統計が不足しているため、キャリブレーションは非効率的です。
さらに、彼らは縦方向と横方向のリスクの両方で一般化するのに苦労しています。
これらの課題に対処するために、Composite Safetyの可能性フィールドフレームワーク、すなわちC-SPFを提案します。これは、空間的近接性に関するドライバーのリスク認識をキャプチャする主観的なフィールドと、推移的な衝突確率を定量化するための客観的なフィールドを把握し、運転リスクを包括的に評価することを提案します。
C-SPFは、軌道データセットからの豊富な2次元間隔データを使用して較正されており、ドライバーの近接リスク認識を効果的にキャプチャし、運転行動のより現実的な説明を提供できるようにします。
自然主義的な駆動データセットの分析は、C-SPFがドライバーの安全性操作を引き起こす縦方向と横方向のリスクの両方をキャプチャできることを示しています。
さらなるケーススタディは、既存のモデルが達成できない能力である隣接する車両に対する車線の変更を放棄したり、横方向の位置を調整したりするなど、横方向のドライバーの行動を説明するC-SPFの能力を強調しています。

要約(オリジナル)

In the era of rapid advancements in vehicle safety technologies, driving risk assessment has become a focal point of attention. Technologies such as collision warning systems, advanced driver assistance systems (ADAS), and autonomous driving require driving risks to be evaluated proactively and in real time. To be effective, driving risk assessment metrics must not only accurately identify potential collisions but also exhibit human-like reasoning to enable safe and seamless interactions between vehicles. Existing safety potential field models assess driving risks by considering both objective and subjective safety factors. However, their practical applicability in real-world risk assessment tasks is limited. These models are often challenging to calibrate due to the arbitrary nature of their structures, and calibration can be inefficient because of the scarcity of accident statistics. Additionally, they struggle to generalize across both longitudinal and lateral risks. To address these challenges, we propose a composite safety potential field framework, namely C-SPF, involving a subjective field to capture drivers’ risk perception about spatial proximity and an objective field to quantify the imminent collision probability, to comprehensively evaluate driving risks. The C-SPF is calibrated using abundant two-dimensional spacing data from trajectory datasets, enabling it to effectively capture drivers’ proximity risk perception and provide a more realistic explanation of driving behaviors. Analysis of a naturalistic driving dataset demonstrates that the C-SPF can capture both longitudinal and lateral risks that trigger drivers’ safety maneuvers. Further case studies highlight the C-SPF’s ability to explain lateral driver behaviors, such as abandoning lane changes or adjusting lateral position relative to adjacent vehicles, which are capabilities that existing models fail to achieve.

arxiv情報

著者	Dachuan Zuo,Zilin Bian,Fan Zuo,Kaan Ozbay
発行日	2025-04-29 20:19:47+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO, stat.AP | コメントを受け付けていません

Task and Joint Space Dual-Arm Compliant Control

投稿日: 2025年5月1日作成者: jarxiv

要約

人間と相互作用したり、繊細な操作タスクを実行したりするロボットは、コンプライアンスを示す必要があります。
ただし、ほとんどの市販のマニピュレーターは硬直しており、重大な摩擦に苦しんでおり、トルク制御モードのエンドエフェクター追跡精度を制限しています。
これに対処するために、ジョイントスペースとタスクスペースのコンプライアンスをスムーズに補間するリアルタイムのオープンソースインピーダンスコントローラーを紹介します。
このハイブリッドアプローチにより、安全な相互作用とサブセンチメートルピン挿入などの正確なタスク実行が保証されます。
2つのKinova Gen3アームを備えたデュアルアームプラットフォームであるFrankにコントローラーを展開し、モデルのないオブザーバーを使用してモデル化された摩擦ダイナミクスを補正します。
このシステムはリアルタイムの有能であり、MoveIT！などの標準的なROSツールと統合されています。
また、高周波軌道ストリーミングをサポートし、学習ベースの方法、最適な制御、または遠隔操作によって生成される軌跡の閉ループ実行を可能にします。
私たちの結果は、高摩擦条件下でさえ、堅牢な追跡と準拠の行動を示しています。
完全なシステムは、https：//github.com/applied-ai-lab/compliant_controllersでオープンソースを利用できます。

要約(オリジナル)

Robots that interact with humans or perform delicate manipulation tasks must exhibit compliance. However, most commercial manipulators are rigid and suffer from significant friction, limiting end-effector tracking accuracy in torque-controlled modes. To address this, we present a real-time, open-source impedance controller that smoothly interpolates between joint-space and task-space compliance. This hybrid approach ensures safe interaction and precise task execution, such as sub-centimetre pin insertions. We deploy our controller on Frank, a dual-arm platform with two Kinova Gen3 arms, and compensate for modelled friction dynamics using a model-free observer. The system is real-time capable and integrates with standard ROS tools like MoveIt!. It also supports high-frequency trajectory streaming, enabling closed-loop execution of trajectories generated by learning-based methods, optimal control, or teleoperation. Our results demonstrate robust tracking and compliant behaviour even under high-friction conditions. The complete system is available open-source at https://github.com/applied-ai-lab/compliant_controllers.

arxiv情報

著者	Alexander L. Mitchell,Tobit Flatscher,Ingmar Posner
発行日	2025-04-29 20:20:41+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO | コメントを受け付けていません

A Koopman Operator-based NMPC Framework for Mobile Robot Navigation under Uncertainty

投稿日: 2025年5月1日作成者: jarxiv

要約

モバイルロボットナビゲーションは、システムの不確実性によって挑戦することができます。
たとえば、地上摩擦は突然変化して滑りを引き起こす可能性があり、ノイズの多いセンサーデータは、フィードバック制御の不正確さにつながる可能性があります。
従来のモデルベースの方法は、そのようなバリエーションを考慮すると制限される場合があり、さまざまな種類の不確実性から脆弱になります。
これに対処する1つの方法は、Koopmanオペレーターを使用して学習した予測モデルを非線形モデル予測制御（NMPC）に活用することです。
このホワイトペーパーでは、確率的摂動を伴うアフィン入力システムを正確に予測できる揚げた双線形モデルを使用したNMPC問題の定式化と解決策を説明します。
システムの制約はKoopmanスペースで定義されていますが、最適化の問題は状態空間で解決され、計算の複雑さを減らします。
システムのKoopmanオペレーターを推定するためのトレーニングデータは、ランダム化制御入力を介して与えられます。
開発されたメソッドの出力により、障害物が入力された環境を閉めるナビゲーション制御が可能になります。
提案された方法の有効性は、添加剤の確率的速度摂動を備えた車輪付きロボット、現実的なデジタルツインロボットによるガゼボシミュレーション、および真のダイナミクスの知識なしに物理的なハードウェア実験を使用して数値シミュレーションを通じてテストされています。

要約(オリジナル)

Mobile robot navigation can be challenged by system uncertainty. For example, ground friction may vary abruptly causing slipping, and noisy sensor data can lead to inaccurate feedback control. Traditional model-based methods may be limited when considering such variations, making them fragile to varying types of uncertainty. One way to address this is by leveraging learned prediction models by means of the Koopman operator into nonlinear model predictive control (NMPC). This paper describes the formulation of, and provides the solution to, an NMPC problem using a lifted bilinear model that can accurately predict affine input systems with stochastic perturbations. System constraints are defined in the Koopman space, while the optimization problem is solved in the state space to reduce computational complexity. Training data to estimate the Koopman operator for the system are given via randomized control inputs. The output of the developed method enables closed-loop navigation control over environments populated with obstacles. The effectiveness of the proposed method has been tested through numerical simulations using a wheeled robot with additive stochastic velocity perturbations, Gazebo simulations with a realistic digital twin robot, and physical hardware experiments without knowledge of the true dynamics.

arxiv情報

著者	Xiaobin Zhang,Mohamed Karim Bouafoura,Lu Shi,Konstantinos Karydis
発行日	2025-04-29 22:49:48+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO, cs.SY, eess.SY | コメントを受け付けていません

PhysicsFC: Learning User-Controlled Skills for a Physics-Based Football Player Controller

投稿日: 2025年5月1日作成者: jarxiv

要約

物理的にシミュレートされたフットボール選手のキャラクターを制御して、さまざまなサッカースキルを実行する方法であるPhysicsFCを提案します。これは、これらのスキルをシームレスに移行しながら、ドリブル、トラップ、移動、キックベースのユーザー入力などです。
サッカースキルごとに潜在変数を生成するスキル固有のポリシーは、サッカーの動きを再現するための基礎として機能する既存の物理ベースのモーションエンミングモデルを使用して訓練されています。
主な機能には、ドリブルポリシーの調整された報酬設計、トラップポリシーの発射体ダイナミクスベースの初期化と組み合わせた2フェーズ報酬構造、および移動ポリシーのデータ包埋された目標条件付き潜在ガイダンス（DEGCL）メソッドが含まれます。
訓練されたスキルポリシーを使用して、提案されたフットボールプレーヤーの有限ステートマシン（PhysicsFC FSM）により、ユーザーはキャラクターをインタラクティブに制御できます。
FSMで定義されているスキルポリシー間のスムーズでアジャイルな移行を確保するために、各スキルポリシーのトレーニング中に適用されるスキルトランジションベースの初期化（STI）を導入します。
競争の激しいトラップとドリブル、ギブアンドゴープレイ、および複数の物理FCエージェントが自然で制御可能な物理ベースのフットボールプレーヤーの行動を生み出す11V11フットボールゲームを含む、物理FCの有効性を紹介するいくつかのインタラクティブなシナリオを開発します。
定量的評価は、提示されたメトリックと実験設計を使用して、個々のスキルポリシーのパフォーマンスとそれらの間の遷移をさらに検証します。

要約(オリジナル)

We propose PhysicsFC, a method for controlling physically simulated football player characters to perform a variety of football skills–such as dribbling, trapping, moving, and kicking–based on user input, while seamlessly transitioning between these skills. Our skill-specific policies, which generate latent variables for each football skill, are trained using an existing physics-based motion embedding model that serves as a foundation for reproducing football motions. Key features include a tailored reward design for the Dribble policy, a two-phase reward structure combined with projectile dynamics-based initialization for the Trap policy, and a Data-Embedded Goal-Conditioned Latent Guidance (DEGCL) method for the Move policy. Using the trained skill policies, the proposed football player finite state machine (PhysicsFC FSM) allows users to interactively control the character. To ensure smooth and agile transitions between skill policies, as defined in the FSM, we introduce the Skill Transition-Based Initialization (STI), which is applied during the training of each skill policy. We develop several interactive scenarios to showcase PhysicsFC’s effectiveness, including competitive trapping and dribbling, give-and-go plays, and 11v11 football games, where multiple PhysicsFC agents produce natural and controllable physics-based football player behaviors. Quantitative evaluations further validate the performance of individual skill policies and the transitions between them, using the presented metrics and experimental designs.

arxiv情報

著者	Minsu Kim,Eunho Jung,Yoonsang Lee
発行日	2025-04-29 22:58:12+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.GR, cs.RO | コメントを受け付けていません

Estimation-Aware Trajectory Optimization with Set-Valued Measurement Uncertainties

投稿日: 2025年5月1日作成者: jarxiv

要約

このホワイトペーパーでは、推定認識軌道を生成するための最適化ベースのフレームワークを提示します。
このセットアップでは、測定（出力）不確実性は状態依存性であり、セット値です。
包む楕円体は、未知の分布を伴う状態依存の不確実性を特徴付けるために採用されています。
次に、セット値の出力マップの規則性の概念が導入され、推定認識軌道生成問題の定式化が促進されます。
具体的には、出力正規マップでは、有限の地平線状態の軌跡に関して凹のセット値の観測可能性測定値を利用できることが実証されています。
この尺度を最大化することにより、幅広いクラスのシステムに対して推定認識軌道を合成できます。
この作業では、軌道計画ルーチンも検討されています。これにより、観測可能性測定は、局所的に線形化されたダイナミクスを備えたシステムに最適化されています。
提案されたアプローチの有効性を説明するために、ビジョンベースの推定を伴う軌道計画のコンテキストにおける代表的な例が提示されています。
さらに、この論文では、エゴサテライトがオンボード機械学習（ML）ベースの推定モジュールを使用してランデブーの軌跡を実現する非協力的なターゲットレンダズブー問題の推定値計画を提示します。

要約(オリジナル)

In this paper, an optimization-based framework for generating estimation-aware trajectories is presented. In this setup, measurement (output) uncertainties are state-dependent and set-valued. Enveloping ellipsoids are employed to characterize state-dependent uncertainties with unknown distributions. The concept of regularity for set-valued output maps is then introduced, facilitating the formulation of the estimation-aware trajectory generation problem. Specifically, it is demonstrated that for output-regular maps, one can utilize a set-valued observability measure that is concave with respect to the finite horizon state trajectories. By maximizing this measure, estimation-aware trajectories can then be synthesized for a broad class of systems. Trajectory planning routines are also examined in this work, by which the observability measure is optimized for systems with locally linearized dynamics. To illustrate the effectiveness of the proposed approach, representative examples in the context of trajectory planning with vision-based estimation are presented. Moreover, the paper presents estimation-aware planning for an uncooperative Target-Rendezvous problem, where an Ego-satellite employs an onboard machine learning (ML)-based estimation module to realize the rendezvous trajectory.

arxiv情報

著者	Aditya Deole,Mehran Mesbahi
発行日	2025-04-30 00:48:20+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO, cs.SY, eess.SY, math.OC | コメントを受け付けていません

Improving Grip Stability Using Passive Compliant Microspine Arrays for Soft Robots in Unstructured Terrain

投稿日: 2025年5月1日作成者: jarxiv

要約

マイクロピングリッパーは、asper延しているためにせん断力と牽引力を高めることにより、表面相互作用を強化する昆虫の脚によく見られる小さな棘です。
このようなマイクロスピンの配列は、ロボットの手足または覆面に統合された場合、不均一な地形を操縦し、傾斜を横断し、さらには壁を登る能力を提供します。
ソフトロボットの適合性と適応性により、複雑で非構造化された地形の横断を含むこれらのアプリケーションの理想的な候補になります。
ただし、マイクロスピンの効果的な統合を通じてグリップの安定性を改善することにより、制御されたラボ環境からフィールドへの移行に関連するソフトロコモーターには、実生活の実現ギャップが残っています。
私たちは、モバイルソフトロボットの移動能力を強化するために、パッシブで準拠したマイクロスピン積み重ねられたアレイ設計を提案します。私たちの場合、運動腱が作動するものです。
効果的なソフトに準拠した剛性統合を備えた標準化されたマイクロスピンアレイ統合方法を提供し、それらを受動的に制御する単一のアクチュエーターから生じる複雑さを減らします。
提示されたデザインは、より頻繁にアクティブな一番下の列の有効性を妨げない一方で、非常に急な/不規則な表面に追加のグリップ機能を提供する2列の積み重ねられたマイクロピンアレイ構成を利用します。
マイクロスピンアレイのさまざまな構成を調査して、表面トポロジの変化を説明し、マイクロスピンあたりのアスペリティの独立した適応性のある把持を可能にします。
コンクリート、レンガ、コンパクトサンド、樹木の根を含むさまざまな粗い表面でフィールドテスト実験は、マイクロスピンアレイの異なる組み合わせを持つ2つのロボットと比較したマイクロスピンのないベースラインで構成される3つのロボットで構成される3つのロボットを含む樹木の根で行われます。
追跡結果は、ミクロピンアレイを含めると平均的な変位が平均して15倍および8倍増加することを示しています。

要約(オリジナル)

Microspine grippers are small spines commonly found on insect legs that reinforce surface interaction by engaging with asperities to increase shear force and traction. An array of such microspines, when integrated into the limbs or undercarriage of a robot, can provide the ability to maneuver uneven terrains, traverse inclines, and even climb walls. Conformability and adaptability of soft robots makes them ideal candidates for these applications involving traversal of complex, unstructured terrains. However, there remains a real-life realization gap for soft locomotors pertaining to their transition from controlled lab environment to the field by improving grip stability through effective integration of microspines. We propose a passive, compliant microspine stacked array design to enhance the locomotion capabilities of mobile soft robots, in our case, ones that are motor tendon actuated. We offer a standardized microspine array integration method with effective soft-compliant stiffness integration, and reduced complexity resulting from a single actuator passively controlling them. The presented design utilizes a two-row, stacked microspine array configuration that offers additional gripping capabilities on extremely steep/irregular surfaces from the top row while not hindering the effectiveness of the more frequently active bottom row. We explore different configurations of the microspine array to account for changing surface topologies and enable independent, adaptable gripping of asperities per microspine. Field test experiments are conducted on various rough surfaces including concrete, brick, compact sand, and tree roots with three robots consisting of a baseline without microspines compared against two robots with different combinations of microspine arrays. Tracking results indicate that the inclusion of microspine arrays increases planar displacement on average by 15 and 8 times.

arxiv情報

著者	Lauren Ervin,Harish Bezawada,Vishesh Vikas
発行日	2025-04-30 03:00:55+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO | コメントを受け付けていません

Fast Online Adaptive Neural MPC via Meta-Learning

投稿日: 2025年5月1日作成者: jarxiv

要約

データ駆動型モデル予測制御（MPC）は、モデルの不確実性の存在下でロボット制御性能を改善する重要な可能性を実証しています。
ただし、既存のアプローチでは、広範なオフラインデータ収集と計算集中トレーニングが必要であることが多く、オンラインで適応する能力が制限されます。
これらの課題に対処するために、このペーパーでは、モデルに依存しないメタラーニング（MAML）と統合されたニューラルネットワークを活用する高速オンライン適応MPCフレームワークを紹介します。
私たちのアプローチは、最小限のオンラインデータとグラデーションステップを使用して、公称システムの動作と真のシステム動作の矛盾を捉える残留ダイナミクスの少数のショット適応に焦点を当てています。
これらのメタ学習された残差モデルを計算効率の良いL4CasadiベースのMPCパイプラインに埋め込むことにより、提案された方法により、迅速なモデル補正が可能になり、予測精度が向上し、リアルタイムの制御パフォーマンスが向上します。
ファンデルポールオシレーター、カートポールシステム、および2D四角体に関するシミュレーション研究を通じてフレームワークを検証します。
結果は、新たに初期化されたニューラルネットワークで拡張された、名目MPCと公称MPCの両方にわたって適応速度と予測精度が大幅に向上し、リアルタイム適応ロボット制御のアプローチの有効性を強調しています。

要約(オリジナル)

Data-driven model predictive control (MPC) has demonstrated significant potential for improving robot control performance in the presence of model uncertainties. However, existing approaches often require extensive offline data collection and computationally intensive training, limiting their ability to adapt online. To address these challenges, this paper presents a fast online adaptive MPC framework that leverages neural networks integrated with Model-Agnostic Meta-Learning (MAML). Our approach focuses on few-shot adaptation of residual dynamics – capturing the discrepancy between nominal and true system behavior – using minimal online data and gradient steps. By embedding these meta-learned residual models into a computationally efficient L4CasADi-based MPC pipeline, the proposed method enables rapid model correction, enhances predictive accuracy, and improves real-time control performance. We validate the framework through simulation studies on a Van der Pol oscillator, a Cart-Pole system, and a 2D quadrotor. Results show significant gains in adaptation speed and prediction accuracy over both nominal MPC and nominal MPC augmented with a freshly initialized neural network, underscoring the effectiveness of our approach for real-time adaptive robot control.

arxiv情報

著者	Yu Mei,Xinyu Zhou,Shuyang Yu,Vaibhav Srivastava,Xiaobo Tan
発行日	2025-04-30 04:17:22+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO, cs.SY, eess.SY | コメントを受け付けていません

CMD: Constraining Multimodal Distribution for Domain Adaptation in Stereo Matching

投稿日: 2025年5月1日作成者: jarxiv

要約

最近、学習ベースのステレオマッチング方法により、パブリックベンチマークが大幅に改善されました。ソフトアラッグとスムーズなL1損失が成功に貢献しています。
ただし、監視されていないドメイン適応シナリオでは、これら2つの操作がターゲットドメインにマルチモーダル格差確率分布をもたらし、一般化が劣化することが多いことがわかります。
この論文では、この問題に対処するために、新しいアプローチ、マルチモーダル分布（CMD）を制約することを提案します。
具体的には、\ textit {不確実性の正規化}および\ textit {anisotropic soft argmin}を導入して、ネットワークがターゲットドメインで主に非モーダルな格差分布を生成し、それによって予測精度を向上させます。
実験的に、提案された方法を複数の代表的なステレオマッチングネットワークに適用し、合成データから非標識現実世界シーンにドメイン適応を実施します。
結果は、トップパフォーマンスとドメインに適応可能なステレオマッチングモデルの両方で、一般化の改善を一貫して示しています。
CMDのコードは、\ href {https://github.com/gallenszl/cmd} {https://github.com/gallenszl/cmd}で利用できます。

要約(オリジナル)

Recently, learning-based stereo matching methods have achieved great improvement in public benchmarks, where soft argmin and smooth L1 loss play a core contribution to their success. However, in unsupervised domain adaptation scenarios, we observe that these two operations often yield multimodal disparity probability distributions in target domains, resulting in degraded generalization. In this paper, we propose a novel approach, Constrain Multi-modal Distribution (CMD), to address this issue. Specifically, we introduce \textit{uncertainty-regularized minimization} and \textit{anisotropic soft argmin} to encourage the network to produce predominantly unimodal disparity distributions in the target domain, thereby improving prediction accuracy. Experimentally, we apply the proposed method to multiple representative stereo-matching networks and conduct domain adaptation from synthetic data to unlabeled real-world scenes. Results consistently demonstrate improved generalization in both top-performing and domain-adaptable stereo-matching models. The code for CMD will be available at: \href{https://github.com/gallenszl/CMD}{https://github.com/gallenszl/CMD}.

arxiv情報

著者	Zhelun Shen,Zhuo Li,Chenming Wu,Zhibo Rao,Lina Liu,Yuchao Dai,Liangjun Zhang
発行日	2025-04-30 04:23:48+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント