jarxiv | Japanese arxiv | ページ 1622

A novel step-by-step procedure for the kinematic calibration of robots using a single draw-wire encoder

投稿日: 2025年2月24日作成者: jarxiv

要約

ロボットの位置決めの精度は、高精度の製造タスクを実行する際の重要な工場です。
多くの場合、再現性に近い値までのマニピュレーターの精度を効果的に改善するために、キャリブレーションは重要な役割を果たします。
文献では、ロボットキャリブレーションへのさまざまなアプローチが提案されており、使用される測定システムと識別アルゴリズムのタイプにかなりの範囲があります。
私たちの目的は、ドローワイヤエンコーダーを介して取得した1D距離測定データのみを使用するパラメーターがその後1つずつ推定される新しいステップバイステップの運動学的キャリブレーション手順を開発することでした。
この目的を追求するために、測定された距離と予測された距離の間の矛盾がその未知のパラメーターにのみ依存するキャリブレーションポイントのセットを見つけるために、分析的アプローチを導き出しました。
これにより、識別プロセスの計算負担が軽減され、その精度が潜在的に改善されます。
シミュレーションと実験テストは、6度のフリードムロボットアームで実施されました。結果により、提案された戦略の妥当性が確認されました。
その結果、提案されている段階的なキャリブレーションアプローチは、実用的で費用対効果が高く、標準のキャリブレーションアプローチに代わる要求の少ない代替品を表し、ロボットキャリブレーションをよりアクセスしやすく、実行しやすくします。

要約(オリジナル)

Robot positioning accuracy is a key factory when performing high-precision manufacturing tasks. To effectively improve the accuracy of a manipulator, often up to a value close to its repeatability, calibration plays a crucial role. In the literature, various approaches to robot calibration have been proposed, and they range considerably in the type of measurement system and identification algorithm used. Our aim was to develop a novel step-by-step kinematic calibration procedure – where the parameters are subsequently estimated one at a time – that only uses 1D distance measurement data obtained through a draw-wire encoder. To pursue this objective, we derived an analytical approach to find, for each unknown parameter, a set of calibration points where the discrepancy between the measured and predicted distances only depends on that unknown parameter. This reduces the computational burden of the identification process while potentially improving its accuracy. Simulations and experimental tests were carried out on a 6 degrees-of-freedom robot arm: the results confirmed the validity of the proposed strategy. As a result, the proposed step-by-step calibration approach represents a practical, cost-effective and computationally less demanding alternative to standard calibration approaches, making robot calibration more accessible and easier to perform.

arxiv情報

著者	Giovanni Boschetti,Teresa Sinico
発行日	2025-02-20 19:20:31+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO | コメントを受け付けていません

Ultra-High-Frequency Harmony: mmWave Radar and Event Camera Orchestrate Accurate Drone Landing

投稿日: 2025年2月24日作成者: jarxiv

要約

正確で効率的で安全なドローン着陸のために、地上プラットフォームはリアルタイムで、下降ドローンを正確に見つけて、指定された場所に導く必要があります。
MMWave Sensingとカメラの組み合わせにより、ローカリゼーションの精度が向上しますが、MMWaveレーダーと比較して従来のフレームカメラのサンプリング周波数が低いと、システムスループットにボトルネックが作成されます。
この作業では、従来のフレームカメラをイベントカメラに置き換えます。イベントカメラは、グラウンドプラットフォームのセットアップ内のMMWaveレーダーとサンプリング周波数で調和する新しいセンサーであり、MME-LOCを導入します。
ドローン着陸。
これらのモダリティ間の\ textIT {時間的一貫性}および\ textIT {空間相補性}を完全に活用するために、2つの革新的なモジュール、\ textIT {一貫性にインストールされたコラボレーショントラッキング}と\ textIT {グラフインフォーム化された適応協力最適化}を提案します。
ドローン測定抽出と効率的なセンサー融合。
大手ドローン配信会社の着陸シナリオにおける広範な実世界の実験は、MME-LOCがローカリゼーションの精度とレイテンシの両方で最先端の方法を上回ることを示しています。

要約(オリジナル)

For precise, efficient, and safe drone landings, ground platforms should real-time, accurately locate descending drones and guide them to designated spots. While mmWave sensing combined with cameras improves localization accuracy, the lower sampling frequency of traditional frame cameras compared to mmWave radar creates bottlenecks in system throughput. In this work, we replace the traditional frame camera with event camera, a novel sensor that harmonizes in sampling frequency with mmWave radar within the ground platform setup, and introduce mmE-Loc, a high-precision, low-latency ground localization system designed for drone landings. To fully leverage the \textit{temporal consistency} and \textit{spatial complementarity} between these modalities, we propose two innovative modules, \textit{consistency-instructed collaborative tracking} and \textit{graph-informed adaptive joint optimization}, for accurate drone measurement extraction and efficient sensor fusion. Extensive real-world experiments in landing scenarios from a leading drone delivery company demonstrate that mmE-Loc outperforms state-of-the-art methods in both localization accuracy and latency.

arxiv情報

著者	Haoyang Wang,Jingao Xu,Xinyu Luo,Xuecheng Chen,Ting Zhang,Ruiyang Duan,Yunhao Liu,Xinlei Chen
発行日	2025-02-20 19:29:08+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

Geometric Freeze-Tag Problem

投稿日: 2025年2月24日作成者: jarxiv

要約

Arkinらによって導入されたフリーズタグ問題（FTP）を研究します。
（Soda’02）、目標は、単一のアクティブロボットから始まる$ n $ロボットのグループを目覚めさせることです。
私たちの焦点は、ロボットが$ \ mathbb {r}^d $に配置され、アクティブになると、ロボットが一定の速度で移動して他の人を目覚めさせることができる問題の幾何学版に焦点を当てています。
目的は、Makespanとしても知られる最後のロボットをアクティブにするのにかかる時間を最小限に抑えることです。
$ l_1 $および$ l_2 $の$ \ mathbb {r}^2 $および$ \ mathbb {r}^3 $の新しい上限を提示します。
$（\ mathbb {r}^2、l_2）$の場合、最大$ 5.4162R $のメイクスパンを達成し、Bonichon et al。
（disc’24）。
$（\ mathbb {r}^3、l_1）$で、$ 13R $の上限を確立します。
ここで、$ r $は、指定された標準の下で最初にアクティブなロボットからのロボットの最大距離を示します。
私たちの知る限り、これらはこれらの規範の下で$ \ mathbb {r}^3 $のMakepanの最初の既知の境界です。
また、ロボットが境界上に配置されている特定のインスタンスについて、$（\ mathbb {r}^3、l_2）$でftpを探索し、実際のシナリオに関するさらなる洞察を提供します。

要約(オリジナル)

We study the Freeze-Tag Problem (FTP), introduced by Arkin et al. (SODA’02), where the goal is to wake up a group of $n$ robots, starting from a single active robot. Our focus is on the geometric version of the problem, where robots are positioned in $\mathbb{R}^d$, and once activated, a robot can move at a constant speed to wake up others. The objective is to minimize the time it takes to activate the last robot, also known as the makespan. We present new upper bounds for the $l_1$ and $l_2$ norms in $\mathbb{R}^2$ and $\mathbb{R}^3$. For $(\mathbb{R}^2, l_2)$, we achieve a makespan of at most $5.4162r$, improving on the previous bound of $7.07r$ by Bonichon et al. (DISC’24). In $(\mathbb{R}^3, l_1)$, we establish an upper bound of $13r$, which leads to a bound of $22.52r$ for $(\mathbb{R}^3, l_2)$. Here, $r$ denotes the maximum distance of a robot from the initially active robot under the given norm. To the best of our knowledge, these are the first known bounds for the makespan in $\mathbb{R}^3$ under these norms. We also explore the FTP in $(\mathbb{R}^3, l_2)$ for specific instances where robots are positioned on a boundary, providing further insights into practical scenarios.

arxiv情報

著者	Sharareh Alipour,Kajal Baghestani,Mahdis Mirzaei,Soroush Sahraei
発行日	2025-02-20 19:46:37+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.DC, cs.RO | コメントを受け付けていません

A Dual-Motor Actuator for Ceiling Robots with High Force and High Speed Capabilities

投稿日: 2025年2月24日作成者: jarxiv

要約

患者移動装置により、病院や介護センターで患者を受動的に動かすことができます。
患者を巻き上げる代わりに、患者の動きを支援することが有益であり、自分で動くことができます。
ただし、患者の支援には、患者移転のみに使用されるものよりも大幅に高い速度で出力力を正確に制御できるデバイスが必要であり、単一のモーターソリューションはオーバーサイズであり、両方の機能を行う効率が悪いことを示します。
このホワイトペーパーでは、患者の移動装置に適応したデュアルモーターアクチュエータとコントロールスキームを紹介します。これは、患者の移転、患者の動きを支援し、転倒を防ぐために使用できる患者のモビリティ機器です。
このプロトタイプは、体重318 kgの患者を持ち上げて、7.8％の精度で最大100 kgの望ましい力で患者を支援できることが示されています。
また、転倒を管理するためのスマートコントロールスキームは、望ましい減速を適用することで落ちている患者を止めることができることが示されています。

要約(オリジナル)

Patient transfer devices allow to move patients passively in hospitals and care centers. Instead of hoisting the patient, it would be beneficial in some cases to assist their movement, enabling them to move by themselves. However, patient assistance requires devices capable of precisely controlling output forces at significantly higher speeds than those used for patient transfers alone, and a single motor solution would be over-sized and show poor efficiency to do both functions. This paper presents a dual-motor actuator and control schemes adapted for a patient mobility equipment that can be used to transfer patients, assist patient in their movement, and help prevent falls. The prototype is shown to be able to lift patients weighing up to 318 kg, to assist a patient with a desired force of up to 100 kg with a precision of 7.8%. Also, a smart control scheme to manage falls is shown to be able to stop a patient who is falling by applying a desired deceleration.

arxiv情報

著者	Ian Lalonde,Jeff Denis,Mathieu Lamy,Alexandre Girard
発行日	2025-02-20 19:50:47+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO, cs.SY, eess.SY | コメントを受け付けていません

Safe Beyond the Horizon: Efficient Sampling-based MPC with Neural Control Barrier Functions

投稿日: 2025年2月24日作成者: jarxiv

要約

実際にモデル予測制御（MPC）を使用する場合の一般的な問題は、予測の地平線を超えた安全仕様の満足度です。
理論的研究では、適切な端子セットの制約または十分に長い予測期間を実施することで安全性を保証できることが示されていますが、これらの手法は適用が困難であり、特に一般的な非線形ダイナミクスの場合には実務家が使用することはめったにありません。
この問題を解決するために、おおよその離散時間制御バリア関数を学習し、変異推論MPC（VIMPC）に組み込むことにより、「ブラックボックス」ダイナミクスへの正確な再帰的実現可能性、計算の軌道、および適用性の間にトレードオフを課します。
サンプリングベースのMPCパラダイム。
結果の状態の制約を処理するために、推定最適制御の分散を大幅に削減し、サンプル効率を改善し、CPUのリアルタイム計画を可能にする新しいサンプリング戦略をさらに提案します。
結果として得られるニューラルシールドVIMPC（NS-VIMPC）コントローラーは、既存のサンプリングベースのMPCコントローラーと比較して、既存のサンプリングベースのMPCコントローラーと比較して、大幅な安全性の向上をもたらします。
シミュレーションと現実世界のハードウェア実験の両方でアプローチを検証します。

要約(オリジナル)

A common problem when using model predictive control (MPC) in practice is the satisfaction of safety specifications beyond the prediction horizon. While theoretical works have shown that safety can be guaranteed by enforcing a suitable terminal set constraint or a sufficiently long prediction horizon, these techniques are difficult to apply and thus are rarely used by practitioners, especially in the case of general nonlinear dynamics. To solve this problem, we impose a tradeoff between exact recursive feasibility, computational tractability, and applicability to ”black-box” dynamics by learning an approximate discrete-time control barrier function and incorporating it into a variational inference MPC (VIMPC), a sampling-based MPC paradigm. To handle the resulting state constraints, we further propose a new sampling strategy that greatly reduces the variance of the estimated optimal control, improving the sample efficiency, and enabling real-time planning on a CPU. The resulting Neural Shield-VIMPC (NS-VIMPC) controller yields substantial safety improvements compared to existing sampling-based MPC controllers, even under badly designed cost functions. We validate our approach in both simulation and real-world hardware experiments.

arxiv情報

著者	Ji Yin,Oswin So,Eric Yang Yu,Chuchu Fan,Panagiotis Tsiotras
発行日	2025-02-20 19:59:11+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.AI, cs.RO, cs.SY, eess.SY | コメントを受け付けていません

DEFT: Differentiable Branched Discrete Elastic Rods for Modeling Furcated DLOs in Real-Time

投稿日: 2025年2月24日作成者: jarxiv

要約

自律ワイヤーハーネスアセンブリには、高精度と信頼性で複雑な分岐ケーブルを操作するためにロボットが必要です。
このプロセスを自動化する上での重要な課題は、これらの柔軟な構造と分岐構造が操作中にどのように振る舞うかを予測することです。
正確な予測がなければ、ロボットが組み立て操作を確実に計画または実行することは困難です。
既存の研究により、単一スレッドの変形可能な線形オブジェクト（DLOS）のモデリングが進歩しましたが、これらのアプローチを分岐した変形可能な線形オブジェクト（BDLOS）に拡張することは、基本的な課題を提示します。
BDLOSのジャンクションポイントは、複数のシングルドロモデルを接続するだけでは適切にキャプチャできない複雑な力の相互作用とひずみ伝播パターンを作成します。
これらの課題に対処するために、このペーパーでは、差別化可能な物理学ベースのモデルと学習フレームワークを組み合わせた新しいフレームワークである、リアルタイム（deft）で覆われたDLOをモデル化するための微分可能な離散分岐弾性ロッドを示します。
ジャンクションポイントでの動的伝播とBDLOの真ん中で把握し、2）リアルタイム推論のための効率的な計算を達成し、3）計画を器用の実証できるようにする
BDLO操作。
包括的な一連の実世界の実験は、最先端の代替案と比較した精度、計算速度、および一般化可能性の観点からDeftの有効性を示しています。
プロジェクトページ：https：//roahmlab.github.io/deft/。

要約(オリジナル)

Autonomous wire harness assembly requires robots to manipulate complex branched cables with high precision and reliability. A key challenge in automating this process is predicting how these flexible and branched structures behave under manipulation. Without accurate predictions, it is difficult for robots to reliably plan or execute assembly operations. While existing research has made progress in modeling single-threaded Deformable Linear Objects (DLOs), extending these approaches to Branched Deformable Linear Objects (BDLOs) presents fundamental challenges. The junction points in BDLOs create complex force interactions and strain propagation patterns that cannot be adequately captured by simply connecting multiple single-DLO models. To address these challenges, this paper presents Differentiable discrete branched Elastic rods for modeling Furcated DLOs in real-Time (DEFT), a novel framework that combines a differentiable physics-based model with a learning framework to: 1) accurately model BDLO dynamics, including dynamic propagation at junction points and grasping in the middle of a BDLO, 2) achieve efficient computation for real-time inference, and 3) enable planning to demonstrate dexterous BDLO manipulation. A comprehensive series of real-world experiments demonstrates DEFT’s efficacy in terms of accuracy, computational speed, and generalizability compared to state-of-the-art alternatives. Project page:https://roahmlab.github.io/DEFT/.

arxiv情報

著者	Yizhou Chen,Xiaoyue Wu,Yeheng Zong,Anran Li,Yuzhen Chen,Julie Wu,Bohao Zhang,Ram Vasudevan
発行日	2025-02-20 20:46:09+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.AI, cs.GR, cs.RO | コメントを受け付けていません

DDAT: Diffusion Policies Enforcing Dynamically Admissible Robot Trajectories

投稿日: 2025年2月24日作成者: jarxiv

要約

拡散モデルは、マルチモーダルの生成機能のおかげで、画像とビデオの作成に優れています。
これらの同じ機能により、拡散モデルはロボット研究でますます人気があり、ロボットの動きを生成するために使用されています。
ただし、拡散モデルの確率的性質は、ロボットの実現可能な動きを表す正確な動的方程式と基本的に容易になっています。
したがって、動的に許容可能なロボット軌跡を生成することは、拡散モデルにとって課題です。
この問題を軽減するために、DDAT：拡散ポリシーを動的に許容できる軌跡のための拡散ポリシーを導入して、拡散モデルを使用してブラックボックスロボットシステムの証拠的に許容可能な軌跡を生成します。
シーケンスの各状態がロボットの運動方程式によってその前身の到達可能なセットに属している場合、一連の状態は動的に容認できる軌跡です。
このような軌跡を生成するために、私たちの拡散ポリシーは、トレーニングと推論の両方で、それらの予測を動的に許容可能なマニホールドに投影し、除生ニューラルネットワークの目的を動的な容認性の制約に合わせます。
これらの投影の自己回帰的性質とロボットダイナミクスのブラックボックスの性質により、これらの投影は非常に困難になります。
したがって、このプロセスを予測された後継者と繰り返す前に、予測された後継者を投影する状態に到達可能な到達可能なセットのポリトピックの過少順位を反復的にサンプリングすることにより、容認性を実施します。
正確な軌跡を生成することにより、この投影により、拡散モデルが継続的に再描画する必要性がなくなり、ワンショットの長期軌道計画が可能になります。
私たちのフレームワークは、クアッドコプターとさまざまなムホコ環境での広範なシミュレーションを通じて、単位のGO1およびGO2での実際の実験とともに、高品質の動的に許容可能なロボット軌道を生成することを実証します。

要約(オリジナル)

Diffusion models excel at creating images and videos thanks to their multimodal generative capabilities. These same capabilities have made diffusion models increasingly popular in robotics research, where they are used for generating robot motion. However, the stochastic nature of diffusion models is fundamentally at odds with the precise dynamical equations describing the feasible motion of robots. Hence, generating dynamically admissible robot trajectories is a challenge for diffusion models. To alleviate this issue, we introduce DDAT: Diffusion policies for Dynamically Admissible Trajectories to generate provably admissible trajectories of black-box robotic systems using diffusion models. A sequence of states is a dynamically admissible trajectory if each state of the sequence belongs to the reachable set of its predecessor by the robot’s equations of motion. To generate such trajectories, our diffusion policies project their predictions onto a dynamically admissible manifold during both training and inference to align the objective of the denoiser neural network with the dynamical admissibility constraint. The auto-regressive nature of these projections along with the black-box nature of robot dynamics render these projections immensely challenging. We thus enforce admissibility by iteratively sampling a polytopic under-approximation of the reachable set of a state onto which we project its predicted successor, before iterating this process with the projected successor. By producing accurate trajectories, this projection eliminates the need for diffusion models to continually replan, enabling one-shot long-horizon trajectory planning. We demonstrate that our framework generates higher quality dynamically admissible robot trajectories through extensive simulations on a quadcopter and various MuJoCo environments, along with real-world experiments on a Unitree GO1 and GO2.

arxiv情報

著者	Jean-Baptiste Bouvier,Kanghyun Ryu,Kartik Nagpal,Qiayuan Liao,Koushil Sreenath,Negar Mehr
発行日	2025-02-20 21:00:21+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO, cs.SY, eess.SY | コメントを受け付けていません

Learning Dynamics of a Ball with Differentiable Factor Graph and Roto-Translational Invariant Representations

投稿日: 2025年2月24日作成者: jarxiv

要約

動的環境のロボットには、アジャイル計画をサポートするために、オブジェクトが環境でどのように移動するかの高速で正確なモデルが必要です。
Ping Pongなどのスポーツでは、分析モデルがしばしば、複雑な空力、弾力性のある行動、およびスライド摩擦のモデリングとローリング摩擦のモデリングの課題により、スピンでボール軌道を正確に予測するのに苦労しています。
一方、データ駆動型の方法の約束にもかかわらず、機械学習は、正確な入力なしで正確で一貫した予測を行うのに苦労しています。
この論文では、ダイナミクスモデルと因子グラフ推定器を共同でトレーニングできるエンドツーエンドの学習フレームワークを提案します。
当社のアプローチは、グラムシュミット（GS）プロセスを活用して、ロトロ翻訳不変表現を抽出してモデルのパフォーマンスを改善し、データ増強法と比較して検証エラーをさらに減らすことができます。
さらに、レイヤー接続で自己培養バイパスを使用して非線形性を高めるネットワークアーキテクチャを提案します。
これらの新しい方法を活用することにより、提案されたアプローチは、最初のバウンス後の頂点で37.2 mmのパドル半径、2回目のバウンスの71.5 mmのRMSEでボールの位置を予測します。

要約(オリジナル)

Robots in dynamic environments need fast, accurate models of how objects move in their environments to support agile planning. In sports such as ping pong, analytical models often struggle to accurately predict ball trajectories with spins due to complex aerodynamics, elastic behaviors, and the challenges of modeling sliding and rolling friction. On the other hand, despite the promise of data-driven methods, machine learning struggles to make accurate, consistent predictions without precise input. In this paper, we propose an end-to-end learning framework that can jointly train a dynamics model and a factor graph estimator. Our approach leverages a Gram-Schmidt (GS) process to extract roto-translational invariant representations to improve the model performance, which can further reduce the validation error compared to data augmentation method. Additionally, we propose a network architecture that enhances nonlinearity by using self-multiplicative bypasses in the layer connections. By leveraging these novel methods, our proposed approach predicts the ball’s position with an RMSE of 37.2 mm of the paddle radius at the apex after the first bounce, and 71.5 mm after the second bounce.

arxiv情報

著者	Qingyu Xiao,Zixuan Wu,Matthew Gombolay
発行日	2025-02-20 21:24:47+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO | コメントを受け付けていません

A Synergistic Framework for Learning Shape Estimation and Shape-Aware Whole-Body Control Policy for Continuum Robots

投稿日: 2025年2月24日作成者: jarxiv

要約

この論文では、形状の推定を学習するための新しい相乗的フレームワークと、腱駆動型の連続体ロボットの形状に対応する全身制御ポリシーを紹介します。
私たちのアプローチは、連続形状推定と形状認識制御を実現するために、2つの拡張された神経通常の微分方程式（アノード）（形状ノードとコントロールノード）の相互作用を活用します。
Shape-NodeはCosserat Rod理論からの事前知識を統合し、モデルの不一致を適応して説明できるようにしますが、コントロールノードはこの形状情報を使用して、モデル予測制御（MPC）ファッションで訓練された全身制御ポリシーを最適化します。
。
この統一されたフレームワークは、貧弱な形状の認識や複雑な非線形ダイナミクスをキャプチャする際の課題など、既存のデータ駆動型メソッドの制限を効果的に克服します。
シミュレーションと現実世界の両方の環境での広範な評価は、形状推定、軌跡追跡、障害物の回避におけるフレームワークの堅牢なパフォーマンスを示しています。
提案された方法は、特に追跡精度と一般化能力の観点から、最先端のエンドツーエンド、ニューラル、および再発性ニューラルネットワーク（RNN）モデルよりも一貫して優れています。

要約(オリジナル)

In this paper, we present a novel synergistic framework for learning shape estimation and a shape-aware whole-body control policy for tendon-driven continuum robots. Our approach leverages the interaction between two Augmented Neural Ordinary Differential Equations (ANODEs) — the Shape-NODE and Control-NODE — to achieve continuous shape estimation and shape-aware control. The Shape-NODE integrates prior knowledge from Cosserat rod theory, allowing it to adapt and account for model mismatches, while the Control-NODE uses this shape information to optimize a whole-body control policy, trained in a Model Predictive Control (MPC) fashion. This unified framework effectively overcomes limitations of existing data-driven methods, such as poor shape awareness and challenges in capturing complex nonlinear dynamics. Extensive evaluations in both simulation and real-world environments demonstrate the framework’s robust performance in shape estimation, trajectory tracking, and obstacle avoidance. The proposed method consistently outperforms state-of-the-art end-to-end, Neural-ODE, and Recurrent Neural Network (RNN) models, particularly in terms of tracking accuracy and generalization capabilities.

arxiv情報

著者	Mohammadreza Kasaei,Farshid Alambeigi,Mohsen Khadem
発行日	2025-02-20 21:27:38+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO | コメントを受け付けていません

Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios

投稿日: 2025年2月24日作成者: jarxiv

要約

自律運転システムを進める上で重要な要素はシミュレーションです。
しかし、仮想世界と現実の世界の間には、移動性にはかなり小さな進歩があります。
Lidar Point Cloudsでの3Dオブジェクト検出のこの問題を再検討し、Carlaシミュレーターに基づいてデータセット生成パイプラインを提案します。
ドメインランダム化戦略と慎重なモデリングを利用して、合成データでオブジェクト検出器をトレーニングし、Kittiデータセットに強力な一般化機能を実証することができます。
さらに、さまざまな仮想センサーバリアントを比較して、洞察を収集します。センサー属性は、一般的なドメインギャップに責任を負います。
最後に、実際のデータのごく一部で微調整すると、ベースラインとほぼ一致し、完全なトレーニングセットがわずかに上回ります。

要約(オリジナル)

An important factor in advancing autonomous driving systems is simulation. Yet, there is rather small progress for transferability between the virtual and real world. We revisit this problem for 3D object detection on LiDAR point clouds and propose a dataset generation pipeline based on the CARLA simulator. Utilizing domain randomization strategies and careful modeling, we are able to train an object detector on the synthetic data and demonstrate strong generalization capabilities to the KITTI dataset. Furthermore, we compare different virtual sensor variants to gather insights, which sensor attributes can be responsible for the prevalent domain gap. Finally, fine-tuning with a small portion of real data almost matches the baseline and with the full training set slightly surpasses it.

arxiv情報

著者	Richard Marcus,Christian Vogel,Inga Jatzkowski,Niklas Knoop,Marc Stamminger
発行日	2025-02-20 22:27:42+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.CV, cs.RO | コメントを受け付けていません

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント