jarxiv | Japanese arxiv | ページ 358

Embodied intelligent industrial robotics: Concepts and techniques

投稿日: 2025年5月26日作成者: jarxiv

要約

近年、具体化されたインテリジェントロボット（EIR）は、マルチモーダル認識、自律的な意思決定、および身体的相互作用において大幅に進歩しています。
一部のロボットは、家やショッピングモールなどの汎用シナリオですでにテストされています。
ただし、EIRは現在、産業環境のセマンティクスと産業操作オブジェクト間の規範的制約を深く理解していません。
この論文の目標は、産業シナリオで具体化された知性の研究と応用を進めることです。
このペーパーでは、最初に産業用ロボット工学の歴史と主流のEIRフレームワークをレビューします。
ここでは、具体化されたインテリジェント産業用ロボット（EIIR）の概念が策定され、産業環境向けの知識主導のEIIR技術的枠組みが提案されています。
フレームワークには、世界モデル、高レベルのタスクプランナー、低レベルのスキルコントローラー、シミュレーターの4つの主要なモジュールが含まれます。
各モジュールに関連する手法の開発も徹底的に議論されており、産業用途への適応に関する最近の進歩が強調されています。
最後に、EIIRが産業シナリオで遭遇する重要な課題が要約され、将来の研究方向が提案されています。
著者は、EIIRテクノロジーが次世代の産業用ロボット工学を形作っていると考えています。
EIIRベースの産業システムには、インテリジェントな製造を可能にする強力な可能性があります。
このレビューは、産業具体化された情報に関心のある学者やエンジニアにとって貴重な参照として役立つことが期待されています。
一緒に、学者はこの研究を使用して、EIIR技術の急速な進歩と適用を促進することができます。
著者は、プロジェクトページhttps://github.com/jackyzengl/eiirで新しい研究を追跡し、要約し続けます。

要約(オリジナル)

In recent years, embodied intelligent robotics (EIR) advances significantly in multi-modal perception, autonomous decision-making, and physical interaction. Some robots have already been tested in general-purpose scenarios, such as homes and shopping malls. However, EIR currently lacks a deep understanding of the semantics of industrial environments and the normative constraints between industrial operating objects. The goal of this paper is to advance the research and application of embodied intelligence in industrial scenarios. This paper first reviews the history of industrial robotics and the mainstream EIR frameworks. Herein, the concept of embodied intelligent industrial robotics (EIIR) is formulated and a knowledge-driven EIIR technical framework for industrial environments is proposed. The framework includes four primary modules: a world model, a high-level task planner, a low-level skill controller, and a simulator. The development of techniques related to each module are also thoroughly discussed, and recent progress regarding their adaption to industrial applications is emphasized. Finally, the key challenges that EIIR encounters in industrial scenarios are summarized and future research directions are suggested. The authors believe that EIIR technology is shaping the next generation of industrial robotics. EIIR-based industrial systems have strong potential to enable intelligent manufacturing. It is expected that this review could serve as a valuable reference for scholars and engineers that are interested in industrial embodied intelligence. Together, scholars can use this research to drive the rapid advancement and application of EIIR techniques. The authors would continue to track and summarize new studies in the project page https://github.com/jackyzengl/EIIR.

arxiv情報

著者	Chaoran Zhang,Chenhao Zhang,Zhaobo Xu,Qinghongbing Xie,Pingfa Feng,Long Zeng
発行日	2025-05-23 09:38:03+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO | コメントを受け付けていません

A Bio-mimetic Neuromorphic Model for Heat-evoked Nociceptive Withdrawal Reflex in Upper Limb

投稿日: 2025年5月26日作成者: jarxiv

要約

侵害受容性離脱反射（NWR）は、相互作用を媒介し、潜在的に危険な環境での損傷から身体を保護するメカニズムです。
補綴腕や自律ロボットのユーザーに警告信号をより適切に伝え、適切なNWRをトリガーすることでそれらを保護するために、高速かつ効果的な処理のために温度情報の生物学的表現を使用することが役立ちます。
この作業では、反射アークの構造とエンコードスキームを模倣することにより、熱誘発NWRの神経形態スパイクネットワークを提示します。
ネットワークは、バイオプリュース可能な報酬変調されたスパイクのタイミング依存の可塑性学習アルゴリズムでトレーニングされています。
放射熱の実験でNWRをトリガーする最近の研究で提案されたモデルと他の3つの方法を評価しました。
神経形態モデルのみが、人間と同様の空間的合計（SS）効果と時間的合計（TS）効果を示すことがわかりました。
この神経モデルモデルの生物的透過性の向上は、神経補綴物の感覚フィードバックを改善する可能性があります。

要約(オリジナル)

The nociceptive withdrawal reflex (NWR) is a mechanism to mediate interactions and protect the body from damage in a potentially dangerous environment. To better convey warning signals to users of prosthetic arms or autonomous robots and protect them by triggering a proper NWR, it is useful to use a biological representation of temperature information for fast and effective processing. In this work, we present a neuromorphic spiking network for heat-evoked NWR by mimicking the structure and encoding scheme of the reflex arc. The network is trained with the bio-plausible reward modulated spike timing-dependent plasticity learning algorithm. We evaluated the proposed model and three other methods in recent studies that trigger NWR in an experiment with radiant heat. We found that only the neuromorphic model exhibits the spatial summation (SS) effect and temporal summation (TS) effect similar to humans and can encode the reflex strength matching the intensity of the stimulus in the relative spike latency online. The improved bio-plausibility of this neuromorphic model could improve sensory feedback in neural prostheses.

arxiv情報

著者	Fengyi Wang,J. Rogelio Guadarrama Olvera,Nitish Thako,Gordon Cheng
発行日	2025-05-23 10:41:20+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.NE, cs.RO | コメントを受け付けていません

Object Classification Utilizing Neuromorphic Proprioceptive Signals in Active Exploration: Validated on a Soft Anthropomorphic Hand

投稿日: 2025年5月26日作成者: jarxiv

要約

触覚の知覚における重要な感覚モダリティである固有受容は、身体部分の位置と動きに関するフィードバックを提供することにより、オブジェクトの3D構造を知覚する上で重要な役割を果たします。
固有受容感の回復は、補綴手での手の操作と自然な制御を可能にするために重要です。
その重要性にもかかわらず、固有受容感は人工システムで比較的未開拓です。
この作業では、柔らかい擬人化ロボットハンド（QBソフランド）を柔軟な固有受容センサーと統合する新しいプラットフォームと、さまざまなタイプのスパイクニューロンを使用して、生物学的筋肉の筋膜モデルに囲まれたニューロンのスパイクニューロンを使用してニューロンを使用してニューロンを使用して解釈する分類器を紹介します。
エンコードスキームと分類器は、YCBベンチマークから10個のオブジェクトのアクティブな探索で収集したデータセットで実装およびテストされます。
我々の結果は、特に探査の初期段階で、分類器が既存の学習アプローチよりも正確な推論を達成することを示しています。
このシステムは、触覚フィードバックと神経補綴の分野での発達の可能性を秘めています。

要約(オリジナル)

Proprioception, a key sensory modality in haptic perception, plays a vital role in perceiving the 3D structure of objects by providing feedback on the position and movement of body parts. The restoration of proprioceptive sensation is crucial for enabling in-hand manipulation and natural control in the prosthetic hand. Despite its importance, proprioceptive sensation is relatively unexplored in an artificial system. In this work, we introduce a novel platform that integrates a soft anthropomorphic robot hand (QB SoftHand) with flexible proprioceptive sensors and a classifier that utilizes a hybrid spiking neural network with different types of spiking neurons to interpret neuromorphic proprioceptive signals encoded by a biological muscle spindle model. The encoding scheme and the classifier are implemented and tested on the datasets we collected in the active exploration of ten objects from the YCB benchmark. Our results indicate that the classifier achieves more accurate inferences than existing learning approaches, especially in the early stage of the exploration. This system holds the potential for development in the areas of haptic feedback and neural prosthetics.

arxiv情報

著者	Fengyi Wang,Xiangyu Fu,Nitish Thakor,Gordon Cheng
発行日	2025-05-23 11:02:34+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.NE, cs.RO | コメントを受け付けていません

Feasible Action Space Reduction for Quantifying Causal Responsibility in Continuous Spatial Interactions

投稿日: 2025年5月26日作成者: jarxiv

要約

あるエージェントが別のエージェントに及ぼす因果的影響を理解することは、自動化された車両やモバイルロボットなどの人工的にインテリジェントなシステムを人間に居住した環境に安全に展開するために重要です。
因果関係の既存のモデルは、個別のアクションを備えたシナリオの単純化された抽象化に対処し、したがって、空間的相互作用の責任を理解する際に実際の使用を制限します。
空間的に相互作用するエージェントがシーンに埋め込まれ、各瞬間にアクションに従う必要があるという仮定に基づいて、実行可能なアクション空間削減（恐怖）は、個別のアクションを備えたグリッドワールド設定で因果関係の責任のメトリックとして提案されました。
実際の相互作用には連続的なアクションスペースが含まれるため、このホワイトペーパーでは、空間的相互作用における因果責任を測定するための恐怖メトリックの定式化を提案します。
原型の宇宙共有競合におけるメトリックの有用性を説明し、後方の責任を分析し、エージェントの意思決定を導く将来の見通し責任を推定するためのアプリケーションを紹介します。
私たちの結果は、人工剤を設計および工学するための恐怖指標の可能性、および人間の周りのエージェントの責任を評価するための可能性を強調しています。

要約(オリジナル)

Understanding the causal influence of one agent on another agent is crucial for safely deploying artificially intelligent systems such as automated vehicles and mobile robots into human-inhabited environments. Existing models of causal responsibility deal with simplified abstractions of scenarios with discrete actions, thus, limiting real-world use when understanding responsibility in spatial interactions. Based on the assumption that spatially interacting agents are embedded in a scene and must follow an action at each instant, Feasible Action-Space Reduction (FeAR) was proposed as a metric for causal responsibility in a grid-world setting with discrete actions. Since real-world interactions involve continuous action spaces, this paper proposes a formulation of the FeAR metric for measuring causal responsibility in space-continuous interactions. We illustrate the utility of the metric in prototypical space-sharing conflicts, and showcase its applications for analysing backward-looking responsibility and in estimating forward-looking responsibility to guide agent decision making. Our results highlight the potential of the FeAR metric for designing and engineering artificial agents, as well as for assessing the responsibility of agents around humans.

arxiv情報

著者	Ashwin George,Luciano Cavalcante Siebert,David A. Abbink,Arkady Zgonnikov
発行日	2025-05-23 11:02:44+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.CY, cs.HC, cs.MA, cs.RO | コメントを受け付けていません

Is Single-View Mesh Reconstruction Ready for Robotics?

投稿日: 2025年5月26日作成者: jarxiv

要約

このペーパーでは、ロボット操作でデジタルツイン環境を作成するためのシングルビューメッシュ再構成モデルを評価します。
単一の視点からの3D再構成のコンピュータービジョンの最近の進歩は、ロボット工学コンテキストのための物理環境の仮想レプリカを効率的に作成するための潜在的なブレークスルーを示しています。
ただし、物理シミュレーションとロボット工学アプリケーションへの適合性は未調査のままです。
典型的な入力の処理、衝突のない安定した再構成の生成、閉塞の管理、計算上の制約の満たすことを含む、ロボットのコンテキストでの3D再構成のベンチマーク基準を確立します。
現実的なロボット工学データセットを使用した当社の経験的評価は、コンピュータービジョンベンチマークでの成功にもかかわらず、既存のアプローチがロボット工学固有の要件を満たしていないことを示しています。
マルチビューアプローチに焦点を当てた以前の作業とは対照的に、実用的なロボット工学の実装のためのシングルビュー再構築の制限を定量的に調べます。
私たちの調査結果は、コンピュータービジョンの進歩とロボット工学のニーズの間の重要なギャップを強調し、この交差点で将来の研究を導きます。

要約(オリジナル)

This paper evaluates single-view mesh reconstruction models for creating digital twin environments in robot manipulation. Recent advances in computer vision for 3D reconstruction from single viewpoints present a potential breakthrough for efficiently creating virtual replicas of physical environments for robotics contexts. However, their suitability for physics simulations and robotics applications remains unexplored. We establish benchmarking criteria for 3D reconstruction in robotics contexts, including handling typical inputs, producing collision-free and stable reconstructions, managing occlusions, and meeting computational constraints. Our empirical evaluation using realistic robotics datasets shows that despite success on computer vision benchmarks, existing approaches fail to meet robotics-specific requirements. We quantitively examine limitations of single-view reconstruction for practical robotics implementation, in contrast to prior work that focuses on multi-view approaches. Our findings highlight critical gaps between computer vision advances and robotics needs, guiding future research at this intersection.

arxiv情報

著者	Frederik Nolte,Bernhard Schölkopf,Ingmar Posner
発行日	2025-05-23 14:35:56+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.CV, cs.RO, I.2.10 | コメントを受け付けていません

Classification of assembly tasks combining multiple primitive actions using Transformers and xLSTMs

投稿日: 2025年5月26日作成者: jarxiv

要約

人間が成績を尽くしたアセンブリタスクの分類は、安全性を確保し、ロボットアクションを予測し、ロボット学習を促進するために、共同ロボット工学に不可欠です。
ただし、タスクをより小さなプリミティブアクションにセグメント化することが実行不可能である場合、信頼できる分類を達成することは困難であり、複数の原始アクションを含む長いアセンブリタスクを分類する必要があります。
この研究では、ハンドランドマーク座標に基づいて長いアセンブリシーケンシャルタスクを分類し、2つの定評のある分類器、LSTMとトランス、および最近のモデルXLSTMのパフォーマンスを比較します。
CTベンチマークで提案されているHRCシナリオを使用しました。これには、挿入、ネジの留め具、スナップフィッティングなどのアクションを組み合わせた長いアセンブリタスクが含まれます。
テストは、トレーニングシーケンスを実行したヒトオペレーターと3人の新しいオペレーターの両方から収集されたシーケンスを使用して実施されました。
LSTM、トランス、およびXLSTMモデルの実質パッドシーケンスのテスト結果は、トレーニングオペレーターで72.9％、95.0％、93.2％、新しいオペレーターでそれぞれ43.5％、54.3％、60.8％でした。
LSTMモデルは、他の2つのアプローチと比較して明らかにパフォーマンスが低下しています。
予想通り、XLSTMモデルは新しいオペレーターにより良い一般化能力を示しましたが、変圧器とXLSTMの両方がトレーニングされたオペレーターの満足のいく結果を達成しました。
結果は、このタイプの分類で、XLSTMモデルがトランスに対してわずかなエッジを提供することを明確に示しています。

要約(オリジナル)

The classification of human-performed assembly tasks is essential in collaborative robotics to ensure safety, anticipate robot actions, and facilitate robot learning. However, achieving reliable classification is challenging when segmenting tasks into smaller primitive actions is unfeasible, requiring us to classify long assembly tasks that encompass multiple primitive actions. In this study, we propose classifying long assembly sequential tasks based on hand landmark coordinates and compare the performance of two well-established classifiers, LSTM and Transformer, as well as a recent model, xLSTM. We used the HRC scenario proposed in the CT benchmark, which includes long assembly tasks that combine actions such as insertions, screw fastenings, and snap fittings. Testing was conducted using sequences gathered from both the human operator who performed the training sequences and three new operators. The testing results of real-padded sequences for the LSTM, Transformer, and xLSTM models was 72.9%, 95.0% and 93.2% for the training operator, and 43.5%, 54.3% and 60.8% for the new operators, respectively. The LSTM model clearly underperformed compared to the other two approaches. As expected, both the Transformer and xLSTM achieved satisfactory results for the operator they were trained on, though the xLSTM model demonstrated better generalization capabilities to new operators. The results clearly show that for this type of classification, the xLSTM model offers a slight edge over Transformers.

arxiv情報

著者	Miguel Neves,Pedro Neto
発行日	2025-05-23 15:14:32+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO | コメントを受け付けていません

Survival of the fastest — algorithm-guided evolution of light-powered underwater microrobots

投稿日: 2025年5月26日作成者: jarxiv

要約

複数のパラメーターに応じて、ソフトロボットは数値的にモデル化するのが難しいさまざまな移動モードを示すことができます。
その結果、複数のエアロおよび流体力学的プロセスが動きに影響を与える場合、特に低いレイノルズ数を特徴とする小規模システムでは、パフォーマンスの向上は複雑です。
この作業では、実験結果（測定された水泳速度）を2つの進化アルゴリズムのフィットネス関数として適用することにより、光力発電ミリメートルの水中スイマーの移動を最適化します：粒子群最適化と遺伝的アルゴリズム。
さまざまな特性（表現型）を備えたこれらの柔らかく駆動型のロボットを迅速に製造できるため、多くの競合するロボットを使用して連続した世代の水泳速度を改善するための最適化実験のための優れたプラットフォームを提供します。
興味深いことに、自然の進化と同様に、予期しない遺伝子の組み合わせは、速度の8倍の増加や自己展開の水中運動モードの発見など、驚くほど良い結果をもたらしました。

要約(オリジナル)

Depending on multiple parameters, soft robots can exhibit different modes of locomotion that are difficult to model numerically. As a result, improving their performance is complex, especially in small-scale systems characterized by low Reynolds numbers, when multiple aero- and hydrodynamical processes influence their movement. In this work, we optimize light-powered millimetre-scale underwater swimmer locomotion by applying experimental results – measured swimming speed – as the fitness function in two evolutionary algorithms: particle swarm optimization and genetic algorithm. As these soft, light-powered robots with different characteristics (phenotypes) can be fabricated quickly, they provide a great platform for optimisation experiments, using many competing robots to improve swimming speed over consecutive generations. Interestingly, just like in natural evolution, unexpected gene combinations led to surprisingly good results, including eight-fold increase in speed or the discovery of a self-oscillating underwater locomotion mode.

arxiv情報

著者	Mikołaj Rogóż,Zofia Dziekan,Piotr Wasylczyk
発行日	2025-05-23 15:58:30+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cond-mat.mtrl-sci, cs.RO | コメントを受け付けていません

ERPoT: Effective and Reliable Pose Tracking for Mobile Robots Using Lightweight Polygon Maps

投稿日: 2025年5月26日作成者: jarxiv

要約

このペーパーでは、革新的な以前のポリゴンマップによって支えられた、大規模な屋外で挑戦的な屋内環境で動作するモバイルロボット向けに、ERPOTと呼ばれる効果的で信頼性の高いポーズ追跡ソリューションを紹介します。
特に、マップサイズが環境の拡大に伴い成長するにつれて発生する課題を克服するために、複数のポリゴンで構成される以前のマップの新しい形式が提案されています。
環境占有率を簡潔かつ正確に描写するためにポリゴンの使用の恩恵を受けて、以前のポリゴンマップは、長期的に信頼できるポーズ追跡を実現し、コンパクトな形を確保します。
さらに重要なことに、ポーズトラッキングは純粋なライダーモードで実行され、密な3Dポイントクラウドは、地面の除去と障害物の選択により、スパース2Dスキャンに変換されます。
これに基づいて、ポイントポリゴンマッチングによるポーズ推定のための新しいコスト関数が導入され、ポイントツーエッジとポイントツーエッジの2つの異なる制約フォームが含まれています。
この研究では、私たちの主な焦点は、軽量でコンパクトな以前のマップ構築と、効果的で信頼できるロボットのポーズ追跡という2つの重要な側面にあります。
どちらの側面も、さまざまな環境にさまざまなライダーセンサーを備えた多様なモバイルプラットフォームを介して、将来のナビゲーションの基礎柱として機能します。
公開されているデータセットと自己録画データセットに基づく比較実験が行われ、評価結果は、他の6つのアプローチで信頼性、以前のマップサイズ、ポーズ推定誤差、ランタイムに関するERPOTの優れたパフォーマンスを示しています。
対応するコードはhttps://github.com/ghm0819/erpotでアクセスでき、補足ビデオはhttps://youtu.be/cseml5frw1qにあります。

要約(オリジナル)

This paper presents an effective and reliable pose tracking solution, termed ERPoT, for mobile robots operating in large-scale outdoor and challenging indoor environments, underpinned by an innovative prior polygon map. Especially, to overcome the challenge that arises as the map size grows with the expansion of the environment, the novel form of a prior map composed of multiple polygons is proposed. Benefiting from the use of polygons to concisely and accurately depict environmental occupancy, the prior polygon map achieves long-term reliable pose tracking while ensuring a compact form. More importantly, pose tracking is carried out under pure LiDAR mode, and the dense 3D point cloud is transformed into a sparse 2D scan through ground removal and obstacle selection. On this basis, a novel cost function for pose estimation through point-polygon matching is introduced, encompassing two distinct constraint forms: point-to-vertex and point-to-edge. In this study, our primary focus lies on two crucial aspects: lightweight and compact prior map construction, as well as effective and reliable robot pose tracking. Both aspects serve as the foundational pillars for future navigation across diverse mobile platforms equipped with different LiDAR sensors in varied environments. Comparative experiments based on the publicly available datasets and our self-recorded datasets are conducted, and evaluation results show the superior performance of ERPoT on reliability, prior map size, pose estimation error, and runtime over the other six approaches. The corresponding code can be accessed at https://github.com/ghm0819/ERPoT, and the supplementary video is at https://youtu.be/cseml5FrW1Q.

arxiv情報

著者	Haiming Gao,Qibo Qiu,Hongyan Liu,Dingkun Liang,Chaoqun Wang,Xuebo Zhang
発行日	2025-05-23 16:22:17+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.RO, I.2.9 | コメントを受け付けていません

Rotational Multi-material 3D Printing of Soft Robotic Matter with Asymmetrical Embedded Pneumatics

投稿日: 2025年5月26日作成者: jarxiv

要約

ソフトロボットの問題の迅速な設計と製造は、形状のモーフィング、作動、およびウェアラブルデバイスに興味を高めることです。
ここでは、プログラム可能な形状のモーフィング挙動を示す埋め込まれた空気圧を備えたソフトロボット素材を作成するための容易な製造方法を報告します。
回転マルチマテリアル3D印刷を使用して、エラストマーシェルと逃亡コアで構成される非対称コアシェルフィラメントは、1Dおよび2Dモチーフでパターン化されています。
ノズルの設計、回転速度、プリントパスを正確に制御することにより、各印刷されたフィラメントに沿ったパターン化された逃亡コアの局所的な方向、形状、および断面積を制御できます。
エラストマーマトリックスが硬化すると、逃亡中のコアが除去され、空気圧の作動を促進する埋め込まれた導管が残ります。
接続されたFermat Spirals Pathingアプローチを使用して、手で触発されたグリッパーなどのより複雑なソフトロボットに必要な希望の印刷パスを自動的に生成できます。
統合されたデザインおよび印刷アプローチにより、無数の形状のモーフィング遷移をオンデマンドで示すソフトロボットの問題を迅速に構築できます。

要約(オリジナル)

The rapid design and fabrication of soft robotic matter is of growing interest for shape morphing, actuation, and wearable devices. Here, we report a facile fabrication method for creating soft robotic materials with embedded pneumatics that exhibit programmable shape morphing behavior. Using rotational multi-material 3D printing, asymmetrical core-shell filaments composed of elastomeric shells and fugitive cores are patterned in 1D and 2D motifs. By precisely controlling the nozzle design, rotation rate, and print path, one can control the local orientation, shape, and cross-sectional area of the patterned fugitive core along each printed filament. Once the elastomeric matrix is cured, the fugitive cores are removed, leaving behind embedded conduits that facilitate pneumatic actuation. Using a connected Fermat spirals pathing approach, one can automatically generate desired print paths required for more complex soft robots, such as hand-inspired grippers. Our integrated design and printing approach enables one to rapidly build soft robotic matter that exhibits myriad shape morphing transitions on demand.

arxiv情報

著者	Jackson K. Wilt,Natalie M. Larson,Jennifer A. Lewis
発行日	2025-05-23 16:48:15+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cond-mat.soft, cs.RO | コメントを受け付けていません

Revisiting Adversarial Perception Attacks and Defense Methods on Autonomous Driving Systems

投稿日: 2025年5月26日作成者: jarxiv

要約

自律運転システム（ADS）は、敵対的な攻撃に対して脆弱なままである深い学習ベースの知覚モデルにますます依存しています。
この論文では、敵対的な攻撃と防御方法を再検討し、道路標識の認識と鉛オブジェクトの検出と予測（相対距離など）に焦点を当てています。
レベル2の生産広告、Comma $。$ AIによるOpenPilot、および広く採用されているYoloモデルを使用して、敵対的な摂動の影響を体系的に調べ、敵対的訓練、画像処理、対照学習、拡散モデルなどの防御技術を評価します。
私たちの実験は、複雑な攻撃の緩和におけるこれらの方法の長所と制限の両方を強調しています。
モデルの堅牢性のターゲット評価を通じて、ADS認識システムの脆弱性についてより深い洞察を提供し、より回復力のある防衛戦略を開発するためのガイダンスを提供することを目指しています。

要約(オリジナル)

Autonomous driving systems (ADS) increasingly rely on deep learning-based perception models, which remain vulnerable to adversarial attacks. In this paper, we revisit adversarial attacks and defense methods, focusing on road sign recognition and lead object detection and prediction (e.g., relative distance). Using a Level-2 production ADS, OpenPilot by Comma$.$ai, and the widely adopted YOLO model, we systematically examine the impact of adversarial perturbations and assess defense techniques, including adversarial training, image processing, contrastive learning, and diffusion models. Our experiments highlight both the strengths and limitations of these methods in mitigating complex attacks. Through targeted evaluations of model robustness, we aim to provide deeper insights into the vulnerabilities of ADS perception systems and contribute guidance for developing more resilient defense strategies.

arxiv情報

著者	Cheng Chen,Yuhong Wang,Nafis S Munir,Xiangwei Zhou,Xugui Zhou
発行日	2025-05-23 17:44:21+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.CR, cs.RO | コメントを受け付けていません

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント