Vision-based Lifting of 2D Object Detections for Automated Driving

要約

画像ベースの3Dオブジェクトの検出は、ほとんどの最新の車で安価なオンボードカメラがすでに利用可能であるため、自律運転の必然的な部分です。
正確な深さ情報のため、現在、ほとんどの最先端の3Dオブジェクト検出器は、LIDARデータに大きく依存しています。
この論文では、既存のビジョンベースの2Dアルゴリズムの結果を、LIDARの費用対効果の高い代替品としてカメラのみを使用して3D検出に持ち上げるパイプラインを提案します。
既存のアプローチとは対照的に、私たちは車だけでなく、あらゆるタイプの道路利用者に焦点を当てています。
私たちの知る限り、私たちは2D CNNを使用して、各2D検出のポイントクラウドを処理して、計算の取り組みを可能な限り低く保つ最初のものです。
挑戦的なKitti 3Dオブジェクト検出ベンチマークに関する私たちの評価は、ランタイムが3分の1しかない間、最先端の画像ベースのアプローチに匹敵する結果を示しています。

要約(オリジナル)

Image-based 3D object detection is an inevitable part of autonomous driving because cheap onboard cameras are already available in most modern cars. Because of the accurate depth information, currently, most state-of-the-art 3D object detectors heavily rely on LiDAR data. In this paper, we propose a pipeline which lifts the results of existing vision-based 2D algorithms to 3D detections using only cameras as a cost-effective alternative to LiDAR. In contrast to existing approaches, we focus not only on cars but on all types of road users. To the best of our knowledge, we are the first using a 2D CNN to process the point cloud for each 2D detection to keep the computational effort as low as possible. Our evaluation on the challenging KITTI 3D object detection benchmark shows results comparable to state-of-the-art image-based approaches while having a runtime of only a third.

arxiv情報

著者	Hendrik Königshof,Kun Li,Christoph Stiller
発行日	2025-06-13 14:40:12+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Vision-based Lifting of 2D Object Detections for Automated Driving

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー