An End-to-End Framework of Road User Detection, Tracking, and Prediction from Monocular Images

要約

複数の物体の検出と追跡を伴う認識と、軌道予測は、自動運転の 2 つの主要なタスクです。
ただし、現在、これらはほとんどが個別に研究されており、その結果、ほとんどの軌道予測モジュールは、現実世界のシナリオで検出モジュールと追跡モジュールから抽出された軌道にノイズが多いことを考慮せずに、グラウンドトゥルース軌道に基づいて開発されています。
これらのノイズの多い軌道は、軌道予測器のパフォーマンスに重大な影響を与える可能性があり、重大な予測エラーにつながる可能性があります。
この論文では、ODTP (Online Detection, Tracking and Prediction) と呼ばれる、検出、追跡、および軌道予測のためのエンドツーエンドのフレームワークを構築します。
最先端のオンラインマルチオブジェクト追跡モデル QD-3DT を知覚に採用し、純粋にグラウンドトゥルース軌道に依存せずに、検出結果に基づいて軌道予測器 DCENet++ を直接トレーニングします。
自動運転用に広く使用されている nuScenes データセットで ODTP のパフォーマンスを評価します。
広範な実験により、ODPT が高性能のエンドツーエンドの軌道予測を実現することが示されています。
DCENet++ は、強化されたダイナミックマップを備えており、基本モデルよりも正確な軌道を予測します。
また、ノイズの多い検出結果に基づいてトレーニングされた他の生成的および決定論的軌道予測モデルと比較した場合にも、より堅牢です。

要約(オリジナル)

Perception that involves multi-object detection and tracking, and trajectory prediction are two major tasks of autonomous driving. However, they are currently mostly studied separately, which results in most trajectory prediction modules being developed based on ground truth trajectories without taking into account that trajectories extracted from the detection and tracking modules in real-world scenarios are noisy. These noisy trajectories can have a significant impact on the performance of the trajectory predictor and can lead to serious prediction errors. In this paper, we build an end-to-end framework for detection, tracking, and trajectory prediction called ODTP (Online Detection, Tracking and Prediction). It adopts the state-of-the-art online multi-object tracking model, QD-3DT, for perception and trains the trajectory predictor, DCENet++, directly based on the detection results without purely relying on ground truth trajectories. We evaluate the performance of ODTP on the widely used nuScenes dataset for autonomous driving. Extensive experiments show that ODPT achieves high performance end-to-end trajectory prediction. DCENet++, with the enhanced dynamic maps, predicts more accurate trajectories than its base model. It is also more robust when compared with other generative and deterministic trajectory prediction models trained on noisy detection results.

arxiv情報

著者	Hao Cheng,Mengmeng liu,Lin Chen
発行日	2023-08-09 15:46:25+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

An End-to-End Framework of Road User Detection, Tracking, and Prediction from Monocular Images

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー