On-Road Object Importance Estimation: A New Dataset and A Model with Multi-Fold Top-Down Guidance

要約

この論文は、ドライバーの視点からキャプチャされたビデオシーケンスを入力として利用する、道路上の物体の重要度の推定の問題に取り組みます。
この問題は、より安全でスマートな運転システムにとって重要ですが、この問題の調査は依然として限られています。
一方で、コミュニティでは公開されている大規模なデータセットが不足しています。
このジレンマに対処するために、この論文では Traffic Object Importance (TOI) という名前の新しい大規模データセットを提供します。
一方、既存の方法では、ボトムアップ機能または単一分割ガイダンスのいずれかのみが考慮されることが多く、非常に動的で多様なトラフィックシナリオの処理に制限が生じます。
既存の方法とは異なり、この論文では、複数のトップダウンガイダンスとボトムアップ機能を統合するモデルを提案します。
具体的には、3 種類のトップダウンの誘導要素 (ドライバーの意図、意味論的コンテキスト、交通ルール) がモデルに統合されています。
これらの要素はオブジェクトの重要度を推定するために重要ですが、既存の方法ではこれらを同時に考慮するものはありません。
私たちの知る限り、この論文は、多重トップダウン誘導要素とボトムアップ機能を融合する最初の路上物体の重要度推定モデルを提案しています。
広範な実験により、私たちのモデルが最先端の手法を大幅に上回り、最近提案されたモデル (つまり、目標) と比較して 23.1% の平均精度 (AP) の向上を達成したことが実証されました。

要約(オリジナル)

This paper addresses the problem of on-road object importance estimation, which utilizes video sequences captured from the driver’s perspective as the input. Although this problem is significant for safer and smarter driving systems, the exploration of this problem remains limited. On one hand, publicly-available large-scale datasets are scarce in the community. To address this dilemma, this paper contributes a new large-scale dataset named Traffic Object Importance (TOI). On the other hand, existing methods often only consider either bottom-up feature or single-fold guidance, leading to limitations in handling highly dynamic and diverse traffic scenarios. Different from existing methods, this paper proposes a model that integrates multi-fold top-down guidance with the bottom-up feature. Specifically, three kinds of top-down guidance factors (ie, driver intention, semantic context, and traffic rule) are integrated into our model. These factors are important for object importance estimation, but none of the existing methods simultaneously consider them. To our knowledge, this paper proposes the first on-road object importance estimation model that fuses multi-fold top-down guidance factors with bottom-up feature. Extensive experiments demonstrate that our model outperforms state-of-the-art methods by large margins, achieving 23.1% Average Precision (AP) improvement compared with the recently proposed model (ie, Goal).

arxiv情報

著者	Zhixiong Nan,Yilong Chen,Tianfei Zhou,Tao Xiang
発行日	2024-11-26 06:37:10+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

On-Road Object Importance Estimation: A New Dataset and A Model with Multi-Fold Top-Down Guidance

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー