SDTagNet: Leveraging Text-Annotated Navigation Maps for Online HD Map Construction

要約

自動運転車は、安全に動作するために詳細かつ正確な環境情報に依存しています。
高解像度（HD）マップは有望なソリューションを提供しますが、メンテナンスコストの高いコストは、スケーラブルな展開に対する大きな障壁をもたらします。
この課題は、ライブセンサーデータからローカルHDマップを生成するオンラインHDマップ構築方法によって対処されています。
ただし、これらの方法は、オンボードセンサーの短い知覚範囲によって本質的に制限されています。
この制限を克服し、一般的なパフォーマンスを向上させるために、最近のアプローチでは、以前の標準定義（SD）マップの使用を検討しました。
openstreetMapなどの広く利用可能なSDマップの情報を完全に利用して、遠い範囲検出精度を強化する最初のオンラインHDマップ構築方法であるSDTagnetを提案します。
私たちのアプローチでは、2つの重要なイノベーションを紹介します。
まず、以前の作業とは対照的に、手動で選択されたクラスを持つポリラインSDマップデータだけでなく、テキストアノテーションの形で追加のセマンティック情報を組み込みます。
このようにして、SDベクトルマップトークンをNLP由来の機能で濃縮し、事前定義された仕様または徹底的なクラスの分類法への依存度を排除します。
次に、すべてのタイプのマップ要素を均一に統合するために、直交要素識別子とともにポイントレベルのSDマップエンコーダーを導入します。
Argoverse 2およびNuscenesの実験は、これがマップ認識パフォーマンスを最大+5.9マップ（ +45％）W.R.T.
プライアーのないマップ構造と+3.2マップ（ +20％）W.R.T。
すでにSDマッププライアーを使用している以前のアプローチ。
コードはhttps://github.com/immel-f/sdtagnetで入手できます

要約(オリジナル)

Autonomous vehicles rely on detailed and accurate environmental information to operate safely. High definition (HD) maps offer a promising solution, but their high maintenance cost poses a significant barrier to scalable deployment. This challenge is addressed by online HD map construction methods, which generate local HD maps from live sensor data. However, these methods are inherently limited by the short perception range of onboard sensors. To overcome this limitation and improve general performance, recent approaches have explored the use of standard definition (SD) maps as prior, which are significantly easier to maintain. We propose SDTagNet, the first online HD map construction method that fully utilizes the information of widely available SD maps, like OpenStreetMap, to enhance far range detection accuracy. Our approach introduces two key innovations. First, in contrast to previous work, we incorporate not only polyline SD map data with manually selected classes, but additional semantic information in the form of textual annotations. In this way, we enrich SD vector map tokens with NLP-derived features, eliminating the dependency on predefined specifications or exhaustive class taxonomies. Second, we introduce a point-level SD map encoder together with orthogonal element identifiers to uniformly integrate all types of map elements. Experiments on Argoverse 2 and nuScenes show that this boosts map perception performance by up to +5.9 mAP (+45%) w.r.t. map construction without priors and up to +3.2 mAP (+20%) w.r.t. previous approaches that already use SD map priors. Code is available at https://github.com/immel-f/SDTagNet

arxiv情報

著者	Fabian Immel,Jan-Hendrik Pauls,Richard Fehler,Frank Bieder,Jonas Merkert,Christoph Stiller
発行日	2025-06-10 17:16:00+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

SDTagNet: Leveraging Text-Annotated Navigation Maps for Online HD Map Construction

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー