BEVLoc: Cross-View Localization and Matching via Birds-Eye-View Synthesis

要約

屋外ロボット工学において、特に GPS が存在しない、または信頼性が低い場合、地上と空中のマッチングは重要かつ困難な作業です。
建物や大きく密集した森林などの構造物は干渉を引き起こすため、全地球測位推定には GNSS の置き換えが必要になります。
本当の難しさは、許容可能な位置特定を実現するために、地上画像と航空画像の遠近感の違いを調整することにあります。
自動運転コミュニティからインスピレーションを得て、オフロード環境で航空地図と照合して位置特定するための鳥瞰図 (BEV) シーン表現を合成するための新しいフレームワークを提案します。
ドメイン固有のハードネガティブマイニングによる対照学習を活用して、合成された BEV と航空地図の間の類似した表現を学習するようにネットワークをトレーニングします。
推論中、BEVLoc は、粗いものから細かいものへのマッチング戦略を通じて、航空地図内で最も可能性の高い場所の特定をガイドします。
私たちの結果は、意味の多様性が限られた非常に困難な森林環境における有望な初期結果を示しています。
私たちはモデルの粗いマッチングと細かいマッチングのパフォーマンスを分析し、モデルの生のマッチング能力と GNSS の代替としてのパフォーマンスの両方を評価します。
私たちの仕事は、オフロードマップのローカリゼーションを掘り下げながら、ローカリゼーションの将来の開発のための基礎的なベースラインを確立します。
コードは https://github.com/rpl-cmu/bevloc から入手できます。

要約(オリジナル)

Ground to aerial matching is a crucial and challenging task in outdoor robotics, particularly when GPS is absent or unreliable. Structures like buildings or large dense forests create interference, requiring GNSS replacements for global positioning estimates. The true difficulty lies in reconciling the perspective difference between the ground and air images for acceptable localization. Taking inspiration from the autonomous driving community, we propose a novel framework for synthesizing a birds-eye-view (BEV) scene representation to match and localize against an aerial map in off-road environments. We leverage contrastive learning with domain specific hard negative mining to train a network to learn similar representations between the synthesized BEV and the aerial map. During inference, BEVLoc guides the identification of the most probable locations within the aerial map through a coarse-to-fine matching strategy. Our results demonstrate promising initial outcomes in extremely difficult forest environments with limited semantic diversity. We analyze our model’s performance for coarse and fine matching, assessing both the raw matching capability of our model and its performance as a GNSS replacement. Our work delves into off-road map localization while establishing a foundational baseline for future developments in localization. Our code is available at: https://github.com/rpl-cmu/bevloc

arxiv情報

著者	Christopher Klammer,Michael Kaess
発行日	2024-10-08 22:45:53+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

BEVLoc: Cross-View Localization and Matching via Birds-Eye-View Synthesis

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー