A Framework for Learning Scoring Rules in Autonomous Driving Planning Systems

要約

自律駆動システムでは、モーション計画は一般に2段階のプロセスとして実装されます。まず、軌道提案者が複数の候補の軌跡を生成し、スコアリングメカニズムが実行に最も適した軌跡を選択します。
この重要な選択段階では、ルールベースのスコアリングメカニズムは、形式化された人間的理解可能な形式で運転の好み、安全上の制約、および交通規制を明示的にエンコードできるため、特に魅力的です。
ただし、これらのスコアリングルールを手動で作成するには、重要な課題があります。ルールには、多くの場合、複雑な相互依存関係が含まれ、慎重なパラメーターチューニングが必要であり、実際の駆動データに存在するニュアンスを完全にキャプチャできない場合があります。
この作業では、時間論的論理で表される解釈可能なスコアリングルールを学ぶことでこのギャップを埋める新しいフレームワークであるFloraを紹介します。
私たちの方法は、多様な運転シナリオ全体で微妙な関係を捉えた学習可能なロジック構造を特徴としており、Nuplanで収集された現実世界の運転デモからルールとパラメーターの両方を最適化します。
トレーニングデータには肯定的な例のみが含まれているにもかかわらず、私たちのアプローチは運転行動を評価することを効果的に学びます（運転デモンストレーションの成功）。
閉ループ計画シミュレーションの評価は、学習したスコアリングルールが、解釈可能性を維持しながら、専門家が設計したルールやニューラルネットワークスコアリングモデルを含む既存の手法を上回ることを示しています。
この作業では、さまざまな軌道提案者とシームレスに統合するためのプラグインモジュールとして設計された自律運転システムのスコアリングメカニズムを強化するためのデータ駆動型のアプローチを紹介します。
私たちのビデオとコードはXiong.zikang.me/floraで入手できます。

要約(オリジナル)

In autonomous driving systems, motion planning is commonly implemented as a two-stage process: first, a trajectory proposer generates multiple candidate trajectories, then a scoring mechanism selects the most suitable trajectory for execution. For this critical selection stage, rule-based scoring mechanisms are particularly appealing as they can explicitly encode driving preferences, safety constraints, and traffic regulations in a formalized, human-understandable format. However, manually crafting these scoring rules presents significant challenges: the rules often contain complex interdependencies, require careful parameter tuning, and may not fully capture the nuances present in real-world driving data. This work introduces FLoRA, a novel framework that bridges this gap by learning interpretable scoring rules represented in temporal logic. Our method features a learnable logic structure that captures nuanced relationships across diverse driving scenarios, optimizing both rules and parameters directly from real-world driving demonstrations collected in NuPlan. Our approach effectively learns to evaluate driving behavior even though the training data only contains positive examples (successful driving demonstrations). Evaluations in closed-loop planning simulations demonstrate that our learned scoring rules outperform existing techniques, including expert-designed rules and neural network scoring models, while maintaining interpretability. This work introduces a data-driven approach to enhance the scoring mechanism in autonomous driving systems, designed as a plug-in module to seamlessly integrate with various trajectory proposers. Our video and code are available on xiong.zikang.me/FLoRA.

arxiv情報

著者	Zikang Xiong,Joe Kurian Eappen,Suresh Jagannathan
発行日	2025-02-17 02:06:57+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

A Framework for Learning Scoring Rules in Autonomous Driving Planning Systems

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー