Towards Reliable and Interpretable Traffic Crash Pattern Prediction and Safety Interventions Using Customized Large Language Models

要約

クラッシュイベントの予測は、クラッシュ分布とその貢献要因を理解するために重要であり、それにより、積極的な交通安全ポリシーの介入の設計を可能にします。
ただし、既存の方法は、数値特性、テキストレポート、クラッシュ画像、環境条件、ドライバーの行動記録など、トラフィッククラッシュデータのさまざまなソース間の複雑な相互作用を解釈するのに苦労しています。
その結果、彼らはしばしば、これらの多様なデータソースに埋め込まれた豊富なセマンティック情報と複雑な相互関係をキャプチャできず、重要なクラッシュリスク要因を特定する能力を制限します。
この研究では、LLMSを適応させてクラッシュ予測を再構成し、テキストベースの推論として帰属を特徴とするTrackedafeを提案します。
58,903の実際のレポートを含むマルチモーダルクラッシュデータセットと、属するインフラストラクチャ、環境、ドライバー、および車両情報が収集され、Trackafeイベントデータセットにテキスト化されます。
このデータセットでLLMをカスタマイズして微調整することにより、TrafficSafe LLMは、ベースラインよりもF1スコアの42％の平均改善を達成します。
これらの予測を解釈し、貢献要因を明らかにするために、条件付きリスク分析を可能にする文レベルの機能属性フレームワークであるTrackedafe Attributionを導入します。
調査結果は、アルコール障害の運転が重度のクラッシュの主要な要因であり、攻撃的および減損関連の行動は、他のドライバーの行動と比較して深刻なクラッシュの貢献度のほぼ2倍の貢献をしていることを示しています。
さらに、TrafficsAfeの属性は、モデルトレーニング中に極めて重要な機能を強調し、反復パフォーマンスの改善のために戦略的クラッシュデータ収集を導きます。
提案されているTrackedafeは、交通安全研究の変革的な飛躍を提供し、高度なAIテクノロジーを責任ある、実用的で命を救う結果に変換するための青写真を提供します。

要約(オリジナル)

Predicting crash events is crucial for understanding crash distributions and their contributing factors, thereby enabling the design of proactive traffic safety policy interventions. However, existing methods struggle to interpret the complex interplay among various sources of traffic crash data, including numeric characteristics, textual reports, crash imagery, environmental conditions, and driver behavior records. As a result, they often fail to capture the rich semantic information and intricate interrelationships embedded in these diverse data sources, limiting their ability to identify critical crash risk factors. In this research, we propose TrafficSafe, a framework that adapts LLMs to reframe crash prediction and feature attribution as text-based reasoning. A multi-modal crash dataset including 58,903 real-world reports together with belonged infrastructure, environmental, driver, and vehicle information is collected and textualized into TrafficSafe Event Dataset. By customizing and fine-tuning LLMs on this dataset, the TrafficSafe LLM achieves a 42% average improvement in F1-score over baselines. To interpret these predictions and uncover contributing factors, we introduce TrafficSafe Attribution, a sentence-level feature attribution framework enabling conditional risk analysis. Findings show that alcohol-impaired driving is the leading factor in severe crashes, with aggressive and impairment-related behaviors having nearly twice the contribution for severe crashes compared to other driver behaviors. Furthermore, TrafficSafe Attribution highlights pivotal features during model training, guiding strategic crash data collection for iterative performance improvements. The proposed TrafficSafe offers a transformative leap in traffic safety research, providing a blueprint for translating advanced AI technologies into responsible, actionable, and life-saving outcomes.

arxiv情報

著者	Yang Zhao,Pu Wang,Yibo Zhao,Hongru Du,Hao Frank Yang
発行日	2025-05-21 17:38:02+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Towards Reliable and Interpretable Traffic Crash Pattern Prediction and Safety Interventions Using Customized Large Language Models

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー