Multi-Task Learning for Fatigue Detection and Face Recognition of Drivers via Tree-Style Space-Channel Attention Fusion Network

要約

運転シナリオでは、自動車の予防安全システムにディープラーニング技術が組み込まれることが増えています。
これらのシステムは通常、疲労運転の検出やドライバーの ID の認識など、複数のタスクを同時に処理する必要があります。
ただし、複数の単一タスクモデルを組み合わせる従来の並列スタイルのアプローチでは、同様のタスクを処理するときにリソースを浪費する傾向があります。
したがって、マルチタスク学習のための新しいツリー形式のマルチタスクモデリングアプローチを提案します。これは、共有バックボーンに根ざし、モデルパイプラインが深くなるにつれて、より多くの専用の個別のモジュールブランチが追加されます。
ツリー形式のアプローチに従って、ドライバーの疲労検出とドライバーを識別するための顔認識を同時に実行するマルチタスク学習モデルを提案します。
このモデルは、共通の特徴抽出バックボーンモジュールを共有し、さらに分離された特徴抽出および分類モジュールブランチを備えています。
専用ブランチは、空間およびチャネルアテンションメカニズムを活用および組み合わせて、空間チャネル融合アテンション強化機能を生成し、検出パフォーマンスの向上につながります。
利用できるのはシングルタスクデータセットのみであるため、シングルタスクデータセットのみを使用してマルチタスクモデルをトレーニングするための交互の更新と勾配累積を含む手法を導入します。
ツリー形式のマルチタスク学習モデルの有効性は、広範な検証を通じて検証されています。

要約(オリジナル)

In driving scenarios, automobile active safety systems are increasingly incorporating deep learning technology. These systems typically need to handle multiple tasks simultaneously, such as detecting fatigue driving and recognizing the driver’s identity. However, the traditional parallel-style approach of combining multiple single-task models tends to waste resources when dealing with similar tasks. Therefore, we propose a novel tree-style multi-task modeling approach for multi-task learning, which rooted at a shared backbone, more dedicated separate module branches are appended as the model pipeline goes deeper. Following the tree-style approach, we propose a multi-task learning model for simultaneously performing driver fatigue detection and face recognition for identifying a driver. This model shares a common feature extraction backbone module, with further separated feature extraction and classification module branches. The dedicated branches exploit and combine spatial and channel attention mechanisms to generate space-channel fused-attention enhanced features, leading to improved detection performance. As only single-task datasets are available, we introduce techniques including alternating updation and gradient accumulation for training our multi-task model using only the single-task datasets. The effectiveness of our tree-style multi-task learning model is verified through extensive validations.

arxiv情報

著者	Shulei Qu,Zhenguo Gao,Xiaowei Chen,Na Li,Yakai Wang,Xiaoxiao Wu
発行日	2024-05-13 15:34:20+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Multi-Task Learning for Fatigue Detection and Face Recognition of Drivers via Tree-Style Space-Channel Attention Fusion Network

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー