Improving the Multi-label Atomic Activity Recognition by Robust Visual Feature and Advanced Attention @ ROAD++ Atomic Activity Recognition 2024

要約

Road++ Track3 は、交通シナリオにおけるマルチラベルのアトミックアクティビティ認識タスクを提案します。これは、64 クラスのマルチラベルビデオアクション認識タスクとして標準化できます。
マルチラベル原子活動認識タスクでは、視覚的特徴抽出の堅牢性が依然として重要な課題であり、これはモデルのパフォーマンスと汎化能力に直接影響します。
これらの問題に対処するために、私たちのチームはデータ処理、モデル、後処理という 3 つの側面を最適化しました。
まず、適切な解像度とビデオサンプリング戦略が選択され、固定のサンプリング戦略が検証セットとテストセットに設定されます。
次に、モデルのトレーニングに関して、チームは特徴抽出用にさまざまなビジュアルバックボーンネットワークを選択し、トレーニングセットと検証セットでトレーニングされ、テストセットで推論されるアクションスロットモデルを導入します。
最後に、後処理として、チームはさまざまなモデルの長所と短所を組み合わせて加重融合を行い、テストセットの最終的な mAP は 58% となり、チャレンジのベースラインより 4% 高くなりました。

要約(オリジナル)

Road++ Track3 proposes a multi-label atomic activity recognition task in traffic scenarios, which can be standardized as a 64-class multi-label video action recognition task. In the multi-label atomic activity recognition task, the robustness of visual feature extraction remains a key challenge, which directly affects the model performance and generalization ability. To cope with these issues, our team optimized three aspects: data processing, model and post-processing. Firstly, the appropriate resolution and video sampling strategy are selected, and a fixed sampling strategy is set on the validation and test sets. Secondly, in terms of model training, the team selects a variety of visual backbone networks for feature extraction, and then introduces the action-slot model, which is trained on the training and validation sets, and reasoned on the test set. Finally, for post-processing, the team combined the strengths and weaknesses of different models for weighted fusion, and the final mAP on the test set was 58%, which is 4% higher than the challenge baseline.

arxiv情報

著者	Jiamin Cao,Lingqi Wang,Kexin Zhang,Yuting Yang,Licheng Jiao,Yuwei Guo
発行日	2024-10-21 14:10:14+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Improving the Multi-label Atomic Activity Recognition by Robust Visual Feature and Advanced Attention @ ROAD++ Atomic Activity Recognition 2024

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー