SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification

要約

この論文では、野生で撮影されたウミガメの写真を含む、初の公開された大規模な長期データセット — \href{https://www.kaggle.com/datasets/wildlifedatasets/seaturtleid2022}{SeaTurtleID2022} を紹介します。
このデータセットには、13 年間以内に収集された 438 頭の固有の個体の 8,729 枚の写真が含まれており、動物の再識別のための最長期間のデータセットとなっています。
すべての写真には、身元、遭遇タイムスタンプ、身体部分セグメンテーションマスクなどのさまざまな注釈が含まれています。
標準的な「ランダム」分割の代わりに、このデータセットでは 2 つの現実的で環境に配慮した分割が可能です: (i) 異なる日/年のトレーニング、検証、テストデータを含む \textit{時間認識の閉セット}、および (ii)
) テストセットと検証セットに新しい未知の個人が含まれる \textit{時間認識オープンセット}。
ランダムな分割はパフォーマンスの過大評価につながるため、再識別方法のベンチマークには時間認識の分割が不可欠であることを示します。
さらに、さまざまな体の部分にわたるベースラインのインスタンスのセグメンテーションと再識別パフォーマンスが提供されます。
最後に、ウミガメを再識別するためのエンドツーエンドのシステムを提案し、評価します。
ヘッドインスタンスセグメンテーション用のハイブリッドタスクカスケードと ArcFace でトレーニングされた特徴抽出機能に基づく提案されたシステムは、86.8\% の精度を達成しました。

要約(オリジナル)

This paper introduces the first public large-scale, long-span dataset with sea turtle photographs captured in the wild — \href{https://www.kaggle.com/datasets/wildlifedatasets/seaturtleid2022}{SeaTurtleID2022}. The dataset contains 8729 photographs of 438 unique individuals collected within 13 years, making it the longest-spanned dataset for animal re-identification. All photographs include various annotations, e.g., identity, encounter timestamp, and body parts segmentation masks. Instead of standard ‘random’ splits, the dataset allows for two realistic and ecologically motivated splits: (i) a \textit{time-aware closed-set} with training, validation, and test data from different days/years, and (ii) a \textit{time-aware open-set} with new unknown individuals in test and validation sets. We show that time-aware splits are essential for benchmarking re-identification methods, as random splits lead to performance overestimation. Furthermore, a baseline instance segmentation and re-identification performance over various body parts is provided. Finally, an end-to-end system for sea turtle re-identification is proposed and evaluated. The proposed system based on Hybrid Task Cascade for head instance segmentation and ArcFace-trained feature-extractor achieved an accuracy of 86.8\%.

arxiv情報

著者	Lukáš Adam,Vojtěch Čermák,Kostas Papafitsoros,Lukáš Picek
発行日	2024-05-01 13:16:09+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー