Common Limitations of Image Processing Metrics: A Picture Story

要約

自動画像解析の重要性は継続的に高まっているが、最近のメタ研究により、アルゴリズムの検証に関して大きな欠陥があることが明らかになった。パフォーマンスメトリクスは、使用される自動アルゴリズムの有意義で客観的、かつ透明性のあるパフォーマンス評価と検証のために特に重要であるが、与えられた画像解析タスクに特定のメトリクスを使用する際の実用的な落とし穴には比較的注意が払われていない。これらは通常、（1）クラスの不均衡や小さなターゲット構造の存在下での挙動など、固有のメトリクス特性の無視、（2）テストケースの非独立性など、固有のデータセット特性の無視、（3）メトリクスが反映すべき実際の生物医学領域の関心の無視に関連するものである。この Living Dynamically ドキュメントは、画像解析の分野で一般的に適用されているパフォーマンスメトリクスの重要な限界を説明することを目的としています。この文脈では、画像レベルの分類、セマンティックセグメンテーション、インスタンスセグメンテーション、またはオブジェクト検出タスクとして言い換えることができる生物医学画像分析問題に焦点を当てている。現在のバージョンは、世界60以上の機関の画像解析専門家からなる国際コンソーシアムによって行われた、メトリクスに関するデルファイプロセスに基づくものである。

要約(オリジナル)

While the importance of automatic image analysis is continuously increasing, recent meta-research revealed major flaws with respect to algorithm validation. Performance metrics are particularly key for meaningful, objective, and transparent performance assessment and validation of the used automatic algorithms, but relatively little attention has been given to the practical pitfalls when using specific metrics for a given image analysis task. These are typically related to (1) the disregard of inherent metric properties, such as the behaviour in the presence of class imbalance or small target structures, (2) the disregard of inherent data set properties, such as the non-independence of the test cases, and (3) the disregard of the actual biomedical domain interest that the metrics should reflect. This living dynamically document has the purpose to illustrate important limitations of performance metrics commonly applied in the field of image analysis. In this context, it focuses on biomedical image analysis problems that can be phrased as image-level classification, semantic segmentation, instance segmentation, or object detection task. The current version is based on a Delphi process on metrics conducted by an international consortium of image analysis experts from more than 60 institutions worldwide.

arxiv情報

著者	Annika Reinke,Minu D. Tizabi,Carole H. Sudre,Matthias Eisenmann,Tim Rädsch,Michael Baumgartner,Laura Acion,Michela Antonelli,Tal Arbel,Spyridon Bakas,Peter Bankhead,Arriel Benis,M. Jorge Cardoso,Veronika Cheplygina,Evangelia Christodoulou,Beth Cimini,Gary S. Collins,Keyvan Farahani,Bram van Ginneken,Ben Glocker,Patrick Godau,Fred Hamprecht,Daniel A. Hashimoto,Doreen Heckmann-Nötzel,Michael M. Hoffmann,Merel Huisman,Fabian Isensee,Pierre Jannin,Charles E. Kahn,Alexandros Karargyris,Alan Karthikesalingam,Bernhard Kainz,Emre Kavur,Hannes Kenngott,Jens Kleesiek,Thijs Kooi,Michal Kozubek,Anna Kreshuk,Tahsin Kurc,Bennett A. Landman,Geert Litjens,Amin Madani,Klaus Maier-Hein,Anne L. Martel,Peter Mattson,Erik Meijering,Bjoern Menze,David Moher,Karel G. M. Moons,Henning Müller,Brennan Nichyporuk,Felix Nickel,Jens Petersen,Gorkem Polat,Nasir Rajpoot,Mauricio Reyes,Nicola Rieke,Michael Riegler,Hassan Rivaz,Julio Saez-Rodriguez,Clarisa Sanchez Gutierrez,Julien Schroeter,Anindo Saha,Shravya Shetty,Maarten van Smeden,Bram Stieltjes,Ronald M. Summers,Abdel A. Taha,Sotirios A. Tsaftaris,Ben Van Calster,Gaël Varoquaux,Manuel Wiesenfarth,Ziv R. Yaniv,Annette Kopp-Schneider,Paul Jäger,Lena Maier-Hein
発行日	2022-06-07 07:38:44+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, DeepL

Common Limitations of Image Processing Metrics: A Picture Story

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー