Deciphering the Definition of Adversarial Robustness for post-hoc OOD Detectors

要約

分散除外（OOD）入力の検出は、実際のシナリオに深い学習モデルを安全に展開するために重要です。
近年、多くのOOD検出器が開発されており、ベンチマークさえ標準化されています。つまり、OpenOoodです。
事後検出器の数は急速に成長しています。
彼らは、事前に訓練された分類器を自然な分布シフトから保護し、現実世界のシナリオの準備ができていると主張するオプションを示しています。
ただし、ほとんどの研究では、敵対例（ADEX）に対処する上でのその有効性が無視されています。
OOD検出器がその実験にADEXが含まれている場合、ADEXの均一なパラメーターがないため、OOD検出器のパフォーマンスを正確に評価することが困難になります。
このペーパーでは、さまざまな回避攻撃に対する16の事後検出器の敵対的な堅牢性を調査します。
また、敵対的な堅牢性に役立つOOD検出器における敵対的防御のロードマップについても議論しています。
制限を確認するには、任意のOOD検出器にレベル1（統一データセットのADEX）を追加する必要があると考えています。
ロードマップ（適応攻撃に対する防御）の最後のレベルは、敵対的な機械学習（AML）の視点からの完全性を追加しました。

要約(オリジナル)

Detecting out-of-distribution (OOD) inputs is critical for safely deploying deep learning models in real-world scenarios. In recent years, many OOD detectors have been developed, and even the benchmarking has been standardized, i.e. OpenOOD. The number of post-hoc detectors is growing fast. They are showing an option to protect a pre-trained classifier against natural distribution shifts and claim to be ready for real-world scenarios. However, its effectiveness in dealing with adversarial examples (AdEx) has been neglected in most studies. In cases where an OOD detector includes AdEx in its experiments, the lack of uniform parameters for AdEx makes it difficult to accurately evaluate the performance of the OOD detector. This paper investigates the adversarial robustness of 16 post-hoc detectors against various evasion attacks. It also discusses a roadmap for adversarial defense in OOD detectors that would help adversarial robustness. We believe that level 1 (AdEx on a unified dataset) should be added to any OOD detector to see the limitations. The last level in the roadmap (defense against adaptive attacks) we added for integrity from an adversarial machine learning (AML) point of view, which we do not believe is the ultimate goal for OOD detectors.

arxiv情報

著者	Peter Lorenz,Mario Fernandez,Jens Müller,Ullrich Köthe
発行日	2025-01-29 04:48:16+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Deciphering the Definition of Adversarial Robustness for post-hoc OOD Detectors

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー