Hardness of Deceptive Certificate Selection

要約

AI の理論的な解釈可能性の保証に向けた最近の進歩は、インタラクティブな証明システムに基づいた分類器によって行われています。
証明者はデータポイントから証明書を選択し、クラスを決定する検証者にそれを送信します。
機械学習のコンテキストでは、このような証明書はクラスの情報を提供する機能となる可能性があります。
高い健全性と完全性を備えたセットアップの場合、交換される証明書には、データポイントの真のクラスとの高い相互情報量が必要です。
ただし、この保証はデータセットの非対称特徴相関の限界に依存しており、この特性はこれまでのところ高次元データの推定が困難です。
それは W\’aldchen et al. で推測されました。
AFC を悪用するのは計算的に困難であるということをここで証明します。
私たちは、有益でない証明書を使用しながら、AFC を悪用して高い完全性と健全性を達成することを目的とした、悪意のある証明者と検証者のデュオについて考えます。
このタスクは $\mathsf{NP}$ 困難であり、$\mathcal{O}(m^{1/8 – \epsilon})$ よりも良く近似できないことを示します。ここで、$m$ は可能な数です
密対ランダム予想の下で $\epsilon>0$ の証明書。
これは、対話型分類は計算的に悪用されにくいため、AFC が現実世界のタスクでの対話型分類の使用を妨げるべきではないという証拠です。

要約(オリジナル)

Recent progress towards theoretical interpretability guarantees for AI has been made with classifiers that are based on interactive proof systems. A prover selects a certificate from the datapoint and sends it to a verifier who decides the class. In the context of machine learning, such a certificate can be a feature that is informative of the class. For a setup with high soundness and completeness, the exchanged certificates must have a high mutual information with the true class of the datapoint. However, this guarantee relies on a bound on the Asymmetric Feature Correlation of the dataset, a property that so far is difficult to estimate for high-dimensional data. It was conjectured in W\’aldchen et al. that it is computationally hard to exploit the AFC, which is what we prove here. We consider a malicious prover-verifier duo that aims to exploit the AFC to achieve high completeness and soundness while using uninformative certificates. We show that this task is $\mathsf{NP}$-hard and cannot be approximated better than $\mathcal{O}(m^{1/8 – \epsilon})$, where $m$ is the number of possible certificates, for $\epsilon>0$ under the Dense-vs-Random conjecture. This is some evidence that AFC should not prevent the use of interactive classification for real-world tasks, as it is computationally hard to be exploited.

arxiv情報

著者	Stephan Wäldchen
発行日	2023-06-07 15:12:16+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Hardness of Deceptive Certificate Selection

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー