Transparent Neighborhood Approximation for Text Classifier Explanation

要約

最近の文献では、モデルに依存しない説明を導き出す際の近傍構築の重要な役割が強調されており、特にテキスト分類子の説明において、合成インスタンスの品質を向上させるために生成モデルを導入する傾向が高まっています。
これらのアプローチは、テキストの構造化されていない性質によって引き起こされる近傍構築の課題を克服し、それによって説明の品質を向上させます。
ただし、配備されたジェネレーターは通常、ニューラルネットワークを介して実装されており、固有の説明可能性に欠けており、説明プロセス自体の透明性をめぐる議論が巻き起こっています。
近傍の品質を維持しながらこの制限に対処するために、この文書では、ブラックボックステキストジェネレーターの代替として確率ベースの編集方法を紹介します。
このアプローチでは、テキスト内のコンテキストに基づいた操作を実装することで、隣接するテキストを生成します。
ジェネレータベースの構築プロセスを再帰的確率ベースの編集に置き換えた結果の説明手法である XPROB (Explainer with Probability Based Editing) は、2 つの現実世界のデータセットに対して行われた評価によれば、競合するパフォーマンスを示します。
さらに、XPROB の完全に透明でより制御可能な構築プロセスにより、ジェネレーターベースのエクスプローラーと比較して優れた安定性が実現します。

要約(オリジナル)

Recent literature highlights the critical role of neighborhood construction in deriving model-agnostic explanations, with a growing trend toward deploying generative models to improve synthetic instance quality, especially for explaining text classifiers. These approaches overcome the challenges in neighborhood construction posed by the unstructured nature of texts, thereby improving the quality of explanations. However, the deployed generators are usually implemented via neural networks and lack inherent explainability, sparking arguments over the transparency of the explanation process itself. To address this limitation while preserving neighborhood quality, this paper introduces a probability-based editing method as an alternative to black-box text generators. This approach generates neighboring texts by implementing manipulations based on in-text contexts. Substituting the generator-based construction process with recursive probability-based editing, the resultant explanation method, XPROB (explainer with probability-based editing), exhibits competitive performance according to the evaluation conducted on two real-world datasets. Additionally, XPROB’s fully transparent and more controllable construction process leads to superior stability compared to the generator-based explainers.

arxiv情報

著者	Yi Cai,Arthur Zimek,Eirini Ntoutsi,Gerhard Wunder
発行日	2024-11-25 10:10:09+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Transparent Neighborhood Approximation for Text Classifier Explanation

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー