Achieving Diversity in Counterfactual Explanations: a Review and Discussion

要約

【タイトル】カウンターファクチャルな説明の多様性の実現：レビューと議論

【要約】
– カウンターファクチャルな例は、訓練された決定モデルの予測をユーザーに説明するために使用され、関連する予測を変更するために行うべき修正を示します。
– カウンターファクチャルな例は、良好な説明の基準を量化するいくつかの基準を組み合わせた最適化問題の解として一般的に定義されていますが、どの基準が良好な説明を満たすかはユーザーニーズによって異なるため、適切なプロパティを選択することは困難です。
– この問題を回避するには、複数の異なるカウンターファクチャルな例を生成し、予測を説明する方法とすることがあります。この論文は、この「多様性の概念」に対して多数の定義が提案されていることを説明し、その原則とユーザーニーズに基づく仮説について説明します。
– さらに、多様性のカテゴリ化を行い、研究上の課題を特定します。

要約(オリジナル)

In the field of Explainable Artificial Intelligence (XAI), counterfactual examples explain to a user the predictions of a trained decision model by indicating the modifications to be made to the instance so as to change its associated prediction. These counterfactual examples are generally defined as solutions to an optimization problem whose cost function combines several criteria that quantify desiderata for a good explanation meeting user needs. A large variety of such appropriate properties can be considered, as the user needs are generally unknown and differ from one user to another; their selection and formalization is difficult. To circumvent this issue, several approaches propose to generate, rather than a single one, a set of diverse counterfactual examples to explain a prediction. This paper proposes a review of the numerous, sometimes conflicting, definitions that have been proposed for this notion of diversity. It discusses their underlying principles as well as the hypotheses on the user needs they rely on and proposes to categorize them along several dimensions (explicit vs implicit, universe in which they are defined, level at which they apply), leading to the identification of further research challenges on this topic.

arxiv情報

著者	Thibault Laugel,Adulam Jeyasothy,Marie-Jeanne Lesot,Christophe Marsala,Marcin Detyniecki
発行日	2023-05-10 02:09:19+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, OpenAI

Achieving Diversity in Counterfactual Explanations: a Review and Discussion

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー