Learning Personalized Decision Support Policies

要約

人間の個々の意思決定者は、意思決定の結果を改善するためにさまざまな形式のサポートから恩恵を受ける可能性がありますが、それぞれの形式のサポートがより良い結果をもたらすのはいつでしょうか?
この研究では、意思決定支援ツールへのアクセスをパーソナライズすることが、AI 支援の適切な使用をインスタンス化するための効果的なメカニズムとなり得ると仮定しています。
具体的には、与えられた入力に対して、最初は事前情報のない意思決定者にどの形式のサポートを提供するかを選択する意思決定支援ポリシーを学習するという一般的な問題を提案します。
私たちは、パーソナライズされた意思決定支援ポリシーを学習するための対話型ツール $\texttt{Modiste}$ を開発しています。
$\texttt{Modiste}$ は、確率的コンテキストバンディット手法を活用して、意思決定者ごとに意思決定サポートポリシーをカスタマイズし、サポートのコストなどの補助的な目的を考慮した複数の目的設定の拡張をサポートします。
パーソナライズされたポリシーはオフラインポリシーよりも優れており、コストを意識した設定では、パフォーマンスの低下を最小限に抑えながら発生コストを削減できることがわかりました。
私たちの実験には、視覚と言語のタスクに関するさまざまな現実的な形式のサポート（専門家の合意や大規模な言語モデルからの予測など）が含まれています。
私たちの人体実験は計算実験を検証し、$\texttt{Modiste}$ と対話する実際のユーザーにとってパーソナライゼーションが実際にメリットをもたらす可能性があることを実証しています。

要約(オリジナル)

Individual human decision-makers may benefit from different forms of support to improve decision outcomes, but when each form of support will yield better outcomes? In this work, we posit that personalizing access to decision support tools can be an effective mechanism for instantiating the appropriate use of AI assistance. Specifically, we propose the general problem of learning a decision support policy that, for a given input, chooses which form of support to provide to decision-makers for whom we initially have no prior information. We develop $\texttt{Modiste}$, an interactive tool to learn personalized decision support policies. $\texttt{Modiste}$ leverages stochastic contextual bandit techniques to personalize a decision support policy for each decision-maker and supports extensions to the multi-objective setting to account for auxiliary objectives like the cost of support. We find that personalized policies outperform offline policies, and, in the cost-aware setting, reduce the incurred cost with minimal degradation to performance. Our experiments include various realistic forms of support (e.g., expert consensus and predictions from a large language model) on vision and language tasks. Our human subject experiments validate our computational experiments, demonstrating that personalization can yield benefits in practice for real users, who interact with $\texttt{Modiste}$.

arxiv情報

著者	Umang Bhatt,Valerie Chen,Katherine M. Collins,Parameswaran Kamalaruban,Emma Kallina,Adrian Weller,Ameet Talwalkar
発行日	2024-05-27 14:10:24+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Learning Personalized Decision Support Policies

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー