On the Minimax Regret in Online Ranking with Top-k Feedback

要約

オンラインランキングでは、学習アルゴリズムが一連のアイテムを順番にランク付けし、そのランク付けに関するフィードバックを関連性スコアの形式で受け取ります。
関連性スコアの取得には通常、人間による注釈が必要となるため、フィードバックをランキングの上位 $k$ 項目に制限する部分的なフィードバック設定を検討することは非常に興味深いことです。
Chaudhuri と Tewari [2017] は、上位 $k$ のフィードバックを使用してオンラインランキングアルゴリズムを分析するフレームワークを開発しました。
彼らの研究における重要な要素は、部分的なモニタリングによる技術の使用でした。
この論文では、上位の $k$ フィードバックによるオンラインランキングをさらに調査し、Chaudhuri と Tewari [2017] によって提起されたいくつかの未解決の問題を解決します。
すべての $k$ に対する上位 $k$ フィードバックモデルと、ペアワイズ損失、割引累積利得、および精度@n のランキングパフォーマンス尺度を使用して、ミニマックス後悔率の完全な特性評価を提供します。
さらに、Precision@n の最小後悔率を達成する効率的なアルゴリズムを提供します。

要約(オリジナル)

In online ranking, a learning algorithm sequentially ranks a set of items and receives feedback on its ranking in the form of relevance scores. Since obtaining relevance scores typically involves human annotation, it is of great interest to consider a partial feedback setting where feedback is restricted to the top-$k$ items in the rankings. Chaudhuri and Tewari [2017] developed a framework to analyze online ranking algorithms with top $k$ feedback. A key element in their work was the use of techniques from partial monitoring. In this paper, we further investigate online ranking with top $k$ feedback and solve some open problems posed by Chaudhuri and Tewari [2017]. We provide a full characterization of minimax regret rates with the top $k$ feedback model for all $k$ and for the following ranking performance measures: Pairwise Loss, Discounted Cumulative Gain, and Precision@n. In addition, we give an efficient algorithm that achieves the minimax regret rate for Precision@n.

arxiv情報

著者	Mingyuan Zhang,Ambuj Tewari
発行日	2024-04-12 14:28:39+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

On the Minimax Regret in Online Ranking with Top-k Feedback

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー