Wikipedia is Not a Dictionary, Delete! Text Classification as a Proxy for Analysing Wiki Deletion Discussions

要約

WikipediaやWikidataなどの共同知識ハブの自動コンテンツモデレートは、複数の要因により重要でありながら挑戦的な作業です。
このホワイトペーパーでは、いくつかのWikiと3つの言語で削除がマークされた記事を中心に行われる議論のデータベースを作成し、それを使用して、さまざまなタスクのさまざまなLMSを評価します（議論の結果を予測することから、個々のコメントが指摘している可能性がある）。
私たちの結果は、とりわけ、削除につながる議論が予測が容易であり、驚くべきことに、自己生産されたタグ（キープ、削除、またはリダイレクト）は、おそらくユーザーのためらいやコメント内の審議のために、常に分類子をガイドするのに役立つとは限らないことを明らかにしています。

要約(オリジナル)

Automated content moderation for collaborative knowledge hubs like Wikipedia or Wikidata is an important yet challenging task due to multiple factors. In this paper, we construct a database of discussions happening around articles marked for deletion in several Wikis and in three languages, which we then use to evaluate a range of LMs on different tasks (from predicting the outcome of the discussion to identifying the implicit policy an individual comment might be pointing to). Our results reveal, among others, that discussions leading to deletion are easier to predict, and that, surprisingly, self-produced tags (keep, delete or redirect) don’t always help guiding the classifiers, presumably because of users’ hesitation or deliberation within comments.

arxiv情報

著者	Hsuvas Borkakoty,Luis Espinosa-Anke
発行日	2025-03-13 12:07:35+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Wikipedia is Not a Dictionary, Delete! Text Classification as a Proxy for Analysing Wiki Deletion Discussions

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー