Using LLMs to discover emerging coded antisemitic hate-speech emergence in extremist social media

要約

オンラインでのヘイトスピーチの蔓延は、ソーシャルメディアプラットフォームにとって難しい問題を引き起こしています。
特定の課題は、ユーザーへの帰属意識の創出と検出の回避の両方に関心のあるグループによるコード化言語の使用に関連しています。
コード化言語は急速に進化し、その使用法は時間の経過とともに変化します。
この論文では、新たに出現した暗号化されたヘイトを含んだ用語を検出するための方法論を提案します。
この方法論は、オンラインの反ユダヤ主義的言説の文脈でテストされます。
このアプローチでは、過激派ユーザーがよく使用するソーシャルメディアプラットフォームから収集された投稿が考慮されています。
投稿は、ユダヤ人に対する既知の憎悪の言説に関連するシード表現を使用してスクレイピングされています。
この方法は、各投稿を最も代表する表現を特定し、コーパス全体におけるそれらの頻度を計算することから始まります。
文法的に一貫性のない表現や以前に遭遇した表現をフィルタリングして除外し、新しく出現した整形式の用語に焦点を当てます。
続いて、微調整された大規模言語モデルを使用して既知の反ユダヤ主義用語との意味的類似性が評価され、既知の憎しみの表現から遠すぎる表現がフィルタリングされます。
明らかにユダヤ人のトピックに関連する用語を含む緊急の反ユダヤ主義的な表現は削除され、コード化された憎悪の表現のみが返されます。

要約(オリジナル)

Online hate speech proliferation has created a difficult problem for social media platforms. A particular challenge relates to the use of coded language by groups interested in both creating a sense of belonging for its users and evading detection. Coded language evolves quickly and its use varies over time. This paper proposes a methodology for detecting emerging coded hate-laden terminology. The methodology is tested in the context of online antisemitic discourse. The approach considers posts scraped from social media platforms, often used by extremist users. The posts are scraped using seed expressions related to previously known discourse of hatred towards Jews. The method begins by identifying the expressions most representative of each post and calculating their frequency in the whole corpus. It filters out grammatically incoherent expressions as well as previously encountered ones so as to focus on emergent well-formed terminology. This is followed by an assessment of semantic similarity to known antisemitic terminology using a fine-tuned large language model, and subsequent filtering out of the expressions that are too distant from known expressions of hatred. Emergent antisemitic expressions containing terms clearly relating to Jewish topics are then removed to return only coded expressions of hatred.

arxiv情報

著者	Dhanush Kikkisetti,Raza Ul Mustafa,Wendy Melillo,Roberto Corizzo,Zois Boukouvalas,Jeff Gill,Nathalie Japkowicz
発行日	2024-01-19 17:40:50+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Using LLMs to discover emerging coded antisemitic hate-speech emergence in extremist social media

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー