Bridging Social Media and Search Engines: Dredge Words and the Detection of Unreliable Domains

要約

プロアクティブなコンテンツモデレーションでは、プラットフォームが Web サイトの信頼性を迅速かつ継続的に評価する必要があります。
ユーザーがたどる信頼性の低い Web サイトへの直接的および間接的な経路を活用して、Web グラフと大規模なソーシャルメディアコンテキストの両方を統合した Web サイトの信頼性分類および検出システムを開発します。
さらに、信頼性の低いドメインが検索エンジンで上位にランクされる単語、用語、フレーズをドレッジするという概念を導入し、ソーシャルメディアでの使用状況を初めて調査します。
ウェブグラフとソーシャルメディアコンテキストを組み合わせた当社のグラフニューラルネットワークは、ウェブサイトの信頼性分類において最先端の結果を生成し、信頼性の低いドメインの上位 K の識別を大幅に向上させます。
さらに、ソーシャルメディアとオンラインコマースプラットフォームの両方との強いつながりを強調する、ドレッジワードの新しいデータセットをリリースします。

要約(オリジナル)

Proactive content moderation requires platforms to rapidly and continuously evaluate the credibility of websites. Leveraging the direct and indirect paths users follow to unreliable websites, we develop a website credibility classification and discovery system that integrates both webgraph and large-scale social media contexts. We additionally introduce the concept of dredge words, terms or phrases for which unreliable domains rank highly on search engines, and provide the first exploration of their usage on social media. Our graph neural networks that combine webgraph and social media contexts generate to state-of-the-art results in website credibility classification and significantly improves the top-k identification of unreliable domains. Additionally, we release a novel dataset of dredge words, highlighting their strong connections to both social media and online commerce platforms.

arxiv情報

著者	Evan M. Williams,Peter Carragher,Kathleen M. Carley
発行日	2024-09-17 16:20:53+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Bridging Social Media and Search Engines: Dredge Words and the Detection of Unreliable Domains

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー