Disentangling Dialect from Social Bias via Multitask Learning to Improve Fairness


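Most NLP methods are not sensitive to the syntactic and lexical variations that dialects introduce.
To fill this gap, we investigate performance disparities between dialects in the detection of five aspects of biased language and how to mitigate them.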


Dialects introduce syntactic and lexical variations in language that occur in regional or social groups. Most NLP methods are not sensitive to such variations. This may lead to unfair behavior of the methods, conveying negative bias towards dialect speakers. While previous work has studied dialect-related fairness for aspects like hate speech, other aspects of biased language, such as lewdness, remain fully unexplored. To fill this gap, we investigate performance disparities between dialects in the detection of five aspects of biased language and how to mitigate them. To alleviate bias, we present a multitask learning approach that models dialect language as an auxiliary task to incorporate syntactic and lexical variations. In our experiments with African-American English dialect, we provide empirical evidence that complementing common learning approaches with dialect modeling improves their fairness. Furthermore, the results suggest that multitask learning achieves state-of-the-art performance and helps to detect properties of biased language more reliably.
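
The two ideas in the abstract, an auxiliary dialect task trained jointly with the main task and a performance disparity between dialects, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes binary labels for both tasks, a weighted sum of two cross-entropy losses, and accuracy as the compared metric. The names `multitask_loss`, `aux_weight`, and `accuracy_gap`, and the group labels `"aae"` and `"other"`, are hypothetical and chosen only for illustration.

```python
import math


def binary_cross_entropy(probs, labels, eps=1e-12):
    """Mean binary cross-entropy between predicted probabilities and 0/1 labels."""
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(labels)


def multitask_loss(bias_probs, bias_labels, dialect_probs, dialect_labels,
                   aux_weight=0.5):
    """Joint loss: main biased-language loss plus a weighted auxiliary dialect loss.

    aux_weight is an assumed hyperparameter; the abstract does not state
    how the two tasks are weighted.
    """
    main = binary_cross_entropy(bias_probs, bias_labels)
    aux = binary_cross_entropy(dialect_probs, dialect_labels)
    return main + aux_weight * aux


def accuracy_gap(preds, labels, groups):
    """Return (gap, per_group_accuracy), where gap is best minus worst group accuracy."""
    correct, count = {}, {}
    for p, y, g in zip(preds, labels, groups):
        count[g] = count.get(g, 0) + 1
        correct[g] = correct.get(g, 0) + int(p == y)
    acc = {g: correct[g] / count[g] for g in count}
    return max(acc.values()) - min(acc.values()), acc


# Toy data, invented for illustration only.
loss = multitask_loss([0.9, 0.2], [1, 0], [0.8, 0.3], [1, 0])
gap, per_group = accuracy_gap(
    preds=[1, 0, 1, 1],
    labels=[1, 0, 0, 1],
    groups=["aae", "aae", "other", "other"],
)
print(f"joint loss: {loss:.4f}")
print(f"accuracy gap: {gap:.2f}, per group: {per_group}")
```

In a real model, both sets of probabilities would come from task-specific heads on a shared encoder, so that gradients from the dialect task shape the shared representation. A smaller gap between groups indicates more even performance across dialects under this metric.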


Authors: Maximilian Spliethöver, Sai Nikhil Menon, Henning Wachsmuth
Published: 2024-06-14 12:39:39+00:00

Category: cs.CL