Fair Mixed Effects Support Vector Machine


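Fairness in machine learning aims to mitigate biases present in the training data and model imperfections that could lead to discriminatory outcomes.
However, the assumption that observations are independent often does not hold for data describing social phenomena, where data points are frequently clustered.
We present a fair mixed effects support vector machine algorithm that can handle both problems simultaneously.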


To ensure unbiased and ethical automated predictions, fairness must be a core principle in machine learning applications. Fairness in machine learning aims to mitigate biases present in the training data and model imperfections that could lead to discriminatory outcomes. This is achieved by preventing the model from making decisions based on sensitive characteristics such as ethnicity or sexual orientation. A fundamental assumption in machine learning is the independence of observations. However, this assumption often does not hold for data describing social phenomena, where data points are frequently clustered. Hence, if machine learning models do not account for the correlations within clusters, the results may be biased. The bias is especially high when the cluster assignment is correlated with the variable of interest. We present a fair mixed effects support vector machine algorithm that can handle both problems simultaneously. With a reproducible simulation study, we demonstrate the impact of clustered data on the quality of fair machine learning predictions.
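
To make the independence issue concrete, the following is a minimal sketch that is not taken from the paper. It assumes a simple random-intercept model, y_ij = b_i + e_ij, in which every observation in cluster i shares the same random effect b_i, and it estimates the intraclass correlation with the standard one-way ANOVA estimator. The function names, cluster sizes, and variances are hypothetical choices made for illustration only.

```python
import random


def simulate_clusters(num_clusters, cluster_size, sigma_b, seed=0):
    """Draw y_ij = b_i + e_ij with b_i ~ N(0, sigma_b^2) and e_ij ~ N(0, 1).

    Hypothetical illustrative model, not the simulation design of the paper.
    """
    rng = random.Random(seed)
    clusters = []
    for _ in range(num_clusters):
        b = rng.gauss(0.0, sigma_b)  # random intercept shared by the cluster
        clusters.append([b + rng.gauss(0.0, 1.0) for _ in range(cluster_size)])
    return clusters


def intraclass_correlation(clusters):
    """One-way ANOVA estimate of the intraclass correlation (balanced clusters)."""
    k = len(clusters)
    n = len(clusters[0])
    means = [sum(c) / n for c in clusters]
    grand_mean = sum(means) / k
    # Mean square between clusters and mean square within clusters.
    msb = n * sum((m - grand_mean) ** 2 for m in means) / (k - 1)
    msw = sum(
        (y - m) ** 2 for c, m in zip(clusters, means) for y in c
    ) / (k * (n - 1))
    return (msb - msw) / (msb + (n - 1) * msw)


if __name__ == "__main__":
    clustered = simulate_clusters(200, 10, sigma_b=1.0, seed=1)
    independent = simulate_clusters(200, 10, sigma_b=0.0, seed=1)
    print("ICC with random intercepts:", intraclass_correlation(clustered))
    print("ICC without cluster effect:", intraclass_correlation(independent))
```

Under these assumptions the true intraclass correlation is sigma_b^2 / (sigma_b^2 + 1), so observations within a cluster are correlated whenever sigma_b > 0. A model that treats them as independent ignores this shared component, which is the situation mixed effects models are designed to handle.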


Authors: João Vitor Pamplona, Jan Pablo Burgard
Published: 2024-05-10 12:25:06+00:00
