Transformers and Ensemble methods: A solution for Hate Speech Detection in Arabic languages

要約

このホワイトペーパーでは、CERIST NLP チャレンジ 2022 のサブタスクの 1 つであるヘイトスピーチ検出の共有タスクへの参加について説明します。実験では、2 つのアンサンブルアプローチを使用して、6 つの変換モデルとそれらの組み合わせのパフォーマンスを評価します。
5 分割クロス検証シナリオでのトレーニングセットの最良の結果は、多数決に基づくアンサンブルアプローチを使用して得られました。
テストセットでこのアプローチを評価した結果、F1 スコアは 0.60、精度は 0.86 でした。

要約(オリジナル)

This paper describes our participation in the shared task of hate speech detection, which is one of the subtasks of the CERIST NLP Challenge 2022. Our experiments evaluate the performance of six transformer models and their combination using 2 ensemble approaches. The best results on the training set, in a five-fold cross validation scenario, were obtained by using the ensemble approach based on the majority vote. The evaluation of this approach on the test set resulted in an F1-score of 0.60 and an Accuracy of 0.86.

arxiv情報

著者	Angel Felipe Magnossão de Paula,Imene Bensalem,Paolo Rosso,Wajdi Zaghouani
発行日	2023-03-17 08:02:54+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Transformers and Ensemble methods: A solution for Hate Speech Detection in Arabic languages

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー