FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition

要約

少数の名前付きエンティティ認識 (NER) は、限られたリソースドメインにおけるエンティティのタグ付けには不可欠であるため、近年適切な注目を集めています。
少数ショット NER の既存のアプローチは、主にドメイン内設定で評価されます。
対照的に、いくつかのラベル付きドメイン内の例を使用して、これらの本質的に忠実なモデルがクロスドメイン NER でどのように動作するかについてはほとんど知られていません。
この論文では、モデルの汎化能力を向上させるための 2 段階の理論的根拠を中心としたデータ拡張方法を提案します。
いくつかのデータセットの結果は、モデルに依存しない手法が、データ拡張手法やプロンプトチューニング手法などの以前の最先端の手法と比較して、クロスドメイン NER タスクのパフォーマンスを大幅に向上させることを示しています。
私たちのコードは https://github.com/lifan-yuan/FactMix で入手できます。

要約(オリジナル)

Few-shot Named Entity Recognition (NER) is imperative for entity tagging in limited resource domains and thus received proper attention in recent years. Existing approaches for few-shot NER are evaluated mainly under in-domain settings. In contrast, little is known about how these inherently faithful models perform in cross-domain NER using a few labeled in-domain examples. This paper proposes a two-step rationale-centric data augmentation method to improve the model’s generalization ability. Results on several datasets show that our model-agnostic method significantly improves the performance of cross-domain NER tasks compared to previous state-of-the-art methods, including the data augmentation and prompt-tuning methods. Our codes are available at https://github.com/lifan-yuan/FactMix.

arxiv情報

著者	Linyi Yang,Lifan Yuan,Leyang Cui,Wenyang Gao,Yue Zhang
発行日	2023-12-26 03:48:59+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー