Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian


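Addressing the challenge of limited annotated data in specialized fields and low-resource languages is crucial for the effective use of Language Models (LMs).
Most Large Language Models (LLMs) are trained on general-purpose English corpora, and there is a notable gap in models specifically tailored for Italian, particularly for technical and bureaucratic jargon.
This paper explores the feasibility of employing smaller, domain-specific encoder LMs together with prompting techniques to improve performance in these specialized contexts.
Furthermore, applying calibration techniques and in-domain verbalizers significantly improves the effectiveness of encoder models.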


Addressing the challenge of limited annotated data in specialized fields and low-resource languages is crucial for the effective use of Language Models (LMs). While most Large Language Models (LLMs) are trained on general-purpose English corpora, there is a notable gap in models specifically tailored for Italian, particularly for technical and bureaucratic jargon. This paper explores the feasibility of employing smaller, domain-specific encoder LMs alongside prompting techniques to enhance performance in these specialized contexts. Our study concentrates on Italian bureaucratic and legal language, experimenting with both general-purpose and further pre-trained encoder-only models. We evaluated the models on downstream tasks such as document classification and entity typing, and conducted intrinsic evaluations using Pseudo-Log-Likelihood. The results indicate that while further pre-trained models may show diminished robustness in general knowledge, they exhibit superior adaptability for domain-specific tasks, even in a zero-shot setting. Furthermore, the application of calibration techniques and in-domain verbalizers significantly enhances the efficacy of encoder models. These domain-specialized models prove to be particularly advantageous in scenarios where in-domain resources or expertise are scarce. In conclusion, our findings offer new insights into the use of Italian models in specialized contexts, which may have a significant impact on both research and industrial applications in the digital transformation era.
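
To make the roles of verbalizers and calibration concrete, here is a minimal sketch of zero-shot classification with an encoder model. It is not the authors' implementation: the abstract does not say which calibration method was used, so the sketch assumes one common variant, dividing each label's score by the score the same label receives on a content-free prompt. The function name `classify`, the label names, the Italian label words, and all probabilities are hypothetical; in a real setup the probabilities would come from a masked LM's prediction at the mask position of a prompt.

```python
def classify(mask_probs, verbalizers, prior_probs=None):
    """Return (best_label, scores) from mask-position word probabilities.

    mask_probs:  word -> probability at the mask for the input document.
    verbalizers: label -> list of label words that stand for that label.
    prior_probs: optional word -> probability for a content-free prompt;
                 if given, each label score is divided by its prior
                 (an assumed calibration scheme, see the lead-in).
    """
    scores = {}
    for label, words in verbalizers.items():
        # Label score: total probability mass of the label's words.
        score = sum(mask_probs.get(w, 0.0) for w in words)
        if prior_probs is not None:
            prior = sum(prior_probs.get(w, 0.0) for w in words)
            # Guard against division by zero for unseen label words.
            score = score / max(prior, 1e-12)
        scores[label] = score
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical in-domain verbalizers for two document classes.
verbalizers = {
    "taxation": ["tributi", "imposta"],
    "environment": ["ambiente", "rifiuti"],
}

# Toy numbers, invented for illustration only.
mask_probs = {"tributi": 0.10, "imposta": 0.05, "ambiente": 0.12, "rifiuti": 0.06}
prior_probs = {"tributi": 0.02, "imposta": 0.01, "ambiente": 0.08, "rifiuti": 0.04}

raw_label, raw_scores = classify(mask_probs, verbalizers)
cal_label, cal_scores = classify(mask_probs, verbalizers, prior_probs)
print("uncalibrated:", raw_label)
print("calibrated:  ", cal_label)
```

In this toy case the model favors the "environment" words regardless of input, so the uncalibrated prediction follows that bias, while dividing by the content-free prior changes the decision. Listing several domain words per label, rather than a single generic word, is the idea behind in-domain verbalizers.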


Authors: Serena Auriemma, Martina Miliani, Mauro Madeddu, Alessandro Bondielli, Lucia Passaro, Alessandro Lenci
Published: 2024-07-30 08:50:16+00:00
