DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance


We propose a new ensemble method, Diverse Fingerprint Ensemble (DFPE), which leverages the complementary strengths of multiple LLMs to achieve more robust performance.


Large Language Models (LLMs) have shown remarkable capabilities across various natural language processing tasks but often struggle to excel uniformly in diverse or complex domains. We propose a novel ensemble method, Diverse Fingerprint Ensemble (DFPE), which leverages the complementary strengths of multiple LLMs to achieve more robust performance. Our approach involves: (1) clustering models based on their response "fingerprint" patterns, (2) applying a quantile-based filtering mechanism to remove underperforming models at a per-subject level, and (3) assigning adaptive weights to the remaining models based on their subject-wise validation accuracy. In experiments on the Massive Multitask Language Understanding (MMLU) benchmark, DFPE outperforms the best single model by 3% in overall accuracy and 5% in discipline-level accuracy. The method increases the robustness and generalization of LLMs and underscores how model selection, diversity preservation, and performance-driven weighting can effectively address challenging, multi-faceted language understanding tasks.
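The three steps in the abstract can be illustrated with a minimal Python sketch for a single subject. This is not the authors' implementation: the abstract does not specify the clustering algorithm, the quantile definition, or the weighting formula. The sketch therefore assumes a greedy Hamming-distance clustering, a nearest-rank quantile cutoff, and weights proportional to validation accuracy. All function names, model names, and data below are hypothetical.

```python
from collections import defaultdict


def hamming(a, b):
    """Fraction of validation questions on which two models answer differently."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


def accuracy(answers, gold):
    """Fraction of questions answered correctly."""
    return sum(a == g for a, g in zip(answers, gold)) / len(gold)


def cluster_representatives(fingerprints, gold, max_dist):
    """Step 1 (assumed greedy scheme): visit models from most to least accurate
    and keep a model only if its fingerprint is farther than max_dist from
    every representative kept so far."""
    ranked = sorted(fingerprints,
                    key=lambda m: accuracy(fingerprints[m], gold),
                    reverse=True)
    reps = []
    for m in ranked:
        if all(hamming(fingerprints[m], fingerprints[r]) > max_dist for r in reps):
            reps.append(m)
    return reps


def quantile_filter(acc, q):
    """Step 2 (assumed nearest-rank quantile): keep models whose subject
    accuracy is at or above the q-quantile of the candidate accuracies."""
    values = sorted(acc.values())
    cutoff = values[min(int(q * len(values)), len(values) - 1)]
    return {m: a for m, a in acc.items() if a >= cutoff}


def adaptive_weights(acc):
    """Step 3 (assumed formula): weights proportional to validation accuracy."""
    total = sum(acc.values())
    return {m: a / total for m, a in acc.items()}


def weighted_vote(test_answers, weights):
    """Return the answer option with the largest total weight."""
    scores = defaultdict(float)
    for m, w in weights.items():
        scores[test_answers[m]] += w
    return max(scores, key=scores.get)


# Hypothetical validation data for one subject: one answer letter per question.
gold = "ABCDABCD"
fingerprints = {
    "m1": "ABCDABCA",  # 7/8 correct
    "m2": "ABCDABCB",  # 7/8 correct, near-duplicate of m1
    "m3": "ABCDDCBA",  # 4/8 correct
    "m4": "BADCBBCD",  # 3/8 correct
}

reps = cluster_representatives(fingerprints, gold, max_dist=0.2)
subject_acc = {m: accuracy(fingerprints[m], gold) for m in reps}
kept = quantile_filter(subject_acc, q=0.5)
weights = adaptive_weights(kept)

# Hypothetical answers of each model to one unseen test question.
prediction = weighted_vote({"m1": "C", "m2": "C", "m3": "B", "m4": "D"}, weights)
```

In this toy run, `m2` is dropped as a near-duplicate of `m1`, `m4` falls below the median accuracy of the remaining candidates, and the final vote is taken over `m1` and `m3` with `m1` carrying the larger weight. In a multi-subject setting such as MMLU, the filtering and weighting steps would be repeated per subject.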


Authors: Seffi Cohen, Niv Goldshlager, Nurit Cohen-Inger, Bracha Shapira, Lior Rokach
Published: 2025-01-29 08:44:45+00:00
arXiv page: arxiv_id (pdf)

Categories: cs.AI, cs.CL, cs.LG