Mergenetic: a Simple Evolutionary Model Merging Library

要約

モデルのマージにより、既存のモデルの機能を新しいモデルに組み合わせることができます – 追加のトレーニングなしでポストホックです。
これにより、低コストと消費者GPUの融合をサポートするライブラリの可用性のおかげで、ますます人気が高まっています。
最近の研究では、合併と進化的アルゴリズムがパフォーマンスを高めることができることが示されていますが、現在、言語モデルのそのような戦略での柔軟な実験をサポートしているフレームワークはありません。
進化モデルの合併のためのオープンソースライブラリであるMergeneticを紹介します。
マージーは、軽量のフィットネス推定器を組み込んで評価コストを削減しながら、マージメソッドと進化的アルゴリズムの簡単な構成を可能にします。
その設計について説明し、マージーが控えめなハードウェアを使用してタスクと言語間で競争力のある結果を生成することを実証します。

要約(オリジナル)

Model merging allows combining the capabilities of existing models into a new one – post hoc, without additional training. This has made it increasingly popular thanks to its low cost and the availability of libraries that support merging on consumer GPUs. Recent work shows that pairing merging with evolutionary algorithms can boost performance, but no framework currently supports flexible experimentation with such strategies in language models. We introduce Mergenetic, an open-source library for evolutionary model merging. Mergenetic enables easy composition of merging methods and evolutionary algorithms while incorporating lightweight fitness estimators to reduce evaluation costs. We describe its design and demonstrate that Mergenetic produces competitive results across tasks and languages using modest hardware.

arxiv情報

著者	Adrian Robert Minut,Tommaso Mencattini,Andrea Santilli,Donato Crisostomi,Emanuele Rodolà
発行日	2025-05-16 16:43:23+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

Mergenetic: a Simple Evolutionary Model Merging Library

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー