jarxiv | Japanese arxiv | ページ 1550

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

投稿日: 2025年2月28日作成者: jarxiv

要約

まばらに活性化された混合混合物（MOE）モデルは、計算予算を増やすことなくモデル容量を拡大するために広く採用されています。
ただし、バニラTOPKルーターは、不連続で変化する可能性のない方法で訓練されており、パフォーマンスとスケーラビリティを制限しています。
この問題に対処するために、レミーを提案します。これは、リリーターをルーターとして使用して、従来のTOPK+ソフトマックスルーティングにシンプルで効果的なドロップイン交換を提供する完全に微分可能なMOEアーキテクチャを提案します。
さらに、専門家の負荷のバランスをとりながら、ルーターのスパースを調節する方法を提案します。
Remoeの連続性は、トークンとレイヤー間の計算の効率的な動的割り当てを可能にし、ドメインの専門化も示します。
私たちの実験は、Remoeが、さまざまなモデルサイズ、専門家数、および粒度のレベルにわたってバニラTopkルーティングのMOEを常に上回ることを示しています。
さらに、Remoeは、従来のMOEアーキテクチャを上回る専門家の数に関して優れたスケーラビリティを示します。
Megatron-LMに基づく実装は、https：//github.com/thu-ml/remoeで入手できます。

要約(オリジナル)

Sparsely activated Mixture-of-Experts (MoE) models are widely adopted to scale up model capacity without increasing the computation budget. However, vanilla TopK routers are trained in a discontinuous, non-differentiable way, limiting their performance and scalability. To address this issue, we propose ReMoE, a fully differentiable MoE architecture that offers a simple yet effective drop-in replacement for the conventional TopK+Softmax routing, utilizing ReLU as the router instead. We further propose methods to regulate the router’s sparsity while balancing the load among experts. ReMoE’s continuous nature enables efficient dynamic allocation of computation across tokens and layers, while also exhibiting domain specialization. Our experiments demonstrate that ReMoE consistently outperforms vanilla TopK-routed MoE across various model sizes, expert counts, and levels of granularity. Furthermore, ReMoE exhibits superior scalability with respect to the number of experts, surpassing traditional MoE architectures. The implementation based on Megatron-LM is available at https://github.com/thu-ml/ReMoE.

arxiv情報

著者	Ziteng Wang,Jun Zhu,Jianfei Chen
発行日	2025-02-27 16:33:09+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG | コメントを受け付けていません

Formation of Representations in Neural Networks

投稿日: 2025年2月28日作成者: jarxiv

要約

神経表現を理解することは、ニューラルネットワークのブラックボックスを開き、最新のAIシステムの科学的理解を促進するのに役立ちます。
しかし、現代のニューラルネットワークで複雑で構造化され、移転可能な表現がどのように出現しているかは、謎のままです。
以前の結果に基づいて、神経ネットワークのほとんどの隠れた層の表現形成を普遍的に管理するために、6つのアライメント関係のセットを仮定する標準表現仮説（CRH）を提案します。
CRHの下では、潜在表現（R）、重量（W）、およびニューロン勾配（g）がトレーニング中に相互に整列します。
このアライメントは、ニューラルネットワークが自然にコンパクトな表現を学習することを意味します。コンパクトな表現では、ニューロンと重量がタスクに照らされる変換に不変であることを意味します。
次に、CRHの破壊がR、W、およびGの間の相互の電力関係の出現につながることを示します。これは、多項式アライメント仮説（PAH）と呼ばれます。
勾配ノイズと正則化のバランスが標準表現の出現に不可欠であることを証明する最小限の吸収理論を提示します。
CRHとPAHは、単一のフレームワークで、神経崩壊や神経特徴Ansatzを含む主要な主要な深い学習現象を統一するエキサイティングな可能性につながります。

要約(オリジナル)

Understanding neural representations will help open the black box of neural networks and advance our scientific understanding of modern AI systems. However, how complex, structured, and transferable representations emerge in modern neural networks has remained a mystery. Building on previous results, we propose the Canonical Representation Hypothesis (CRH), which posits a set of six alignment relations to universally govern the formation of representations in most hidden layers of a neural network. Under the CRH, the latent representations (R), weights (W), and neuron gradients (G) become mutually aligned during training. This alignment implies that neural networks naturally learn compact representations, where neurons and weights are invariant to task-irrelevant transformations. We then show that the breaking of CRH leads to the emergence of reciprocal power-law relations between R, W, and G, which we refer to as the Polynomial Alignment Hypothesis (PAH). We present a minimal-assumption theory proving that the balance between gradient noise and regularization is crucial for the emergence of the canonical representation. The CRH and PAH lead to an exciting possibility of unifying major key deep learning phenomena, including neural collapse and the neural feature ansatz, in a single framework.

arxiv情報

著者	Liu Ziyin,Isaac Chuang,Tomer Galanti,Tomaso Poggio
発行日	2025-02-27 16:35:09+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cond-mat.dis-nn, cs.LG | コメントを受け付けていません

Random Latent Exploration for Deep Reinforcement Learning

投稿日: 2025年2月28日作成者: jarxiv

要約

強化学習（RL）におけるシンプルで効果的な探索戦略であるランダム潜在探査（RLE）を紹介します。
平均して、RLEはエージェントのアクションを混乱させるノイズベースの方法と、新しい行動を試みるためにエージェントに報いるボーナスベースの探査を上回ります。
RLEの中心的なアイデアは、潜在的な空間でランダムにサンプリングされた目標を追求することにより、エージェントが環境のさまざまな部分を探索することを奨励することです。
RLEは、複雑なボーナス計算を回避しますが、ボーナスベースの方法の深い探査の利点を保持するため、ノイズベースの方法と同じくらい簡単です。
私たちの実験は、RLEが離散（ATARIなど）と連続制御タスク（ISAACジムなど）の両方で平均してパフォーマンスを向上させ、既存のRLアルゴリズムのシンプルで一般的なプラグインを維持しながら探索を強化することを示しています。
プロジェクトのウェブサイトとコード：https：//srinathm1359.github.io/random-latent-exploration

要約(オリジナル)

We introduce Random Latent Exploration (RLE), a simple yet effective exploration strategy in reinforcement learning (RL). On average, RLE outperforms noise-based methods, which perturb the agent’s actions, and bonus-based exploration, which rewards the agent for attempting novel behaviors. The core idea of RLE is to encourage the agent to explore different parts of the environment by pursuing randomly sampled goals in a latent space. RLE is as simple as noise-based methods, as it avoids complex bonus calculations but retains the deep exploration benefits of bonus-based methods. Our experiments show that RLE improves performance on average in both discrete (e.g., Atari) and continuous control tasks (e.g., Isaac Gym), enhancing exploration while remaining a simple and general plug-in for existing RL algorithms. Project website and code: https://srinathm1359.github.io/random-latent-exploration

arxiv情報

著者	Srinath Mahankali,Zhang-Wei Hong,Ayush Sekhari,Alexander Rakhlin,Pulkit Agrawal
発行日	2025-02-27 16:47:34+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG | コメントを受け付けていません

Understanding the Limits of Deep Tabular Methods with Temporal Shift

投稿日: 2025年2月28日作成者: jarxiv

要約

深い層のモデルは、I.I.D。で顕著な成功を示しています。
さまざまな構造化されたデータタスクで優れているデータ。
ただし、時間の経過とともに進化するデータ分布に傾向と定期的なパターンが存在する、時間の分布シフト下ではパフォーマンスが悪化することがよくあります。
この論文では、時間的依存関係をキャプチャする際のこの失敗の根本的な理由を探ります。
まず、トレーニングプロトコルを調査し、モデルの選択の実行方法に関する重要な問題を明らかにします。
既存のアプローチでは、検証セットを分割するために時間的順序付けを使用していますが、ランダム分割でさえモデルのパフォーマンスを大幅に改善できることを示します。
トレーニングデータとテスト時間の間の時間遅れを最小化することにより、検証のバイアスを減らしながら、提案されたトレーニングプロトコルは、さまざまな方法で一般化を大幅に改善します。
さらに、一時的なデータが深い層の表現にどのように影響するかを分析し、これらのモデルがしばしば重要な周期的情報とトレンド情報をキャプチャできないことを明らかにします。
このギャップに対処するために、フーリエシリーズの拡張に基づいてプラグアンドプレイの時間埋め込み方法を導入し、時間的パターンを学習および組み込み、時間的シフトを処理するための適応アプローチを提供します。
私たちの実験は、この時間的埋め込みと改善されたトレーニングプロトコルと組み合わされており、時間的表形式データから学習するためのより効果的で堅牢なフレームワークを提供することを示しています。

要約(オリジナル)

Deep tabular models have demonstrated remarkable success on i.i.d. data, excelling in a variety of structured data tasks. However, their performance often deteriorates under temporal distribution shifts, where trends and periodic patterns are present in the evolving data distribution over time. In this paper, we explore the underlying reasons for this failure in capturing temporal dependencies. We begin by investigating the training protocol, revealing a key issue in how model selection perform. While existing approaches use temporal ordering for splitting validation set, we show that even a random split can significantly improve model performance. By minimizing the time lag between training data and test time, while reducing the bias in validation, our proposed training protocol significantly improves generalization across various methods. Furthermore, we analyze how temporal data affects deep tabular representations, uncovering that these models often fail to capture crucial periodic and trend information. To address this gap, we introduce a plug-and-play temporal embedding method based on Fourier series expansion to learn and incorporate temporal patterns, offering an adaptive approach to handle temporal shifts. Our experiments demonstrate that this temporal embedding, combined with the improved training protocol, provides a more effective and robust framework for learning from temporal tabular data.

arxiv情報

著者	Hao-Run Cai,Han-Jia Ye
発行日	2025-02-27 16:48:53+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: 68T05, cs.LG, I.2.6 | コメントを受け付けていません

On the Importance of Reward Design in Reinforcement Learning-based Dynamic Algorithm Configuration: A Case Study on OneMax with (1+($λ$,$λ$))-GA

投稿日: 2025年2月28日作成者: jarxiv

要約

動的アルゴリズム構成（DAC）は、特に機械学習と深い学習アルゴリズムの有病率において、近年、大きな注目を集めています。
多くの研究が、アルゴリズムの構成に関連する最適化の課題に対処するために、強化学習（RL）における意思決定の堅牢性を活用しています。
ただし、RLエージェントを適切に動作させることは、特に報酬設計において、非自明なタスクであり、ドメインの専門知識に基づいてかなりの量の手作りの知識を必要とします。
この作業では、$（1+（\ lambda、\ lambda））$ -gaの$（1+（\ lambda、\ lambda））の人口サイズの制御に関する事例研究を介して、DACのコンテキストでの報酬設計の重要性を研究します。
設計が不十分な報酬は、探査の欠如のために最適なポリシーを学習するRLエージェントの能力を妨げる可能性があることを観察しました。
これらの課題に対処するために、RLエージェントによる環境の探求の強化を促進するために、報酬形成メカニズムの適用を提案します。
私たちの仕事は、$（1+（\ lambda、\ lambda））$-gaを動的に構成する際のRLの能力を実証するだけでなく、さまざまなサイズのOnemax問題にわたるRLエージェントのスケーラビリティにおける報酬形状の利点も確認します。

要約(オリジナル)

Dynamic Algorithm Configuration (DAC) has garnered significant attention in recent years, particularly in the prevalence of machine learning and deep learning algorithms. Numerous studies have leveraged the robustness of decision-making in Reinforcement Learning (RL) to address the optimization challenges associated with algorithm configuration. However, making an RL agent work properly is a non-trivial task, especially in reward design, which necessitates a substantial amount of handcrafted knowledge based on domain expertise. In this work, we study the importance of reward design in the context of DAC via a case study on controlling the population size of the $(1+(\lambda,\lambda))$-GA optimizing OneMax. We observed that a poorly designed reward can hinder the RL agent’s ability to learn an optimal policy because of a lack of exploration, leading to both scalability and learning divergence issues. To address those challenges, we propose the application of a reward shaping mechanism to facilitate enhanced exploration of the environment by the RL agent. Our work not only demonstrates the ability of RL in dynamically configuring the $(1+(\lambda,\lambda))$-GA, but also confirms the advantages of reward shaping in the scalability of RL agents across various sizes of OneMax problems.

arxiv情報

著者	Tai Nguyen,Phong Le,André Biendenkapp,Carola Doerr,Nguyen Dang
発行日	2025-02-27 16:53:28+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG, cs.NE | コメントを受け付けていません

A Counterfactual Analysis of the Dishonest Casino

投稿日: 2025年2月28日作成者: jarxiv

要約

不正なカジノは、HMMSとグラフィカルモデルを導入するために教育環境で使用される有名な隠しマルコフモデル（HMM）です。
ここでは、ダイロールのシーケンスが観察され、カジノはフェアとロードされたダイを切り替えます。
通常、目標は、観測されたロールを使用して、公正およびロードされたサイコロのパターンを推測し、フィルタリング、スムージング、およびViterBiアルゴリズムにつながることです。
ただし、このホワイトペーパーでは、賞金のどれだけがカジノの不正行為に起因するものであることを調査します。
これに対処するために、HMMと一致する構造的因果モデル（SCM）を導入し、不正行為（EWAC）（部分的に識別可能な）に起因する予想される賞金を線形プログラム（LPS）を使用して境界することができます。
数値実験を通じて、これらの境界を計算し、独立性、コモントニック、およびモノトニックのコピュラに基づいてベンチマークSCMを使用して直感を開発します。
SCMの時程度の状態でより緊密な境界が得られることを示しますが、緩い境界により、ほぼ明示的なLPソリューションが可能になります。
経路ごとの単調性や反事実的安定性などのドメイン固有の知識は、線形制約を介して組み込むことができます。
また、期間数が無限になるため、EWACが制限において完全に識別できる時間平均を示します。
私たちの仕事は、因果的推論における境界の反事実に貢献し、ダイナミックなHMM設定でLPの境界を開発した最初のものであり、反事実的な推論が教えられる教育的文脈に利益をもたらします。

要約(オリジナル)

The dishonest casino is a well-known hidden Markov model (HMM) used in educational settings to introduce HMMs and graphical models. Here, a sequence of die rolls is observed, with the casino switching between a fair and a loaded die. Typically, the goal is to use the observed rolls to infer the pattern of fair and loaded dice, leading to filtering, smoothing, and Viterbi algorithms. This paper, however, explores how much of the winnings is attributable to the casino’s cheating, a counterfactual question beyond the scope of HMM primitives. To address this, we introduce a structural causal model (SCM) consistent with the HMM and show the expected winnings attributable to cheating (EWAC) (which is only partially identifiable) can be bounded using linear programs (LPs). Through numerical experiments, we compute these bounds and develop intuition using benchmark SCMs based on independence, comonotonic, and counter-monotonic copulas. We show that tighter bounds are obtained with a time-homogeneity condition on the SCM, while looser bounds allow for an almost explicit LP solution. Domain-specific knowledge such as pathwise monotonicity or counterfactual stability can be incorporated via linear constraints. We also show the time-average EWAC is fully identifiable in the limit as the number of time periods goes to infinity. Our work contributes to bounding counterfactuals in causal inference and is the first to develop LP bounds in a dynamic HMM setting, benefiting educational contexts where counterfactual inference is taught.

arxiv情報

著者	Martin Haugh,Raghav Singal
発行日	2025-02-27 17:01:51+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG | コメントを受け付けていません

Online Meta-learning for AutoML in Real-time (OnMAR)

投稿日: 2025年2月28日作成者: jarxiv

要約

Automated Machine Learning（Automl）は、最適化技術を使用して機械学習（ML）アルゴリズムを設計することに焦点を当てた研究分野です。これは、人間が手動アルゴリズム設計を実行する必要性を軽減します。
リアルタイムAutomlを使用すると、MLアルゴリズムがタスクに適用されている間に設計プロセスが発生します。
リアルタイムの自動車は、既存のリアルタイムの自動車技術がデザインの品質とデザインの作成にかかる時間に関して改善する必要があるため、新興の研究分野です。
これらの問題に対処するために、この調査では、自動車のオンラインメタ学習をリアルタイム（ONMAR）アプローチで提案しています。
Meta-Learningは、MLアルゴリズムによって行われた最適化プロセスに関する情報をメタ機能の形式で収集します。
メタ機能は、最適化プロセスを最適化するためにメタ学習者と組み合わせて使用されます。
ONMARアプローチでは、メタ学習者を使用して、ML設計の精度を予測します。
Meta-Learnerによって予測される精度が十分である場合、設計が使用され、予測される精度が低い場合、最適化手法は新しい設計を作成します。
遺伝的アルゴリズム（GA）は、ONMARアプローチの一部として使用される最適化手法です。
さまざまなメタ学習者（K-Nearest Neighbors、Random Forest、Xgboost）がテストされています。
ONMARアプローチはモデルに依存している（つまり、単一のリアルタイムAutomlアプリケーションに固有のものではない）ため、3つの異なるリアルタイムAutomlアプリケーションで評価されます。つまり、画像クラスタリングアルゴリズムの作成、畳み込みニューラルネットワークのハイパーパラメータの構成、ビデオ分類パイプラインの構成。
ONMARアプローチは効果的であり、既存のリアルタイムAutomlアプローチをマッチングまたはアウトパフォームしており、ランタイムが高速であるという利点が追加されています。

要約(オリジナル)

Automated machine learning (AutoML) is a research area focusing on using optimisation techniques to design machine learning (ML) algorithms, alleviating the need for a human to perform manual algorithm design. Real-time AutoML enables the design process to happen while the ML algorithm is being applied to a task. Real-time AutoML is an emerging research area, as such existing real-time AutoML techniques need improvement with respect to the quality of designs and time taken to create designs. To address these issues, this study proposes an Online Meta-learning for AutoML in Real-time (OnMAR) approach. Meta-learning gathers information about the optimisation process undertaken by the ML algorithm in the form of meta-features. Meta-features are used in conjunction with a meta-learner to optimise the optimisation process. The OnMAR approach uses a meta-learner to predict the accuracy of an ML design. If the accuracy predicted by the meta-learner is sufficient, the design is used, and if the predicted accuracy is low, an optimisation technique creates a new design. A genetic algorithm (GA) is the optimisation technique used as part of the OnMAR approach. Different meta-learners (k-nearest neighbours, random forest and XGBoost) are tested. The OnMAR approach is model-agnostic (i.e. not specific to a single real-time AutoML application) and therefore evaluated on three different real-time AutoML applications, namely: composing an image clustering algorithm, configuring the hyper-parameters of a convolutional neural network, and configuring a video classification pipeline. The OnMAR approach is effective, matching or outperforming existing real-time AutoML approaches, with the added benefit of a faster runtime.

arxiv情報

著者	Mia Gerber,Anna Sergeevna Bosman,Johan Pieter de Villiers
発行日	2025-02-27 17:07:32+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG | コメントを受け付けていません

Mitigating the Backdoor Effect for Multi-Task Model Merging via Safety-Aware Subspace

投稿日: 2025年2月28日作成者: jarxiv

要約

モデルのマージは、複数のシングルタスクの微調整されたモデルを統合されたモデルに統合するための費用対効果の高いアプローチとして大きな注目を集めています。
ただし、既存のモデルのマージテクニックは、主にタスク固有のモデル間の競合の解決に焦点を当てており、潜在的なセキュリティの脅威、特にオープンフォースモデルのエコシステムにおけるバックドア攻撃のリスクを見落としていることがよくあります。
この論文では、最初に、既存のモデルのマージメソッドの脆弱性をバックドア攻撃に調査し、バックドアの継承とバックドア転送という2つの重要な課題を特定します。
これらの問題に対処するために、タスクの干渉とバックドアの脆弱性を同時に軽減する、新しい防衛を意識したマージ（DAM）アプローチを提案します。
具体的には、DAMはデュアルマスクを使用したメタラーニングベースの最適化方法を採用して、モデルマージの共有および安全性の高いサブスペースを識別します。
これらのマスクは交互に最適化されています。タスク共有マスクは、タスク全体で一般的な有益なパラメーターを識別し、干渉を減らしながらタスク固有の知識を維持することを目的としています。
このデュアルマスク設計により、有用な知識の保存と潜在的な脆弱性の除去を慎重にバランスさせることができます。
既存のマージ方法と比較して、DAMはパフォーマンスとセキュリティの間でより好ましいバランスをとっており、攻撃の成功率を2〜10パーセントポイント削減しながら、精度は約1％しか犠牲にしません。
さらに、ダムは、さまざまな種類のバックドア攻撃と、マージプロセスに関与する妥協したモデルの数にわたって、堅牢なパフォーマンスと幅広い適用性を示します。
私たちのコードとモデルは、https：//github.com/yangjinluan/damで入手できます。

要約(オリジナル)

Model merging has gained significant attention as a cost-effective approach to integrate multiple single-task fine-tuned models into a unified one that can perform well on multiple tasks. However, existing model merging techniques primarily focus on resolving conflicts between task-specific models, they often overlook potential security threats, particularly the risk of backdoor attacks in the open-source model ecosystem. In this paper, we first investigate the vulnerabilities of existing model merging methods to backdoor attacks, identifying two critical challenges: backdoor succession and backdoor transfer. To address these issues, we propose a novel Defense-Aware Merging (DAM) approach that simultaneously mitigates task interference and backdoor vulnerabilities. Specifically, DAM employs a meta-learning-based optimization method with dual masks to identify a shared and safety-aware subspace for model merging. These masks are alternately optimized: the Task-Shared mask identifies common beneficial parameters across tasks, aiming to preserve task-specific knowledge while reducing interference, while the Backdoor-Detection mask isolates potentially harmful parameters to neutralize security threats. This dual-mask design allows us to carefully balance the preservation of useful knowledge and the removal of potential vulnerabilities. Compared to existing merging methods, DAM achieves a more favorable balance between performance and security, reducing the attack success rate by 2-10 percentage points while sacrificing only about 1% in accuracy. Furthermore, DAM exhibits robust performance and broad applicability across various types of backdoor attacks and the number of compromised models involved in the merging process. Our codes and models are available at https://github.com/Yangjinluan/DAM.

arxiv情報

著者	Jinluan Yang,Anke Tang,Didi Zhu,Zhengyu Chen,Li Shen,Fei Wu
発行日	2025-02-27 17:08:19+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.CR, cs.LG | コメントを受け付けていません

Continual learning with the neural tangent ensemble

投稿日: 2025年2月28日作成者: jarxiv

要約

継続的な学習のための自然戦略は、固定機能のベイジアンアンサンブルを比較検討することです。
これは、（単一の）ニューラルネットワークをアンサンブルとして解釈できる場合、忘れずに学習する効果的なアルゴリズムを設計できることを示唆しています。
この可能性を実現するために、Nパラメーターを持つニューラルネットワーク分類器は、N分類器の重み付きアンサンブルとして解釈できること、および怠zyな体制が学習中にこれらの分類器が固定されていることを観察します。
これらの分類器を神経接線の専門家と呼び、ラベルに有効な確率分布を出力します。
次に、過去のデータが与えられた各専門家の可能性と事後確率を導き出します。
驚くべきことに、これらの専門家の事後更新は、ネットワークの重みよりも確率的勾配降下（SGD）のスケーリングされた投影形態と同等です。
怠zyな体制から離れて、ネットワークは、時間の経過とともに改善する適応的な専門家のアンサンブルと見なすことができます。
これらの結果は、専門家のベイジアンアンサンブルとしてのニューラルネットワークの新しい解釈を提供し、継続的な学習設定で壊滅的な忘却を理解し、緩和するための原則的な枠組みを提供します。

要約(オリジナル)

A natural strategy for continual learning is to weigh a Bayesian ensemble of fixed functions. This suggests that if a (single) neural network could be interpreted as an ensemble, one could design effective algorithms that learn without forgetting. To realize this possibility, we observe that a neural network classifier with N parameters can be interpreted as a weighted ensemble of N classifiers, and that in the lazy regime limit these classifiers are fixed throughout learning. We call these classifiers the neural tangent experts and show they output valid probability distributions over the labels. We then derive the likelihood and posterior probability of each expert given past data. Surprisingly, the posterior updates for these experts are equivalent to a scaled and projected form of stochastic gradient descent (SGD) over the network weights. Away from the lazy regime, networks can be seen as ensembles of adaptive experts which improve over time. These results offer a new interpretation of neural networks as Bayesian ensembles of experts, providing a principled framework for understanding and mitigating catastrophic forgetting in continual learning settings.

arxiv情報

著者	Ari S. Benjamin,Christian Pehle,Kyle Daruwalla
発行日	2025-02-27 17:09:51+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG, cs.NE | コメントを受け付けていません

Conformal Tail Risk Control for Large Language Model Alignment

投稿日: 2025年2月28日作成者: jarxiv

要約

大規模な言語モデル（LLMS）の最近の開発により、さまざまなタスクに対する広範な使用が行われています。
社会におけるLLMの有病率は、彼らのパフォーマンスの信頼性に対する保証を懇願します。
特に、リスクに敏感なアプリケーションは、予想外に悪い結果、つまり尾のイベント、たとえば有毒な答え、屈辱的な言語、攻撃的な出力に細心の注意を払う必要があります。
人間の注釈を取得することの費用がかかる性質により、これらのテールイベントを定量化するプロセスを自動化するために、汎用スコアリングモデルが作成されています。
この現象は、それぞれのスコアリングメカニズム間に潜在的なヒューマシンの不整合を導入します。
この作業では、Blackboxモデルの軽量キャリブレーションフレームワークを提示し、人間と機械の整合性を証明できる保証を保証します。
私たちのフレームワークは、LLMが生じた損失の加重平均を高い信頼性で特徴付ける歪みリスク測定を制御するための厳密なアプローチを提供します。
私たちの方法の理論的基盤は、コンフォーマルリスク制御と従来の統計ファミリー、つまりL-statisticsとの関係に依存しています。
フレームワークの有用性を実証するために、ヒューマンマシンの不整合の問題に対処する包括的な実験を実施します。

要約(オリジナル)

Recent developments in large language models (LLMs) have led to their widespread usage for various tasks. The prevalence of LLMs in society implores the assurance on the reliability of their performance. In particular, risk-sensitive applications demand meticulous attention to unexpectedly poor outcomes, i.e., tail events, for instance, toxic answers, humiliating language, and offensive outputs. Due to the costly nature of acquiring human annotations, general-purpose scoring models have been created to automate the process of quantifying these tail events. This phenomenon introduces potential human-machine misalignment between the respective scoring mechanisms. In this work, we present a lightweight calibration framework for blackbox models that ensures the alignment of humans and machines with provable guarantees. Our framework provides a rigorous approach to controlling any distortion risk measure that is characterized by a weighted average of quantiles of the loss incurred by the LLM with high confidence. The theoretical foundation of our method relies on the connection between conformal risk control and a traditional family of statistics, i.e., L-statistics. To demonstrate the utility of our framework, we conduct comprehensive experiments that address the issue of human-machine misalignment.

arxiv情報

著者	Catherine Yu-Chi Chen,Jingyan Shen,Zhun Deng,Lihua Lei
発行日	2025-02-27 17:10:54+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, Google

カテゴリー: cs.LG, stat.ML | コメントを受け付けていません

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント