"cs.LG" Category Archive

Accelerated Training through Iterative Gradient Propagation Along the Residual Path

Posted on January 29, 2025 by jarxiv

Abstract: Despite being a cornerstone of deep learning, backpropagation, for very deep …

Categories: cs.LG

Solving Roughly Forced Nonlinear PDEs via Misspecified Kernel Methods and Neural Networks

Posted on January 29, 2025 by jarxiv

Abstract: Using Gaussian processes (GPs) or neural networks (NNs), …

Categories: cs.LG, cs.NA, math.NA

Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction

Posted on January 29, 2025 by jarxiv

Abstract: Reinforcement learning from human feedback (RLHF) and direct preference optimization (DPO) …

Categories: cs.LG

Evidence on the Regularisation Properties of Maximum-Entropy Reinforcement Learning

Posted on January 29, 2025 by jarxiv

Abstract: The generalization and robustness properties of policies learned through maximum-entropy reinforcement learning …

Categories: cs.LG

Convergence of two-timescale gradient descent ascent dynamics: finite-dimensional and mean-field perspectives

Posted on January 29, 2025 by jarxiv

Abstract: Two-timescale gradient descent ascent (GDA), in min-max games, Nash equilibria …

Categories: cs.LG, cs.NA, math.NA, math.OC

CoRe-Net: Co-Operational Regressor Network with Progressive Transfer Learning for Blind Radar Signal Restoration

Posted on January 29, 2025 by jarxiv

Abstract: Real-world radar signals, with sensor noise, echoes, interference, intentional jamming, …

Categories: cs.LG

Scanning Trojaned Models Using Out-of-Distribution Samples

Posted on January 29, 2025 by jarxiv

Abstract: Scanning for trojans (backdoors) in deep neural networks is …

Categories: cs.LG

Refusal in LLMs is an Affine Function

Posted on January 29, 2025 by jarxiv

Abstract: To steer the behavior of language models by intervening directly on activations, …

Categories: cs.CL, cs.LG

SLIM: Let LLM Learn More and Forget Less with Soft LoRA and Identity Mixture

Posted on January 29, 2025 by jarxiv

Abstract: Although many efforts have been made, training budgets in many applications, …

Categories: cs.CL, cs.LG

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Posted on January 29, 2025 by jarxiv

Abstract: Tokenization is a fundamental component of large language models (LLMs), but …

Categories: cs.CL, cs.LG

