「cs.LG」カテゴリーアーカイブ

Control, Transport and Sampling: Towards Better Loss Design

投稿日: 2024年10月11日作成者: jarxiv

要約拡散ベースのサンプリング、最適輸送、およびシュオーディンガー橋問題への共有 … 続きを読む →

カテゴリー: cs.LG, stat.CO, stat.ML | コメントを受け付けていません

Stability-Aware Training of Machine Learning Force Fields with Differentiable Boltzmann Estimators

投稿日: 2024年10月11日作成者: jarxiv

要約機械学習力場 (MLFF) は、分子動力学 (MD) シミュレーションの非 … 続きを読む →

カテゴリー: cond-mat.dis-nn, cond-mat.mtrl-sci, cs.LG, physics.chem-ph, physics.comp-ph | コメントを受け付けていません

Features are fate: a theory of transfer learning in high-dimensional regression

投稿日: 2024年10月11日作成者: jarxiv

要約大規模な事前トレーニング済みニューラルネットワークの出現により、そのよう … 続きを読む →

カテゴリー: cs.LG, stat.ML | コメントを受け付けていません

Adam Exploits $\ell_\infty$-geometry of Loss Landscape via Coordinate-wise Adaptivity

投稿日: 2024年10月11日作成者: jarxiv

要約 Adam は、言語モデルをトレーニングする際に SGD よりも優れたパフォ … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Efficient Dictionary Learning with Switch Sparse Autoencoders

投稿日: 2024年10月11日作成者: jarxiv

要約スパースオートエンコーダ (SAE) は、ニューラルネットワークの活性 … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Private Language Models via Truncated Laplacian Mechanism

投稿日: 2024年10月11日作成者: jarxiv

要約 NLP タスクの深層学習モデルは、さまざまな種類のプライバシー攻撃を受けや … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

VerifierQ: Enhancing LLM Test Time Compute with Q-Learning-based Verifiers

投稿日: 2024年10月11日作成者: jarxiv

要約特に検証モデルの使用によるテスト時間の計算における最近の進歩により、大規模 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

How Powerful are Decoder-Only Transformer Neural Models?

投稿日: 2024年10月11日作成者: jarxiv

要約この記事では、現代の大規模言語モデル (LLM) を支える一般的なトランス … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Think Beyond Size: Dynamic Prompting for More Effective Reasoning

投稿日: 2024年10月11日作成者: jarxiv

要約この文書では、大規模言語モデル (LLM) の推論機能の向上を目的とした新 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

投稿日: 2024年10月11日作成者: jarxiv

要約大規模な言語モデルで推論を改善するための有望なアプローチは、プロセス報酬モ … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Control, Transport and Sampling: Towards Better Loss Design

Stability-Aware Training of Machine Learning Force Fields with Differentiable Boltzmann Estimators

Features are fate: a theory of transfer learning in high-dimensional regression

Adam Exploits $\ell_\infty$-geometry of Loss Landscape via Coordinate-wise Adaptivity

Efficient Dictionary Learning with Switch Sparse Autoencoders

Private Language Models via Truncated Laplacian Mechanism

VerifierQ: Enhancing LLM Test Time Compute with Q-Learning-based Verifiers

How Powerful are Decoder-Only Transformer Neural Models?

Think Beyond Size: Dynamic Prompting for More Effective Reasoning

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

最近の投稿

最近のコメント

アーカイブ

カテゴリー