月別アーカイブ: 2024年5月

Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

投稿日: 2024年5月24日作成者: jarxiv

要約一連の中間ステップ、別名思考連鎖 (CoT) を生成するようにモデルに指示 … 続きを読む →

カテゴリー: cs.CC, cs.LG, stat.ML | コメントを受け付けていません

Lorentz-Equivariant Geometric Algebra Transformers for High-Energy Physics

投稿日: 2024年5月24日作成者: jarxiv

要約素粒子物理実験から科学的理解を引き出すには、高精度かつ優れたデータ効率で多 … 続きを読む →

カテゴリー: cs.LG, hep-ph, physics.data-an, stat.ML | コメントを受け付けていません

Scalable Optimization in the Modular Norm

投稿日: 2024年5月24日作成者: jarxiv

要約現代の深層学習のパフォーマンスを向上させるために、層の数とサイズの両方の点 … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Deep learning lattice gauge theories

投稿日: 2024年5月24日作成者: jarxiv

要約モンテカルロ法は、格子ゲージ理論の強結合挙動に対する深い洞察をもたらし、ハ … 続きを読む →

カテゴリー: cond-mat.dis-nn, cond-mat.str-el, cs.LG, hep-lat, hep-th | コメントを受け付けていません

Analysis of Atom-level pretraining with QM data for Graph Neural Networks Molecular property models

投稿日: 2024年5月24日作成者: jarxiv

要約定量的構造活性相関 (QSAR) モデルの深層学習は急速かつ大幅に進歩して … 続きを読む →

カテゴリー: cs.LG, physics.chem-ph, quant-ph | コメントを受け付けていません

Differentiable Annealed Importance Sampling Minimizes The Jensen-Shannon Divergence Between Initial and Target Distribution

投稿日: 2024年5月24日作成者: jarxiv

要約 Geffner & Domke (2021) および Zhang … 続きを読む →

カテゴリー: cs.LG, stat.ML | コメントを受け付けていません

Local Causal Discovery for Structural Evidence of Direct Discrimination

投稿日: 2024年5月24日作成者: jarxiv

要約公平性は、ポリシー設計とアルゴリズムによる意思決定における重要な目標です。 … 続きを読む →

カテゴリー: cs.LG, stat.ML | コメントを受け付けていません

PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression

投稿日: 2024年5月24日作成者: jarxiv

要約大規模言語モデル (LLM) の「極端な」圧縮、つまりパラメーターあたり … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Not All Language Model Features Are Linear

投稿日: 2024年5月24日作成者: jarxiv

要約最近の研究では、言語モデルが活性化空間内の概念 (「特徴」) の 1 次元 … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Synthetic Data Generation for Intersectional Fairness by Leveraging Hierarchical Group Structure

投稿日: 2024年5月24日作成者: jarxiv

要約このペーパーでは、分類タスクにおける交差の公平性を強化するために特別に調整 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

月別アーカイブ: 2024年5月

Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

Lorentz-Equivariant Geometric Algebra Transformers for High-Energy Physics

Scalable Optimization in the Modular Norm

Deep learning lattice gauge theories

Analysis of Atom-level pretraining with QM data for Graph Neural Networks Molecular property models

Differentiable Annealed Importance Sampling Minimizes The Jensen-Shannon Divergence Between Initial and Target Distribution

Local Causal Discovery for Structural Evidence of Direct Discrimination

PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression

Not All Language Model Features Are Linear

Synthetic Data Generation for Intersectional Fairness by Leveraging Hierarchical Group Structure

最近の投稿

最近のコメント

アーカイブ

カテゴリー