「cs.LG」カテゴリーアーカイブ

Synthesizing Diverse Network Flow Datasets with Scalable Dynamic Multigraph Generation

投稿日: 2025年5月13日作成者: jarxiv

要約プライバシー、セキュリティ、および計算上の制約のため、実際のネットワークデ … 続きを読む →

カテゴリー: cs.LG, cs.NI | コメントを受け付けていません

MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

投稿日: 2025年5月13日作成者: jarxiv

要約 Iterative Machine Learning Engineerin … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Relative Overfitting and Accept-Reject Framework

投稿日: 2025年5月13日作成者: jarxiv

要約現在、大規模な言語モデル（LLMS）のスケーリング法則は、課題とボトルネッ … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Analytic theory of dropout regularization

投稿日: 2025年5月13日作成者: jarxiv

要約ドロップアウトは、過剰適合を緩和するために人工ニューラルネットワークのトレ … 続きを読む →

カテゴリー: cond-mat.dis-nn, cond-mat.stat-mech, cs.LG, stat.ML | コメントを受け付けていません

A Theoretical Framework for Explaining Reinforcement Learning with Shapley Values

投稿日: 2025年5月13日作成者: jarxiv

要約強化学習エージェントは超人的なパフォーマンスを達成できますが、彼らの決定は … 続きを読む →

カテゴリー: cs.LG | コメントを受け付けていません

Automatically Differentiable Model Updating (ADiMU): conventional, hybrid, and neural network material model discovery including history-dependency

投稿日: 2025年5月13日作成者: jarxiv

要約フルフィールドの変位とグローバルな力データ（グローバル、間接発見）またはひ … 続きを読む →

カテゴリー: cs.LG, cs.NA, math.NA, physics.comp-ph | コメントを受け付けていません

Understanding Stragglers in Large Model Training Using What-if Analysis

投稿日: 2025年5月13日作成者: jarxiv

要約大規模な言語モデル（LLM）トレーニングは、今日最も要求の厳しい分散計算の … 続きを読む →

カテゴリー: cs.DC, cs.LG | コメントを受け付けていません

Semantic Retention and Extreme Compression in LLMs: Can We Have Both?

投稿日: 2025年5月13日作成者: jarxiv

要約大規模な言語モデル（LLM）の展開における指数関数的な成長により、計算コス … 続きを読む →

カテゴリー: (Primary), 68T50, cs.AI, cs.CL, cs.LG, I.2.6 | コメントを受け付けていません

HREB-CRF: Hierarchical Reduced-bias EMA for Chinese Named Entity Recognition

投稿日: 2025年5月13日作成者: jarxiv

要約誤った境界区分、複雑な意味表現、および発音と意味の違いは、しばしば中国の名 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

A Statistical Case Against Empirical Human-AI Alignment

投稿日: 2025年5月13日作成者: jarxiv

要約経験的な人間とaiの調整は、観察された人間の行動に沿ってAIシステムを行動 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, stat.OT | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Synthesizing Diverse Network Flow Datasets with Scalable Dynamic Multigraph Generation

MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

Relative Overfitting and Accept-Reject Framework

Analytic theory of dropout regularization

A Theoretical Framework for Explaining Reinforcement Learning with Shapley Values

Automatically Differentiable Model Updating (ADiMU): conventional, hybrid, and neural network material model discovery including history-dependency

Understanding Stragglers in Large Model Training Using What-if Analysis

Semantic Retention and Extreme Compression in LLMs: Can We Have Both?

HREB-CRF: Hierarchical Reduced-bias EMA for Chinese Named Entity Recognition

A Statistical Case Against Empirical Human-AI Alignment

最近の投稿

最近のコメント

アーカイブ

カテゴリー