「cs.LG」カテゴリーアーカイブ

Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers

投稿日: 2025年6月13日作成者: jarxiv

要約大規模な言語モデル（LLM）は微調整を通じて新しい知識を獲得できますが、こ … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization

投稿日: 2025年6月13日作成者: jarxiv

要約機械的解釈可能性の中心的な目標は、その出力を因果的に説明する大規模な言語モ … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Improving LLM Safety Alignment with Dual-Objective Optimization

投稿日: 2025年6月13日作成者: jarxiv

要約大規模な言語モデル（LLM）の既存のトレーニング時間安全アライメント手法は … 続きを読む →

カテゴリー: cs.CL, cs.CR, cs.LG | コメントを受け付けていません

Build the web for agents, not agents for the web

投稿日: 2025年6月13日作成者: jarxiv

要約大規模な言語モデル（LLMS）とマルチモーダルのカウンターパートの最近の進 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

On the Geometry of Receiver Operating Characteristic and Precision-Recall Curves

投稿日: 2025年6月13日作成者: jarxiv

要約バイナリ分類問題における受信機動作特性（ROC）および精密リコール（PR） … 続きを読む →

カテゴリー: cs.AI, cs.LG, math.ST, stat.ML, stat.TH | コメントを受け付けていません

Efficiency Robustness of Dynamic Deep Learning Systems

投稿日: 2025年6月13日作成者: jarxiv

要約ディープラーニングシステム（DLSS）は、モバイルデバイスやIoTデバイス … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles

投稿日: 2025年6月13日作成者: jarxiv

要約拡散ベースの言語モデル（DLLM）は、並列トークンの生成を有効にし、推論潜 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Sample Complexity and Representation Ability of Test-time Scaling Paradigms

投稿日: 2025年6月13日作成者: jarxiv

要約テスト時間スケーリングパラダイムは、複雑なタスク上の大規模な言語モデル（L … 続きを読む →

カテゴリー: cs.AI, cs.LG, stat.ML | コメントを受け付けていません

Multi-group Uncertainty Quantification for Long-form Text Generation

投稿日: 2025年6月13日作成者: jarxiv

要約過去の作品は、不確実性の定量化を大規模な言語モデル（LLM）出力にどのよう … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Data-Driven Prediction of Dynamic Interactions Between Robot Appendage and Granular Material

投稿日: 2025年6月13日作成者: jarxiv

要約特定の長さのスケールでの粒状地形とのロボット運動相互作用に関する基本的な洞 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.NA, cs.RO, math.NA | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers

Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization

Improving LLM Safety Alignment with Dual-Objective Optimization

Build the web for agents, not agents for the web

On the Geometry of Receiver Operating Characteristic and Precision-Recall Curves

Efficiency Robustness of Dynamic Deep Learning Systems

Accelerating Diffusion Large Language Models with SlowFast: The Three Golden Principles

Sample Complexity and Representation Ability of Test-time Scaling Paradigms

Multi-group Uncertainty Quantification for Long-form Text Generation

Data-Driven Prediction of Dynamic Interactions Between Robot Appendage and Granular Material

最近の投稿

最近のコメント

アーカイブ

カテゴリー