「cs.LG」カテゴリーアーカイブ

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

投稿日: 2025年4月4日作成者: jarxiv

要約大規模言語モデル（LLM）の学習には、勾配の不安定性や損失スパイクなど、多 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Token-Driven GammaTune: Adaptive Calibration for Enhanced Speculative Decoding

投稿日: 2025年4月4日作成者: jarxiv

要約投機的復号は、より小さなドラフトモデルを使ってトークンを提案し、それをより … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

How to Train Long-Context Language Models (Effectively)

投稿日: 2025年4月4日作成者: jarxiv

要約我々は、ロングコンテクスト情報を効果的に利用するための言語モデル(LM)の … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Efficient LLM Inference using Dynamic Input Pruning and Cache-Aware Masking

投稿日: 2025年4月4日作成者: jarxiv

要約モバイル・デバイスの計算能力はますます向上しているが、DRAM帯域幅の改善 … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Multi-Modal Framing Analysis of News

投稿日: 2025年4月4日作成者: jarxiv

要約政治的コミュニケーションの自動フレーム分析は、計算社会科学において人気のあ … 続きを読む →

カテゴリー: cs.CL, cs.CY, cs.LG | コメントを受け付けていません

Reasoning Inconsistencies and How to Mitigate Them in Deep Learning

投稿日: 2025年4月4日作成者: jarxiv

要約近年のディープラーニングモデルと技術の進歩により、多様なタスクやモダリティ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.LO | コメントを受け付けていません

Knowledge Graph Completion with Mixed Geometry Tensor Factorization

投稿日: 2025年4月4日作成者: jarxiv

要約本論文では、低ランクテンソル近似による知識グラフ補完のための新しい幾何学的 … 続きを読む →

カテゴリー: cs.AI, cs.IR, cs.LG, stat.ML | コメントを受け付けていません

Iterated $Q$-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning

投稿日: 2025年4月4日作成者: jarxiv

要約強化学習法の大部分は、作用値関数の効果的な推定を得るために必要な計算量とデ … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Improving Counterfactual Truthfulness for Molecular Property Prediction through Uncertainty Quantification

投稿日: 2025年4月4日作成者: jarxiv

要約説明可能なAI（xAI）の介入は、複雑なブラックボックスモデルの解釈可能性 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Learning Geometrically-Informed Lyapunov Functions with Deep Diffeomorphic RBF Networks

投稿日: 2025年4月4日作成者: jarxiv

要約学習ベースの自律システムの実用化には、データから証明関数の形で安全保証を柔 … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.SY, eess.SY | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Token-Driven GammaTune: Adaptive Calibration for Enhanced Speculative Decoding

How to Train Long-Context Language Models (Effectively)

Efficient LLM Inference using Dynamic Input Pruning and Cache-Aware Masking

Multi-Modal Framing Analysis of News

Reasoning Inconsistencies and How to Mitigate Them in Deep Learning

Knowledge Graph Completion with Mixed Geometry Tensor Factorization

Iterated $Q$-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning

Improving Counterfactual Truthfulness for Molecular Property Prediction through Uncertainty Quantification

Learning Geometrically-Informed Lyapunov Functions with Deep Diffeomorphic RBF Networks

最近の投稿

最近のコメント

アーカイブ

カテゴリー