「cs.LG」カテゴリーアーカイブ

How do language models learn facts? Dynamics, curricula and hallucinations

投稿日: 2025年3月28日作成者: jarxiv

要約大規模な言語モデルは、トレーニング前に膨大な知識を蓄積しますが、この獲得を … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Learning to Represent Individual Differences for Choice Decision Making

投稿日: 2025年3月28日作成者: jarxiv

要約意思決定は多くの複雑な要因の影響を受けるため、人間の意思決定は予測するのが … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

Effective Skill Unlearning through Intervention and Abstention

投稿日: 2025年3月28日作成者: jarxiv

要約大規模な言語モデル（LLM）は、さまざまなドメインで顕著なスキルを示してい … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

SWI: Speaking with Intent in Large Language Models

投稿日: 2025年3月28日作成者: jarxiv

要約通常、明確に定式化され計画されている意図は、推論と問題解決のための認知フレ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, I.2.7 | コメントを受け付けていません

Critical Iterative Denoising: A Discrete Generative Model Applied to Graphs

投稿日: 2025年3月28日作成者: jarxiv

要約離散拡散モデルとフローマッチングモデルには、グラフを含む離散構造の生成モデ … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers

投稿日: 2025年3月28日作成者: jarxiv

要約拡散変圧器（DITS）は、最先端の（SOTA）画像生成の品質を達成しました … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

When Astronomy Meets AI: Manazel For Crescent Visibility Prediction in Morocco

投稿日: 2025年3月28日作成者: jarxiv

要約各ヒジュリ月の初めの正確な決定は、宗教的、文化的、および行政目的に不可欠で … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG | コメントを受け付けていません

Towards Fully Automated Decision-Making Systems for Greenhouse Control: Challenges and Opportunities

投稿日: 2025年3月28日作成者: jarxiv

要約機械学習は、複雑なシステムをさまざまなアプリケーション（ゲーム、ロボットな … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Model Assembly Learning with Heterogeneous Layer Weight Merging

投稿日: 2025年3月28日作成者: jarxiv

要約モデルのマージは、複数のモデルのパラメーターを組み合わせることにより、追加 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

TSKANMixer: Kolmogorov-Arnold Networks with MLP-Mixer Model for Time Series Forecasting

投稿日: 2025年3月28日作成者: jarxiv

要約時系列の予測は、経済学、エネルギー、ヘルスケア、交通管理など、多様な分野で … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

How do language models learn facts? Dynamics, curricula and hallucinations

Learning to Represent Individual Differences for Choice Decision Making

Effective Skill Unlearning through Intervention and Abstention

SWI: Speaking with Intent in Large Language Models

Critical Iterative Denoising: A Discrete Generative Model Applied to Graphs

Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers

When Astronomy Meets AI: Manazel For Crescent Visibility Prediction in Morocco

Towards Fully Automated Decision-Making Systems for Greenhouse Control: Challenges and Opportunities

Model Assembly Learning with Heterogeneous Layer Weight Merging

TSKANMixer: Kolmogorov-Arnold Networks with MLP-Mixer Model for Time Series Forecasting

最近の投稿

最近のコメント

アーカイブ

カテゴリー