「cs.LG」カテゴリーアーカイブ

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

投稿日: 2025年3月14日作成者: jarxiv

要約このペーパーでは、モデル、データ、コードがすべてリリースされたLight- … 続きを読む →

カテゴリー: cs.CL, cs.LG | コメントを受け付けていません

dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis

投稿日: 2025年3月14日作成者: jarxiv

要約 Federated Learningは、医療分野で幅広い用途を持っています … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Similarity Equivariant Graph Neural Networks for Homogenization of Metamaterials

投稿日: 2025年3月14日作成者: jarxiv

要約柔らかく多孔質の機械的メタマテリアルは、柔らかいロボット工学、音の還元、生 … 続きを読む →

カテゴリー: cond-mat.soft, cs.AI, cs.LG | コメントを受け付けていません

Fast MRI for All: Bridging Equity Gaps via Training without Raw Data Access

投稿日: 2025年3月14日作成者: jarxiv

要約物理主導のディープラーニング（PD-DL）アプローチは、高速磁気共鳴画像（ … 続きを読む →

カテゴリー: cs.AI, cs.CV, cs.LG, eess.IV | コメントを受け付けていません

Explainable Bayesian deep learning through input-skip Latent Binary Bayesian Neural Networks

投稿日: 2025年3月14日作成者: jarxiv

要約人工ニューラルネットワーク（ANN）を使用した自然現象のモデリングは、多く … 続きを読む →

カテゴリー: 05A16, 60J22, 62-02, 62-09, 62F07, 62F15, 62J05, 62J12, 62J99, 62M05, 90C27, 90C59, 92D20, cs.AI, cs.LG, G.1.6, stat.CO, stat.ME, stat.ML | コメントを受け付けていません

Confidence-Controlled Exploration: Efficient Sparse-Reward Policy Learning for Robot Navigation

投稿日: 2025年3月14日作成者: jarxiv

要約 Rehnection Learning（RL）は、ロボットナビゲーションの … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation

投稿日: 2025年3月14日作成者: jarxiv

要約 LLMの自己評価は、展開の信頼性を大幅に改善する可能性のある応答の正確性を … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Conformal Prediction Sets for Deep Generative Models via Reduction to Conformal Regression

投稿日: 2025年3月14日作成者: jarxiv

要約特定の入力のブラックボックスディープ生成モデル（テキストプロンプトなど）か … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity

投稿日: 2025年3月14日作成者: jarxiv

要約 LinformerやMambaなどのアーキテクチャは、最近、変圧器の競合的 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.SD, eess.AS | コメントを受け付けていません

What is the Alignment Objective of GRPO?

投稿日: 2025年3月14日作成者: jarxiv

要約このメモでは、Group Policy Optimization（GRPO … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis

Similarity Equivariant Graph Neural Networks for Homogenization of Metamaterials

Fast MRI for All: Bridging Equity Gaps via Training without Raw Data Access

Explainable Bayesian deep learning through input-skip Latent Binary Bayesian Neural Networks

Confidence-Controlled Exploration: Efficient Sparse-Reward Policy Learning for Robot Navigation

Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation

Conformal Prediction Sets for Deep Generative Models via Reduction to Conformal Regression

Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity

What is the Alignment Objective of GRPO?

最近の投稿

最近のコメント

アーカイブ

カテゴリー