投稿者「jarxiv」のアーカイブ

Learning Generalized Hamiltonians using fully Symplectic Mappings

投稿日: 2025年5月26日作成者: jarxiv

要約多くの重要な物理システムは、保守的であるという重要な特性を持っているハミル … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

AFD-STA: Adaptive Filtering Denoising with Spatiotemporal Attention for Chaotic System Prediction

投稿日: 2025年5月26日作成者: jarxiv

要約このホワイトペーパーでは、部分的な微分方程式によって支配された高次元のカオ … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Backpropagation-Free Metropolis-Adjusted Langevin Algorithm

投稿日: 2025年5月26日作成者: jarxiv

要約 Backpropagationのない学習に関する最近の研究により、Forw … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Stable Reinforcement Learning for Efficient Reasoning

投稿日: 2025年5月26日作成者: jarxiv

要約 DeepSeek-R1の成功により、GRPOなどの強化学習（RL）方法に対 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

An alignment safety case sketch based on debate

投稿日: 2025年5月26日作成者: jarxiv

要約 AIシステムが幅広いタスクで人間の能力に一致するか、それを超えると、人間が … 続きを読む →

カテゴリー: cs.AI | コメントを受け付けていません

Data Mixing Can Induce Phase Transitions in Knowledge Acquisition

投稿日: 2025年5月26日作成者: jarxiv

要約大規模な言語モデル（LLM）は通常、データの混合物でトレーニングされていま … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL

投稿日: 2025年5月26日作成者: jarxiv

要約大規模な言語モデル（LLM）は、質問の回答や対話などのタスクで優れています … 続きを読む →

カテゴリー: cs.AI, cs.CL | コメントを受け付けていません

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

投稿日: 2025年5月26日作成者: jarxiv

要約インターネット上の大規模な言語モデル（LLM）ベンチマークを公開することは … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, stat.ME | コメントを受け付けていません

Activated LoRA: Fine-tuned LLMs for Intrinsics

投稿日: 2025年5月26日作成者: jarxiv

要約低ランク適応（LORA）は、大規模な基礎モデルの重みを微調整するための非常 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models

投稿日: 2025年5月26日作成者: jarxiv

要約大規模な言語モデル（LLMS）は、セマンティックパターンの理解と生成におい … 続きを読む →

カテゴリー: cs.AI, cs.IR | コメントを受け付けていません

投稿者「jarxiv」のアーカイブ

Learning Generalized Hamiltonians using fully Symplectic Mappings

AFD-STA: Adaptive Filtering Denoising with Spatiotemporal Attention for Chaotic System Prediction

Backpropagation-Free Metropolis-Adjusted Langevin Algorithm

Stable Reinforcement Learning for Efficient Reasoning

An alignment safety case sketch based on debate

Data Mixing Can Induce Phase Transitions in Knowledge Acquisition

Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

Activated LoRA: Fine-tuned LLMs for Intrinsics

Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models

最近の投稿

最近のコメント

アーカイブ

カテゴリー