「cs.LG」カテゴリーアーカイブ

EuroLLM-9B: Technical Report

投稿日: 2025年6月5日作成者: jarxiv

要約このレポートは、24の公式欧州連合言語すべてと11の追加言語をカバーするこ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment

投稿日: 2025年6月5日作成者: jarxiv

要約具体化されたエージェントの一部として、ユーザーからの自然言語の指示を考慮し … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG, cs.RO | コメントを受け付けていません

Optimizing Sensory Neurons: Nonlinear Attention Mechanisms for Accelerated Convergence in Permutation-Invariant Neural Networks for Reinforcement Learning

投稿日: 2025年6月5日作成者: jarxiv

要約トレーニング強化学習（RL）エージェントには、多くの場合、重要な計算リソー … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

TextAtari: 100K Frames Game Playing with Language Agents

投稿日: 2025年6月5日作成者: jarxiv

要約 TextAtariは、最大100,000のステップにまたがる非常に長期の意 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Engagement-Driven Content Generation with Large Language Models

投稿日: 2025年6月5日作成者: jarxiv

要約大規模な言語モデル（LLMS）は、1対1の相互作用において重要な説得力のあ … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Balancing Profit and Fairness in Risk-Based Pricing Markets

投稿日: 2025年6月5日作成者: jarxiv

要約動的でリスクベースの価格設定は、健康保険や消費者クレジットなどの重要なリソ … 続きを読む →

カテゴリー: cs.AI, cs.LG, econ.GN, q-fin.EC | コメントを受け付けていません

CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom Dialogues

投稿日: 2025年6月5日作成者: jarxiv

要約法廷は、命が決定され、運命が封印される場所であるが、操作は不浸透ではない。 … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probability Theory

投稿日: 2025年6月5日作成者: jarxiv

要約最近、大規模な言語モデル（LLM）でのスケーリングテスト時間コンピューティ … 続きを読む →

カテゴリー: cs.AI, cs.CL, cs.LG | コメントを受け付けていません

SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL

投稿日: 2025年6月5日作成者: jarxiv

要約有能な家庭用および産業ロボットを建設するには、モバイルマニピュレーターなど … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.RO | コメントを受け付けていません

Horizon Reduction Makes RL Scalable

投稿日: 2025年6月5日作成者: jarxiv

要約この作業では、オフライン強化学習（RL）アルゴリズムのスケーラビリティを研 … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

「cs.LG」カテゴリーアーカイブ

EuroLLM-9B: Technical Report

AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment

Optimizing Sensory Neurons: Nonlinear Attention Mechanisms for Accelerated Convergence in Permutation-Invariant Neural Networks for Reinforcement Learning

TextAtari: 100K Frames Game Playing with Language Agents

Engagement-Driven Content Generation with Large Language Models

Balancing Profit and Fairness in Risk-Based Pricing Markets

CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom Dialogues

Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probability Theory

SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL

Horizon Reduction Makes RL Scalable

最近の投稿

最近のコメント

アーカイブ

カテゴリー