Monthly Archives: April 2025

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Posted on April 7, 2025 by jarxiv

Abstract: Reinforcement learning (RL) has recently shown strong potential for improving the reasoning capabilities of large language models …

Categories: cs.CL, cs.CV, cs.LG

Computing High-dimensional Confidence Sets for Arbitrary Distributions

Posted on April 4, 2025 by jarxiv

Abstract: We study the problem of learning a high-density region of an arbitrary distribution over $\mathbb{R}^d$ …

Categories: cs.DS, cs.LG, math.ST, stat.ML, stat.TH

Reservoir Computing: A New Paradigm for Neural Networks

Posted on April 4, 2025 by jarxiv

Abstract: A literature review of reservoir computing. Artificial intelligence, a field of computational science …

Categories: cs.LG

A Dynamic, Ordinal Gaussian Process Item Response Theoretic Model

Posted on April 4, 2025 by jarxiv

Abstract: Social scientists often use ordinal indicators to estimate latent traits that change over time …

Categories: cs.LG, stat.ME

Solving the Paint Shop Problem with Flexible Management of Multi-Lane Buffers Using Reinforcement Learning and Action Masking

Posted on April 4, 2025 by jarxiv

Abstract: In the paint shop problem, an unordered incoming sequence of cars assigned to different colors …

Categories: cs.LG, math.OC

MiLo: Efficient Quantized MoE Inference with Mixture of Low-Rank Compensators

Posted on April 4, 2025 by jarxiv

Abstract: Mixture-of-Experts (MoE) models with enormous numbers of parameters …

Categories: cs.LG

When Can You Trust Your Explanations? A Robustness Analysis on Feature Importances

Posted on April 4, 2025 by jarxiv

Abstract: Recent legal regulations have underscored the need for explainable and transparent artificial intelligence systems …

Categories: cs.LG

Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research

Posted on April 4, 2025 by jarxiv

Abstract: Reinforcement learning (RL), for addressing problems in operations research, …

Categories: cs.LG, math.OC

End-To-End Self-Tuning Self-Supervised Time Series Anomaly Detection

Posted on April 4, 2025 by jarxiv

Abstract: Time series anomaly detection (TSAD), in environmental sensors, industrial KPIs, patient biomarkers …

Categories: cs.LG

Compositionality Unlocks Deep Interpretable Models

Posted on April 4, 2025 by jarxiv

Abstract: We propose $\chi$-net. $\chi$-net: tensor network …

Categories: cs.LG
