"cs.LG" Category Archive

Multi-objective Good Arm Identification with Bandit Feedback

Posted: March 17, 2025 | Author: jarxiv

Abstract: We study the problem of good arm identification in a stochastic bandit setting with multiple objectives …

Category: cs.LG

Deep Learning Agents Trained For Avoidance Behave Like Hawks And Doves

Posted: March 17, 2025 | Author: jarxiv

Abstract: Deep learning agents playing a simple avoidance game exhibit …

Category: cs.LG

Tests for model misspecification in simulation-based inference: from local distortions to global model checks

Posted: March 17, 2025 | Author: jarxiv

Abstract: Model misspecification analysis strategies, such as anomaly detection, model validation, and model comparison, in scientific model …

Categories: astro-ph.CO, astro-ph.IM, cs.LG, gr-qc

Make Optimization Once and for All with Fine-grained Guidance

Posted: March 17, 2025 | Author: jarxiv

Abstract: Learning to Optimize (L2O) uses integrated neural networks …

Categories: 68Q32, cs.LG, I.2

In Shift and In Variance: Assessing the Robustness of HAR Deep Learning Models against Variability

Posted: March 17, 2025 | Author: jarxiv

Abstract: Using wearable inertial measurement unit (IMU) sensors, human activity recognition (H …

Categories: cs.HC, cs.LG, eess.SP

Dynamic Obstacle Avoidance with Bounded Rationality Adversarial Reinforcement Learning

Posted: March 17, 2025 | Author: jarxiv

Abstract: Reinforcement Learning (RL), for the stable locomotion of legged robots …

Categories: cs.LG, cs.RO

A Real-World Energy Management Dataset from a Smart Company Building for Optimization and Machine Learning

Posted: March 17, 2025 | Author: jarxiv

Abstract: Obtained from six years of monitoring a smart company facility, from 2018 to 2023, a large …

Categories: cs.LG, cs.SY, eess.SY

NeuMC — a package for neural sampling for lattice field theories

Posted: March 17, 2025 | Author: jarxiv

Abstract: The PyTorch-based neumc software …

Categories: 68T07, cs.LG, hep-lat, J.2

A Review of DeepSeek Models’ Key Innovative Techniques

Posted: March 17, 2025 | Author: jarxiv

Abstract: DeepSeek-V3 and DeepSeek-R1, for general-purpose tasks and reasoning …

Category: cs.LG

Reinforcement Learning with Verifiable Rewards: GRPO’s Effective Loss, Dynamics, and Success Amplification

Posted: March 17, 2025 | Author: jarxiv

Abstract: Group Relative Policy Optimization (GRPO) was introduced, and verifiable or binary …

Categories: cs.LG, stat.ML
