月別アーカイブ: 2023年2月

CAMEO: Curiosity Augmented Metropolis for Exploratory Optimal Policies

投稿日: 2023年2月16日作成者: jarxiv

要約強化学習は、最適制御問題を解決するためのツールとして大きな関心を集めていま … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Online Double Oracle

投稿日: 2023年2月16日作成者: jarxiv

要約巨大なアクションスペースを使用して戦略的なゲームを解決することは、経済学 … 続きを読む →

カテゴリー: cs.AI, cs.GT | コメントを受け付けていません

Exploiting No-Regret Algorithms in System Design

投稿日: 2023年2月16日作成者: jarxiv

要約コラムプレイヤーがシステムの設計者でもあり、ペイオフマトリックスの設計 … 続きを読む →

カテゴリー: cs.AI, cs.GT | コメントを受け付けていません

Word class representations spontaneously emerge in a deep neural network trained on next word prediction

投稿日: 2023年2月16日作成者: jarxiv

要約人間はどのようにして言語を習得するのでしょうか?最初の言語はそもそも習得で … 続きを読む →

カテゴリー: cs.AI, cs.CL, q-bio.NC | コメントを受け付けていません

Deep Learning for Hybrid Beamforming with Finite Feedback in GSM Aided mmWave MIMO Systems

投稿日: 2023年2月16日作成者: jarxiv

要約ハイブリッドビームフォーミングは、ミリ波 (mmWave) 多入力多出力 … 続きを読む →

カテゴリー: cs.AI, cs.IT, eess.SP, math.IT | コメントを受け付けていません

Frameworks for SNNs: a Review of Data Science-oriented Software and an Expansion of SpykeTorch

投稿日: 2023年2月16日作成者: jarxiv

要約ニューロモルフィック (NM) 分野で機械学習 (ML) アプリケーション … 続きを読む →

カテゴリー: cs.AI, cs.NE, cs.SE | コメントを受け付けていません

Efficient Online Reinforcement Learning with Offline Data

投稿日: 2023年2月16日作成者: jarxiv

要約サンプルの効率と探索は、オンライン強化学習 (RL) における主要な課題の … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

Reinforcement Learning Based Power Grid Day-Ahead Planning and AI-Assisted Control

投稿日: 2023年2月16日作成者: jarxiv

要約再生可能エネルギーへの継続的な移行により、風力や太陽光などの変動する電源の … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.SY, eess.SY | コメントを受け付けていません

Genetic multi-armed bandits: a reinforcement learning approach for discrete optimization via simulation

投稿日: 2023年2月16日作成者: jarxiv

要約この論文では、GMAB と呼ばれる新しいアルゴリズムを提案します。このアル … 続きを読む →

カテゴリー: cs.AI, cs.LG, cs.NE, econ.GN, math.OC, q-fin.EC | コメントを受け付けていません

Prioritized offline Goal-swapping Experience Replay

投稿日: 2023年2月16日作成者: jarxiv

要約目標条件付きオフライン強化学習では、エージェントは以前に収集されたデータか … 続きを読む →

カテゴリー: cs.AI, cs.LG | コメントを受け付けていません

月別アーカイブ: 2023年2月

CAMEO: Curiosity Augmented Metropolis for Exploratory Optimal Policies

Online Double Oracle

Exploiting No-Regret Algorithms in System Design

Word class representations spontaneously emerge in a deep neural network trained on next word prediction

Deep Learning for Hybrid Beamforming with Finite Feedback in GSM Aided mmWave MIMO Systems

Frameworks for SNNs: a Review of Data Science-oriented Software and an Expansion of SpykeTorch

Efficient Online Reinforcement Learning with Offline Data

Reinforcement Learning Based Power Grid Day-Ahead Planning and AI-Assisted Control

Genetic multi-armed bandits: a reinforcement learning approach for discrete optimization via simulation

Prioritized offline Goal-swapping Experience Replay

最近の投稿

最近のコメント

アーカイブ

カテゴリー