A Tutorial Introduction to Reinforcement Learning

要約

タイトル：強化学習のチュートリアル紹介
要約：
– 本論文は、Stochastic Approximation(SA)を統一的なテーマとして、強化学習(Reinforcement Learning)について簡単な概説を行っています。
– 論文の範囲は、マルコフ報酬プロセス、マルコフ決定プロセス、Stochastic Approximationアルゴリズム、Temporal Difference Learningや$Q$-learningなど、広く使用されているアルゴリズムを含みます。

要約(オリジナル)

In this paper, we present a brief survey of Reinforcement Learning (RL), with particular emphasis on Stochastic Approximation (SA) as a unifying theme. The scope of the paper includes Markov Reward Processes, Markov Decision Processes, Stochastic Approximation algorithms, and widely used algorithms such as Temporal Difference Learning and $Q$-learning.

arxiv情報

著者	Mathukumalli Vidyasagar
発行日	2023-04-03 08:50:58+00:00
arxivサイト	arxiv_id(pdf)

提供元, 利用サービス

arxiv.jp, OpenAI

A Tutorial Introduction to Reinforcement Learning

要約

要約(オリジナル)

arxiv情報

提供元, 利用サービス

最近の投稿

最近のコメント

アーカイブ

カテゴリー