Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques
Work
Year: 2014
Type: article
Author: Abhijit Gosavi
Institution: Missouri University of Science and Technology
Cites: 47
Cited by: 19
Related to: 10
FWCI: 1.984
Citation percentile (by year/subfield): 97.03
Subfield: Artificial Intelligence
Field: Computer Science
Domain: Physical Sciences
Sustainable Development Goal: Peace, justice, and strong institutions
Open Access status: closed