OpenAlex
Value Iteration and Rolling Plans for Markov Control Processes with Unbounded Rewards
Work