OpenAlex
On the Expected Total Reward with Unbounded Returns for Markov Decision Processes
Work