ogpath
← all areas
learning by doing

learn reinforcement learning

markov decision processes, q-learning, policy gradients, and deep rl in practice.

the curated path

curatedmixed~5 weeks, part-time

reinforcement learning, practically

from the bellman equation to deep rl agents you can train in an afternoon — theory paired with clean, runnable implementations.

4 modules · 12 resources · checkpoint per module

stay current

what's new in reinforcement learning

see the full digest →

want something more specific in reinforcement learning? generate a fresh path.

generate a path