Reinforcement learning (RL)
Antonio Celani
Program
- Markov reward processes
- Decision making and learning
- Markov decision processes
- Stochastic approximation and optimization
- Learning with a critic
- Actor-only and actor-critic algorithms
- Value function approximation
- Partially observable Markov decision processes
- Sequential allocation problems and bandits
Evaluation
References