-
Notifications
You must be signed in to change notification settings - Fork 3
Open
Description
Reinforcement learning
- Theoretical bias
- Planning and game trees
Game tree approach to Pacman - Reinforcement learning, table-based Q-learning, Bellman equation.
Stanford example with robot, pacman and gridworld.
Deep Reinforcement learning
- Approximate reinforcement learning. Deep reinforcement learning. Caveats and pitfalls
Playing atari with deep reinforcement learning. - POMDP and recurrent RL.
Kung Fu and PBS cases: comparing DQN from previous seminar with DRQN. - [Sparse & delayed rewards. Hierarchical RL. Model-based Vs model-free options]?
[Regular VS Hierarchical RL in Montezuma Revenge-like games]
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels