Q-learning is an off-policy reinforcement learning algorithm popularized by Deep Q-Networks (DQN), which learned to play Atari games directly from screen pixels. Deep reinforcement learning more broadly has since been applied to games such as Go and Dota 2. OpenAI hosts a collection of environments, through its Gym toolkit, for reinforcement learning models to play around with. This post gives an example of how to implement a Q-learning algorithm for Atari games.
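
To make the "off-policy" part concrete before getting to Atari, here is a minimal sketch of the standard tabular Q-learning update. It is not the deep network version: the function name `q_update`, the toy 3-state table, and the values chosen for `alpha` (learning rate) and `gamma` (discount factor) are all assumptions made for illustration.

```python
import numpy as np

def q_update(q_table, state, action, reward, next_state, done,
             alpha=0.1, gamma=0.99):
    """Apply one tabular Q-learning update in place and return the TD error."""
    # Off-policy target: bootstrap from the greedy (max) next action,
    # regardless of which action the behaviour policy actually takes next.
    best_next = 0.0 if done else np.max(q_table[next_state])
    td_error = reward + gamma * best_next - q_table[state, action]
    q_table[state, action] += alpha * td_error
    return td_error

# Hypothetical toy problem: 3 states, 2 actions, values start at zero.
q = np.zeros((3, 2))
q[1] = [0.5, 2.0]  # pretend state 1 already has some learned values

# One transition: in state 0, take action 1, receive reward 1.0, land in state 1.
q_update(q, state=0, action=1, reward=1.0, next_state=1, done=False)
print(q[0, 1])
```

The update uses the best value available in the next state even if the agent explores with a different action, which is what makes Q-learning off-policy. A Deep Q-Network replaces the table with a neural network that estimates the same values.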