What the hell is reinforcement learning and how does it work?