#ai #computing #reinforcement-learning Brought the best of the two world together: policy learning & q learning merged