sykwer/pg_pong

Train ATARI pong agent by stochastic policy gradient method from raw playing images. - sykwer/pg_pong