v0.3 · Tags · Rubén Montero / Town Survival RL Simulator

v0.3

f9fa840f · Merge branch 'dqn-forces-environment-bellman-bridge' · Jan 11, 2020

DQN agent learns to interact with bridge to reach goal. Gym is force-based. Gym is no longer episodic - instead when reaching the goal, it flips north-south.

Also, many results added about experiments with ANN agent based on a O'reilly URL.
Base point for implementing multi agent interaction and test emergent tool use