Tags

Tags give the ability to mark specific points in history as being important

rllib-integration

b936841f · Updated README.md · Mar 30, 2020

f9fa840f · Merge branch 'dqn-forces-environment-bellman-bridge' · Jan 11, 2020

DQN agent learns to interact with bridge to reach goal. Gym is force-based. Gym is no longer episodic - instead when reaching the goal, it flips north-south.

Also, many results added about experiments with ANN agent based on a O'reilly URL.
Base point for implementing multi agent interaction and test emergent tool use

0.2

f4adba99 · Merge branch 'policy-gradient-bridge-oreilly-agent' · Jan 06, 2020

Ending experiments with ANN and policy gradients. Final stage before going back to DQN

0.1

bfefd8ac · Updated README.md with roadmap · Dec 12, 2019

v0.0.3

6316b649 · Merge branch 'snake-minigame-style-policy-gradient' · Dec 07, 2019

Added town gym with force-based actions and Policy Gradient trained agent, with complex architecture, from Simonini Thomas course.

Good base point to start creating a non-toy gym, probably using a Physics simulator or a game engine

v0.0.2

7d962138 · Merge branch 'snake-minigame-style-multi-agent' · Oct 03, 2019

Snake minigame-style with fixed goal, fixed initial position. Simple Q learning agent.

It's basically a has-less-bugs version of the 0.0.1, and also reaches the optimum solution at around 5800 iterations

v0.0.1

1bcaecbd · Merge branch 'snake-minigame-style-town-agent' · Oct 02, 2019

Snake minigame-style with fixed goal, random initial position per episode OpenAI gym. Single Q learning agent.