Snake-style minigame with a fixed goal and a fixed initial position, played by a simple Q-learning agent. It is essentially a less buggy version of 0.0.1, and it reaches the optimal solution at around 5800 iterations.
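For readers unfamiliar with the setup, here is a minimal sketch of tabular Q-learning on a grid with a fixed start and a fixed goal. It is not the project's actual code: the grid size, rewards, hyperparameters, and every name (`step`, `train`, `greedy_path`) are assumptions chosen for illustration, and the snake is reduced to a single head cell with no body.

```python
import random

# All constants below are illustrative assumptions, not values from the project.
GRID = 5                                        # board is GRID x GRID
START = (0, 0)                                  # fixed initial position
GOAL = (4, 4)                                   # fixed goal
ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]    # (dx, dy) moves
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.1           # learning rate, discount, exploration


def step(state, action):
    """Apply a move and return (next_state, reward, done)."""
    x, y = state[0] + action[0], state[1] + action[1]
    if not (0 <= x < GRID and 0 <= y < GRID):
        return state, -10.0, True               # hitting a wall ends the episode
    if (x, y) == GOAL:
        return (x, y), 10.0, True               # reaching the goal ends the episode
    return (x, y), -1.0, False                  # small cost per move favours short paths


def train(episodes=5000, seed=0, max_steps=100):
    """Run epsilon-greedy Q-learning and return the Q-table as a dict."""
    rng = random.Random(seed)
    n = len(ACTIONS)
    q = {((x, y), a): 0.0
         for x in range(GRID) for y in range(GRID) for a in range(n)}
    for _ in range(episodes):
        state = START
        for _ in range(max_steps):
            if rng.random() < EPSILON:
                a = rng.randrange(n)            # explore
            else:
                a = max(range(n), key=lambda i: q[(state, i)])  # exploit
            nxt, reward, done = step(state, ACTIONS[a])
            best_next = 0.0 if done else max(q[(nxt, i)] for i in range(n))
            # Standard Q-learning update toward the bootstrapped target.
            q[(state, a)] += ALPHA * (reward + GAMMA * best_next - q[(state, a)])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=50):
    """Follow the highest-valued action from START until the episode ends."""
    state, path = START, [START]
    for _ in range(max_steps):
        a = max(range(len(ACTIONS)), key=lambda i: q[(state, i)])
        state, _, done = step(state, ACTIONS[a])
        path.append(state)
        if done:
            break
    return path
```

Calling `greedy_path(train())` returns the sequence of cells the trained agent visits. The number of episodes needed here depends on the assumed constants and says nothing about the roughly 5800 iterations reported for the actual project.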