Deep Q-learning (DQN) for Multi-agent Reinforcement Learning (RL)

DQN implementation for two multi-agent environments: agents_landmarks and predators_prey (See details.pdf for a detailed description of these environments).

Code structure

Results

Predators and Prey Environment

In this environment, the prey is captured when one predator moves to the location of the prey while the other predators occupy, for support, the neighboring cells of the prey's location.

Fixed prey (mode 0)
Random prey (mode 1)
Random escaping prey (mode 2)

Agents and Landmarks Environment

10 agents and 10 landmarks
16 agents and 16 landmarks

Todos