This environment is part of the magent environments. Please read that page first for general information.
|Observation Shape||(9,9,15), (10,10,19)|
The red agents must navigate the obstacles and attack the blue agents. The blue agents should try to avoid being attacked. Since the red agents are slower (but larger) than the blue agents, they must work together to trap the blue agents, so they can attack them continually (note that they blue agent’s won’t die if attacked, so they can be used as an infinite source of reward).
Predator action options:
[do_nothing, move_4, attack_8]
Predator’s reward is given as:
Prey action options:
Prey’s reward is given as:
[empty, obstacle, predators, prey, one_hot_action, last_reward]
Map size: 45x45
attack_penalty: Adds the following value to the reward whenever an attacking action is taken
max_frames: number of frames (a step for each agent) until game terminates