This environment is part of the magent environments. Please read that page first for general information.
In gather, the agents must gain reward by eating food or fighting each other. Agent’s don’t die unless attacked. You expect to see that agents coordinate by not attacking each other until food is scarce.
[do_nothing, move_28, attack_4]
Reward is given as:
[empty, obstacle, omnivore, food, omnivore_minimap, food_minimap, one_hot_action, last_reward, agent_position]
Map size: 200x200
gather_v1.env(step_reward=-0.01, attack_penalty=-0.1, dead_penalty=-1, attack_food_reward=0.5, max_frames=500)
step_reward: reward added unconditionally
dead_penalty: reward added when killed
attack_penalty: reward added for attacking
attack_food_reward: Reward added for attacking a food
max_frames: number of frames (a step for each agent) until game terminates