This environment is part of the classic environments. Please read that page first for general information.
|Observation Values||[0, 1]|
num_players: Sets the number of players in the game. Minimum is 2.
The observation is a dictionary which contains an 'obs' element, which is the usual RL observation described below, and an 'action_mask' element, which holds the legal moves, described in the Legal Actions Mask section.
The main observation space is a vector of 72 boolean integers. The first 52 entries depict the current player’s hand plus any community cards, and the remaining 20 entries record the chips raised in each of the four betting rounds, as follows:
|0 - 12||Spades|
|13 - 25||Hearts|
|26 - 38||Diamonds|
|39 - 51||Clubs|
|52 - 56||Chips raised in Round 1
|57 - 61||Chips raised in Round 2
|62 - 66||Chips raised in Round 3
|67 - 71||Chips raised in Round 4
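The index layout in the table above can be sketched as a small helper that splits a 72-entry vector into its card and chip segments. This is only an illustration: `split_observation`, `SUIT_RANGES`, and `ROUND_RANGES` are hypothetical names, not part of the environment's API, and the helper reports each card as an offset within its suit's index range without assuming which rank that offset stands for.

```python
from typing import Dict, List, Sequence, Tuple

# Index ranges copied from the table above (end index is exclusive here).
SUIT_RANGES: Dict[str, range] = {
    "Spades": range(0, 13),
    "Hearts": range(13, 26),
    "Diamonds": range(26, 39),
    "Clubs": range(39, 52),
}
ROUND_RANGES: Dict[int, range] = {
    1: range(52, 57),
    2: range(57, 62),
    3: range(62, 67),
    4: range(67, 72),
}


def split_observation(
    obs: Sequence[int],
) -> Tuple[Dict[str, List[int]], Dict[int, List[int]]]:
    """Split a 72-entry observation into per-suit cards and per-round chip flags.

    Hypothetical helper for illustration only.
    """
    if len(obs) != 72:
        raise ValueError(f"expected 72 entries, got {len(obs)}")
    # For each suit, list the offsets (0..12) of the entries that are set.
    cards = {
        suit: [i - idx.start for i in idx if obs[i]]
        for suit, idx in SUIT_RANGES.items()
    }
    # For each betting round, return the raw 5-entry slice unchanged.
    chips = {rnd: [int(obs[i]) for i in idx] for rnd, idx in ROUND_RANGES.items()}
    return cards, chips


# Usage with a hand-built vector (not real environment output).
example = [0] * 72
example[0] = 1    # first Spades entry
example[14] = 1   # second Hearts entry
example[53] = 1   # second entry of the Round 1 segment
cards, chips = split_observation(example)
```

Returning the chip segments as raw slices keeps the sketch neutral about how the five entries per round encode the raised amount.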
The legal moves available to the current agent are found in the action_mask element of the dictionary observation. The action_mask is a binary vector where each index of the vector represents whether the action is legal or not. The action_mask will be all zeros for any agent except the one whose turn it is. Taking an illegal move ends the game with a reward of -1 for the illegally moving agent and a reward of 0 for all other agents.
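Because an illegal move ends the game with a penalty, a policy would typically restrict its choice to indices where the mask is set. A minimal sketch, assuming the observation is a plain dictionary with the 'obs' and 'action_mask' keys described above; `sample_legal_action` is a hypothetical helper, and the mask length of 4 in the usage lines is chosen only for illustration:

```python
import random
from typing import Optional, Sequence


def sample_legal_action(
    action_mask: Sequence[int], rng: Optional[random.Random] = None
) -> Optional[int]:
    """Pick a uniformly random legal action index, or None if none is legal.

    Hypothetical helper for illustration only.
    """
    rng = rng or random.Random()
    # Indices whose mask entry is non-zero are the legal actions.
    legal = [index for index, flag in enumerate(action_mask) if flag]
    if not legal:
        # An all-zero mask means it is not this agent's turn.
        return None
    return rng.choice(legal)


# Usage with a hand-built observation (not real environment output).
observation = {"obs": [0] * 72, "action_mask": [1, 0, 1, 0]}
action = sample_legal_action(observation["action_mask"], random.Random(0))
```

Returning `None` for an all-zero mask lets the caller distinguish "not my turn" from a real action index.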
|Winner||Loser|
|+raised chips/2||-raised chips/2|
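The reward entries above amount to simple arithmetic, sketched below. The sketch assumes the positive entry is the winning player's reward and the negative entry is the losing player's; `showdown_reward` is a hypothetical name used only for this illustration.

```python
def showdown_reward(raised_chips: float, won: bool) -> float:
    """Return +raised_chips/2 for a win and -raised_chips/2 for a loss.

    Hypothetical helper; which player receives which sign is an assumption.
    """
    half = raised_chips / 2
    return half if won else -half


# Usage: with 4 raised chips, the two rewards are symmetric around zero.
winner_reward = showdown_reward(4, won=True)
loser_reward = showdown_reward(4, won=False)
```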