Word Chains Environment Documentation
Overview
Word Chains is a turn-based word game where two players take turns providing valid English words that start with the last letter of the previous word. The game enforces rules such as no word repetition and requires valid English words. Players lose if they fail to provide a valid word.
Action Space
- Format: Actions are strings representing a word, enclosed in square brackets.
- Example:
[apple]
[monkey]
[yellow]
- Notes:
- Words must be wrapped in square brackets.
- The word must start with the last letter of the previously played word.
- Words cannot be repeated.
Observation Space
Observations
Players receive a message specifying the last played word, the letter required for the next word, and the history of used words.
Reset Observation:
[GAME]: Game started. The starting word is [banana]. Please provide the next word. On your turn, simply type your word.
Step Observation:
Player 0: [apple] [GAME]: Player 0 played the word: [apple]. Next word must start with 'e'. Player 1: [event] [GAME]: Player 1 played the word: [event]. Next word must start with 't'.
Gameplay
- Players: 2
- Turns: Players alternate turns, providing a word each time.
- Move Format: Words must be wrapped in square brackets and start with the correct letter.
- Objective: The game continues until a player fails to provide a valid word.
- Turn Limit: The game ends in a draw if the maximum number of turns is reached.
Key Rules
-
Word Selection:
- The word must start with the last letter of the previous word.
- Example: If the previous word is
[table]
, the next word must start withe
.
-
Valid Words:
- Words must be in standard English.
- Words are checked against an English dictionary.
-
No Repeats:
- Words cannot be reused.
- Example: If
[apple]
was played, it cannot be played again.
-
Game Termination:
- If a player provides an invalid word, they lose.
- If the turn limit is reached, the game ends in a draw.
Rewards
Outcome | Reward for Player | Reward for Opponent |
---|---|---|
Win | +1 | -1 |
Lose | -1 | +1 |
Draw | 0 | 0 |
Invalid Move | -1 | 0 |
Parameters
max_turns
(int
):- Description: Specifies the maximum number of turns allowed before the game ends automatically.
- Impact: Limits the duration of the game, preventing infinite loops.
Variants
Env-id | max_turns |
---|---|
WordChains-v0 | 100 |
WordChains-v0-infinite | None |
Example Usage
import textarena as ta # Initialize the environment env = ta.make(env_id="WordChains-v0") # Wrap the environment for easier observation handling env = ta.wrappers.LLMObservationWrapper(env=env) # Initialize agents agents = { 0: ta.basic_agents.OpenRouter(model_name="gpt-4o"), 1: ta.basic_agents.OpenRouter(model_name="gpt-4o-mini") } # Reset the environment env.reset(seed=490) done = False while not done: player_id, observation = env.get_observation() action = agents[player_id](observation) done, info = env.step(action=action) rewards = env.close()
Troubleshooting
-
Invalid Move Format:
- Issue: Player provides a move that doesn't follow
[word]
format. - Solution: Ensure that words are enclosed in square brackets.
- Issue: Player provides a move that doesn't follow
-
Invalid Words:
- Issue: Player attempts to use an invalid English word.
- Solution: Use words that exist in standard English dictionaries.
-
Repeated Words:
- Issue: Player attempts to reuse a previously played word.
- Solution: Ensure that each word is unique during the game.
Contact
If you have questions or face issues with this specific environment, please reach out to Guertlerlo@cfar.a-star.edu.sg