Using an Environment


If the gym-anm environment you would like to use has already been registered in the Gym registry (see the Gym documentation), you can initialize it with gym.make('gym_anm:<ENV_ID>'), where <ENV_ID> is the ID of the environment. For example:

import gym
env = gym.make('gym_anm:ANM6Easy-v0')

Note: all environments provided as part of the gym-anm package are automatically registered.

Alternatively, the environment can be initialized directly from its class:

from gym_anm.envs import ANM6Easy
env = ANM6Easy()

Agent-environment interactions

Built on top of Gym, gym-anm provides two core functions: reset() and step(a).

reset() can be used to reset the environment and collect the first observation of the trajectory:

obs = env.reset()

Once the agent has selected an action a, step(a) applies it to the environment:

obs, r, done, info = env.step(a)

where:

  • obs is the vector of observations \(o_{t+1}\),

  • r is the reward \(r_t\),

  • done is a boolean value set to True if \(s_{t+1}\) is a terminal state,

  • info gathers information about the transition (it is seldom used in gym-anm).
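The reset() and step(a) calls described above can be combined into a small rollout helper. The following is only a sketch: run_episode and its policy argument are hypothetical names that are not part of gym-anm, and the code assumes the 4-tuple return value of step(a) shown above.

```python
def run_episode(env, policy, max_steps=1000):
    """Roll out `policy` in `env` until a terminal state or `max_steps`.

    `env` is any object exposing reset() and step(a) as described above.
    `policy` is a hypothetical callable mapping an observation to an action.
    Returns the sum of rewards collected and the number of steps taken.
    """
    obs = env.reset()                      # first observation of the trajectory
    total_reward = 0.0
    steps = 0

    for _ in range(max_steps):
        action = policy(obs)
        obs, r, done, info = env.step(action)
        total_reward += r
        steps += 1
        if done:                           # s_{t+1} is terminal: stop the episode
            break

    return total_reward, steps
```

With a gym-anm environment, this might be called as run_episode(env, lambda obs: env.action_space.sample()), which applies randomly sampled actions.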

Render the environment

Some gym-anm environments may support rendering through the render() and close() functions.

To update the visualization of the environment, call the render() method:

env.render()

To end the visualization and release all resources it uses, call the close() method:

env.close()

Currently, only gym_anm:ANM6Easy-v0 supports rendering.
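Because close() should be called even if an error interrupts the loop, one option is to wrap the environment in contextlib.closing from the Python standard library. This is a sketch rather than a recommended gym-anm pattern: render_steps, policy, n_steps and delay are hypothetical names introduced for illustration, and the environment is assumed to support render().

```python
import time
from contextlib import closing


def render_steps(env, policy, n_steps=10, delay=0.5):
    """Step through `env`, rendering each transition.

    `policy` is a hypothetical callable mapping an observation to an action.
    closing() guarantees that env.close() runs when the block exits,
    including when an exception is raised inside it.
    """
    with closing(env):
        obs = env.reset()
        for _ in range(n_steps):
            obs, r, done, info = env.step(policy(obs))
            env.render()                   # update the visualization
            time.sleep(delay)              # slow down for a human viewer
            if done:
                obs = env.reset()
```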

Complete example

A complete example of agent-environment interactions, where agent is an arbitrary agent object exposing an act() method:

import time

import gym

env = gym.make('gym_anm:ANM6Easy-v0')
o = env.reset()

for i in range(1000):
    a = agent.act(o)   # agent is any object exposing act(observation)
    o, r, done, info = env.step(a)
    env.render()
    time.sleep(0.5)   # otherwise the rendering is too fast for the human eye

    if done:
        o = env.reset()

env.close()

The above example would be rendered in your favorite web browser.
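The agent object in the complete example is left unspecified. As a placeholder, a minimal random agent might look like the sketch below. RandomAgent is a hypothetical class that is not provided by gym-anm; it only assumes that the environment exposes a standard Gym action_space with a sample() method.

```python
class RandomAgent:
    """Hypothetical agent that ignores observations and samples random actions."""

    def __init__(self, action_space):
        # Any object with a sample() method works, e.g. env.action_space.
        self.action_space = action_space

    def act(self, obs):
        # The observation is unused: actions are drawn from the action space.
        return self.action_space.sample()
```

It could be created with agent = RandomAgent(env.action_space) before entering the loop of the complete example.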
