Skip to content

Algorithms

Built-in policy implementations.

RandomPolicy

Uniform random policy over discrete actions.

from tinyrl import GridWorld, Runner, RandomPolicy

env = GridWorld()
policy = RandomPolicy(n_actions=env.n_actions)
runner = Runner(env)

result = runner.run_episode(policy)

Constructor

RandomPolicy(n_actions: int)

Args:

  • n_actions — number of discrete actions to sample from uniformly