twodlearn.reinforce.models.bullet_env module

class twodlearn.reinforce.models.bullet_env.SingleRobotBulletEnv(BulletRobot, frame_skip=2)[source]

Bases: pybullet_envs.env_bases.MJCFBaseBulletEnv

calc_done(state, action)[source]
calc_reward(state, action)[source]
camera_adjust()[source]
create_single_player_scene(bullet_client)[source]
property dt[source]
property frame_skip[source]
reset()[source]

Resets the state of the environment and returns an initial observation.

Returns: observation (object): the initial observation of the

space.

step(action)[source]

Run one timestep of the environment’s dynamics. When end of episode is reached, you are responsible for calling reset() to reset this environment’s state.

Accepts an action and returns a tuple (observation, reward, done, info).

Parameters

action (object) – an action provided by the environment

Returns

agent’s observation of the current environment reward (float) : amount of reward returned after previous action done (boolean): whether the episode has ended, in which case further step() calls will return undefined results info (dict): contains auxiliary diagnostic information (helpful for debugging, and sometimes learning)

Return type

observation (object)