A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym)