[Wrappers]: TimeAwareObservation #1490

zuoxingdong · 2019-05-14T13:15:29Z

No description provided.

pzhokhov · 2019-05-17T23:27:46Z

gym/wrappers/time_aware_observation.py

+        super(TimeAwareObservation, self).__init__(env)
+        low = np.append(self.observation_space.low, 0.0)
+        high = np.append(self.observation_space.high, np.inf)
+        self.observation_space = Box(low, high, dtype=np.float32)


let's add some asserts here to make sure it is not accidentally being used where it cannot be used with some arcane error message downstream (i.e. check that observation space is Box with dtype=np.float32). Another subtle possibility for a problem is limits of observation space / normalization. Imagine low=-1.0, high=1.0 - then time steps after the second step will be out of bounds. Even more subtly, if the timesteps observation is on the different scale with the rest of the observations, that can mess up the neural network training (so one will have to resort to VecNormalize-like wrapper from baselines approaches)

Two assertions are added for checking Box with np.float32. Indeed, for environments that beyond very short horizons, it can be problematic for training neural networks. Perhaps we could add a documentation in the docstring about that ? Or displays a warning msg (but this warning will be redundant if the user already wrap with VecNormalize-like wrappers).

what if we provide a horizon parameter, and then normalize the time observation by that value?

@pzhokhov That's a great idea ! Should we provide this functionality as an optional flag, so that the user could decide to give a horizon parameter or by default letting it be incremented ?

TheScalper · 2020-06-03T00:36:03Z

This PR is & year old. Is it still relevant or can it be close?

jkterry1 · 2020-11-02T00:27:46Z

@pzhokhov do the commits address your concerns?

pzhokhov · 2020-11-05T21:44:33Z

Close enough, let's merge it.

* Create time_aware_observation.py * Update __init__.py * Create test_time_aware_observation.py * Update time_aware_observation.py * Update time_aware_observation.py * Update time_aware_observation.py Co-authored-by: pzhokhov <peterz@openai.com>

zuoxingdong added 5 commits May 14, 2019 15:13

Create time_aware_observation.py

44c2719

Update __init__.py

c822f14

Create test_time_aware_observation.py

61f90d6

Update time_aware_observation.py

0f2cacb

Update time_aware_observation.py

b7a28ea

pzhokhov reviewed May 17, 2019

View reviewed changes

zuoxingdong and others added 4 commits May 20, 2019 16:12

Update time_aware_observation.py

76755e3

Merge branch 'master' into patch-11

b56955a

Merge branch 'master' into patch-11

e4b836d

Merge branch 'master' into patch-11

0457b6a

Merge branch 'master' into patch-11

d22bfb2

pzhokhov merged commit 28c42b6 into openai:master Nov 5, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Wrappers]: TimeAwareObservation #1490

[Wrappers]: TimeAwareObservation #1490

zuoxingdong commented May 14, 2019

pzhokhov May 17, 2019

zuoxingdong May 20, 2019

pzhokhov May 24, 2019

zuoxingdong May 27, 2019

TheScalper commented Jun 3, 2020

jkterry1 commented Nov 2, 2020

pzhokhov commented Nov 5, 2020

[Wrappers]: TimeAwareObservation #1490

[Wrappers]: TimeAwareObservation #1490

Conversation

zuoxingdong commented May 14, 2019

pzhokhov May 17, 2019

Choose a reason for hiding this comment

zuoxingdong May 20, 2019

Choose a reason for hiding this comment

pzhokhov May 24, 2019

Choose a reason for hiding this comment

zuoxingdong May 27, 2019

Choose a reason for hiding this comment

TheScalper commented Jun 3, 2020

jkterry1 commented Nov 2, 2020

pzhokhov commented Nov 5, 2020