Fix episodic buffer len #155

muupan · 2017-10-17T10:15:03Z

Add AbstractReplayBuffer and AbstractEpisodicReplayBuffer to clarify the interfaces of replay buffers
Change the behaviour of the __len__ of EpisodicReplayBuffer and PrioritizedEpisodicReplayBuffer so that they now return the number of transitions, not of episodes.
Add the n_episodes property to count the number of episodes
Update the examples accordingly
Update the tests accordingly (and fix a bug in tests)

Resolves #138

The behaviour of episodic replay buffers has been changed so that now __len__ returns the number of transitions, not episodes, to fix chainer#138. You can use the n_episodes property to get the number of episodes in the buffer.

coveralls · 2017-10-19T03:11:43Z

Coverage decreased (-0.08%) to 71.629% when pulling b20b2db on muupan:fix-episodic-buffer-len into 570ce6f on chainer:master.

toslunar

It looks good to me except it should be clear which class should take care of size checks. For example, I suppose update_if_necessary should have the same check as the lines you added to PCL.

Besides, I left some suggestions that might improve the code.

toslunar · 2017-10-19T05:49:16Z

chainerrl/replay_buffer.py

+    def stop_current_episode(self):
+        """Notify the buffer that the current episode is interrupted.
+
+        When a transtion with is_state_terminal=True is appended, the buffer


I suggest starting the document with the cases when the method should be called, for example:

You may want to interrupt the current episode and start a new one before observing a terminal state. This is typical in continuing envs. In such cases, you need to call this method before appending a new transition so that the buffer will treat it as an initial transition of a new episode.
This method should not be called after an episode whose termination is already notified by appending a transition with is_state_terminal=True.

toslunar · 2017-10-19T06:31:03Z

chainerrl/replay_buffer.py

@@ -14,14 +14,16 @@
 from chainerrl.misc.prioritized import PrioritizedBuffer


-class ReplayBuffer(object):
+class AbstractReplayBuffer(object):


Why don't you decorate the methods with @abstractmethod?

and raise an error instead of pass

muupan · 2017-10-20T05:13:26Z

Thank you for your review! I added a check of n_episodes in ReplayUpdater.update_if_necessary as well and followed your suggestions.

toslunar · 2017-10-20T05:23:58Z

LGTM!

coveralls · 2017-10-20T05:39:16Z

Coverage decreased (-0.03%) to 71.672% when pulling 3dc1ce5 on muupan:fix-episodic-buffer-len into 570ce6f on chainer:master.

muupan added 5 commits October 17, 2017 19:05

Define common interfaces for replay buffers

89454fb

The behaviour of episodic replay buffers has been changed so that now __len__ returns the number of transitions, not episodes, to fix chainer#138. You can use the n_episodes property to get the number of episodes in the buffer.

Change the value of default replay-start-size

bab0df4

Fix a bug of not executing subtest_save_and_load

4ddebdf

Check __len__ and n_eipsodes for episodic buffers

abc9165

Make PCL check n_episodes before calling sample_episodes

b20b2db

muupan changed the title ~~[WIP] Fix episodic buffer __len__~~ Fix episodic buffer __len__ Oct 19, 2017

toslunar reviewed Oct 19, 2017

View reviewed changes

muupan added 3 commits October 20, 2017 14:06

Use metaclass

2644312

Check n_episodes in ReplayUpdater.update_if_necessary

d5168dd

Improve docstring of stop_current_episode

3dc1ce5

and raise an error instead of pass

toslunar merged commit 51a2762 into chainer:master Oct 20, 2017

muupan added the bug label Nov 30, 2017

muupan added this to the v0.3 milestone Nov 30, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix episodic buffer len #155

Fix episodic buffer len #155

Uh oh!

muupan commented Oct 17, 2017 •

edited

Loading

Uh oh!

coveralls commented Oct 19, 2017 •

edited

Loading

Uh oh!

toslunar left a comment

Uh oh!

toslunar Oct 19, 2017

Uh oh!

toslunar Oct 19, 2017

Uh oh!

muupan commented Oct 20, 2017

Uh oh!

toslunar commented Oct 20, 2017

Uh oh!

coveralls commented Oct 20, 2017 •

edited

Loading

Uh oh!

Uh oh!

Fix episodic buffer __len__ #155

Fix episodic buffer __len__ #155

Uh oh!

Conversation

muupan commented Oct 17, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coveralls commented Oct 19, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

toslunar left a comment

Choose a reason for hiding this comment

Uh oh!

toslunar Oct 19, 2017

Choose a reason for hiding this comment

Uh oh!

toslunar Oct 19, 2017

Choose a reason for hiding this comment

Uh oh!

muupan commented Oct 20, 2017

Uh oh!

toslunar commented Oct 20, 2017

Uh oh!

coveralls commented Oct 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Fix episodic buffer len #155

Fix episodic buffer len #155

muupan commented Oct 17, 2017 •

edited

Loading

coveralls commented Oct 19, 2017 •

edited

Loading

coveralls commented Oct 20, 2017 •

edited

Loading