diff --git a/doc/source/rllib/rllib-replay-buffers.rst b/doc/source/rllib/rllib-replay-buffers.rst index 4efc6e76b..5ce3621d2 100644 --- a/doc/source/rllib/rllib-replay-buffers.rst +++ b/doc/source/rllib/rllib-replay-buffers.rst @@ -27,7 +27,7 @@ Eviction Strategy A buffer is naturally limited in its capacity to hold experiences. In the course of running an algorith, a buffer will eventually reach its capacity and in order to make room for new experiences, we need to delete (evict) older ones. This is generally done on a first-in-first-out basis. For your algorithms this means that buffers with a high capacity give the opportunity to learn from older samples, while smaller buffers -make the learning process more on-policy. An exception from this strategy is made in buffers that implement `reservoir sampling `__. +make the learning process more on-policy. An exception from this strategy is made in buffers that implement reservoir sampling. Replay Buffers in RLlib @@ -115,7 +115,7 @@ Here is an example of how to implement your own toy example of a ReplayBuffer cl :end-before: __sphinx_doc_replay_buffer_own_buffer__end__ For a full implementation, you should consider other methods like ``get_state()`` and ``set_state()``. -A more extensive example is `our implementation `__ of `reservoir sampling `__, the :py:class:`~ray.rllib.utils.replay_buffers.reservoir_replay_buffer.ReservoirReplayBuffer`. +A more extensive example is `our implementation `__ of reservoir sampling, the :py:class:`~ray.rllib.utils.replay_buffers.reservoir_replay_buffer.ReservoirReplayBuffer`. Advanced Usage diff --git a/rllib/utils/replay_buffers/reservoir_replay_buffer.py b/rllib/utils/replay_buffers/reservoir_replay_buffer.py index c086c90df..6cf098b15 100644 --- a/rllib/utils/replay_buffers/reservoir_replay_buffer.py +++ b/rllib/utils/replay_buffers/reservoir_replay_buffer.py @@ -19,8 +19,7 @@ class ReservoirReplayBuffer(ReplayBuffer): """This buffer implements reservoir sampling. The algorithm has been described by Jeffrey S. Vitter in "Random sampling - with a reservoir". See https://www.cs.umd.edu/~samir/498/vitter.pdf for - the full paper. + with a reservoir". """ def __init__(