mirror of
https://github.com/vale981/ray
synced 2025-03-06 02:21:39 -05:00
[Lint Check] Remove broken link (#26505)
The paper is not available anymore.
This commit is contained in:
parent
e48d381926
commit
5df66b917d
2 changed files with 3 additions and 4 deletions
|
@ -27,7 +27,7 @@ Eviction Strategy
|
|||
A buffer is naturally limited in its capacity to hold experiences. In the course of running an algorith, a buffer will eventually reach
|
||||
its capacity and in order to make room for new experiences, we need to delete (evict) older ones. This is generally done on a first-in-first-out basis.
|
||||
For your algorithms this means that buffers with a high capacity give the opportunity to learn from older samples, while smaller buffers
|
||||
make the learning process more on-policy. An exception from this strategy is made in buffers that implement `reservoir sampling <https://www.cs.umd.edu/~samir/498/vitter.pdf>`__.
|
||||
make the learning process more on-policy. An exception from this strategy is made in buffers that implement reservoir sampling.
|
||||
|
||||
|
||||
Replay Buffers in RLlib
|
||||
|
@ -115,7 +115,7 @@ Here is an example of how to implement your own toy example of a ReplayBuffer cl
|
|||
:end-before: __sphinx_doc_replay_buffer_own_buffer__end__
|
||||
|
||||
For a full implementation, you should consider other methods like ``get_state()`` and ``set_state()``.
|
||||
A more extensive example is `our implementation <https://github.com/ray-project/ray/blob/master/rllib/utils/replay_buffers/reservoir_replay_buffer.py>`__ of `reservoir sampling <https://www.cs.umd.edu/~samir/498/vitter.pdf>`__, the :py:class:`~ray.rllib.utils.replay_buffers.reservoir_replay_buffer.ReservoirReplayBuffer`.
|
||||
A more extensive example is `our implementation <https://github.com/ray-project/ray/blob/master/rllib/utils/replay_buffers/reservoir_replay_buffer.py>`__ of reservoir sampling, the :py:class:`~ray.rllib.utils.replay_buffers.reservoir_replay_buffer.ReservoirReplayBuffer`.
|
||||
|
||||
|
||||
Advanced Usage
|
||||
|
|
|
@ -19,8 +19,7 @@ class ReservoirReplayBuffer(ReplayBuffer):
|
|||
"""This buffer implements reservoir sampling.
|
||||
|
||||
The algorithm has been described by Jeffrey S. Vitter in "Random sampling
|
||||
with a reservoir". See https://www.cs.umd.edu/~samir/498/vitter.pdf for
|
||||
the full paper.
|
||||
with a reservoir".
|
||||
"""
|
||||
|
||||
def __init__(
|
||||
|
|
Loading…
Add table
Reference in a new issue