[Lint Check] Remove broken link (#26505)

The paper is not available anymore.
This commit is contained in:
Avnish Narayan 2022-07-13 02:30:20 -07:00 committed by GitHub
parent e48d381926
commit 5df66b917d
No known key found for this signature in database
GPG key ID: 4AEE18F83AFDEB23
2 changed files with 3 additions and 4 deletions

View file

@ -27,7 +27,7 @@ Eviction Strategy
A buffer is naturally limited in its capacity to hold experiences. In the course of running an algorith, a buffer will eventually reach
its capacity and in order to make room for new experiences, we need to delete (evict) older ones. This is generally done on a first-in-first-out basis.
For your algorithms this means that buffers with a high capacity give the opportunity to learn from older samples, while smaller buffers
make the learning process more on-policy. An exception from this strategy is made in buffers that implement `reservoir sampling <https://www.cs.umd.edu/~samir/498/vitter.pdf>`__.
make the learning process more on-policy. An exception from this strategy is made in buffers that implement reservoir sampling.
Replay Buffers in RLlib
@ -115,7 +115,7 @@ Here is an example of how to implement your own toy example of a ReplayBuffer cl
:end-before: __sphinx_doc_replay_buffer_own_buffer__end__
For a full implementation, you should consider other methods like ``get_state()`` and ``set_state()``.
A more extensive example is `our implementation <https://github.com/ray-project/ray/blob/master/rllib/utils/replay_buffers/reservoir_replay_buffer.py>`__ of `reservoir sampling <https://www.cs.umd.edu/~samir/498/vitter.pdf>`__, the :py:class:`~ray.rllib.utils.replay_buffers.reservoir_replay_buffer.ReservoirReplayBuffer`.
A more extensive example is `our implementation <https://github.com/ray-project/ray/blob/master/rllib/utils/replay_buffers/reservoir_replay_buffer.py>`__ of reservoir sampling, the :py:class:`~ray.rllib.utils.replay_buffers.reservoir_replay_buffer.ReservoirReplayBuffer`.
Advanced Usage

View file

@ -19,8 +19,7 @@ class ReservoirReplayBuffer(ReplayBuffer):
"""This buffer implements reservoir sampling.
The algorithm has been described by Jeffrey S. Vitter in "Random sampling
with a reservoir". See https://www.cs.umd.edu/~samir/498/vitter.pdf for
the full paper.
with a reservoir".
"""
def __init__(