ray/rllib/agents/pg
Balaji Veeramani 7f1bacc7dc
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
2022-01-29 18:41:57 -08:00
..
tests [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
__init__.py Revert "Revert [RLlib] POC: PGTrainer class that works by sub-classing, not trainer_template.py." (#20285) 2021-11-16 12:26:47 +01:00
default_config.py [RLlib] Issue 18499: PGTrainer with training_iteration fn does not support multi-GPU. (#21376) 2022-01-05 18:22:33 +01:00
pg.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
pg_tf_policy.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
pg_torch_policy.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
README.md [docs] Move all /latest links to /master (#11897) 2020-11-10 10:53:28 -08:00
utils.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.

Detailed Documentation

Implementation