hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-07 02:51:39 -05:00

History

Kai Fricke 3e6ba5d6d2 Revert "Revert [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`." (#20285 ) * Revert "Revert "[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055)" (#20284)" This reverts commit `246787cdd9`. Co-authored-by: sven1977 <svenmika1977@gmail.com>		2021-11-16 12:26:47 +01:00
..
tests	Revert "Revert [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`." (#20285 )	2021-11-16 12:26:47 +01:00
__init__.py	Revert "Revert [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`." (#20285 )	2021-11-16 12:26:47 +01:00
default_config.py	Revert "Revert [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`." (#20285 )	2021-11-16 12:26:47 +01:00
pg.py	Revert "Revert [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`." (#20285 )	2021-11-16 12:26:47 +01:00
pg_tf_policy.py	Revert "Revert [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`." (#20285 )	2021-11-16 12:26:47 +01:00
pg_torch_policy.py	Revert "Revert [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`." (#20285 )	2021-11-16 12:26:47 +01:00
README.md	[docs] Move all /latest links to /master (#11897 )	2020-11-10 10:53:28 -08:00
utils.py	[RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783 )	2021-10-29 12:03:56 +02:00

README.md

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.

Detailed Documentation

Implementation