hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-10 05:16:49 -04:00

History

Sven Mika 6f85af435f [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055 )		2021-11-11 12:16:20 +01:00
..
tests	[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055 )	2021-11-11 12:16:20 +01:00
__init__.py	[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055 )	2021-11-11 12:16:20 +01:00
default_config.py	[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055 )	2021-11-11 12:16:20 +01:00
pg.py	[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055 )	2021-11-11 12:16:20 +01:00
pg_tf_policy.py	[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055 )	2021-11-11 12:16:20 +01:00
pg_torch_policy.py	[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055 )	2021-11-11 12:16:20 +01:00
README.md	[docs] Move all /latest links to /master (#11897 )	2020-11-10 10:53:28 -08:00
utils.py	[RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783 )	2021-10-29 12:03:56 +02:00

README.md

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.

Detailed Documentation

Implementation