hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Max Pumperla 6a6c58b5b4 [RLlib] Config objects for DDPG and SimpleQ. (#24339 )		2022-05-12 16:12:42 +02:00
..
tests	[RLlib] PGTrainer config object class (`PGConfig`). (#24295 )	2022-04-28 22:25:16 +02:00
__init__.py	[RLlib] PGTrainer config object class (`PGConfig`). (#24295 )	2022-04-28 22:25:16 +02:00
default_config.py	[RLlib] PGTrainer config object class (`PGConfig`). (#24295 )	2022-04-28 22:25:16 +02:00
pg.py	[RLlib] Config objects for DDPG and SimpleQ. (#24339 )	2022-05-12 16:12:42 +02:00
pg_tf_policy.py	[CI] Format Python code with Black (#21975 )	2022-01-29 18:41:57 -08:00
pg_torch_policy.py	[RLlib] Fix typo in docstring of PGTorchPolicy (#23818 )	2022-04-11 19:31:45 +02:00
README.md	[docs] Move all /latest links to /master (#11897 )	2020-11-10 10:53:28 -08:00
utils.py	[CI] Format Python code with Black (#21975 )	2022-01-29 18:41:57 -08:00

README.md

Policy Gradient (PG)

An implementation of a vanilla policy gradient algorithm for TensorFlow and PyTorch.

Detailed Documentation

Implementation