ray/rllib/models at 05df80afad7303ce233c3b95da60cfb263fed9db - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-12 22:26:39 -04:00

History

Eric Liang baadbdf8d4 [rllib] Execute PPO using training workflow (#8206 ) * wip * add kl * kl * works now * doc update * reorg * add ddppo * add stats * fix fetch * comment * fix learner stat regression * test fixes * fix test		2020-04-30 01:18:09 -07:00
..
tests	[RLlib] Nested action space PR (minimally invasive; torch only + test). (#8101 )	2020-04-23 09:09:22 +02:00
tf	[RLlib] Deprecate all Model(v1) usage. (#8146 )	2020-04-29 12:12:59 +02:00
torch	[RLlib] Deprecate all Model(v1) usage. (#8146 )	2020-04-29 12:12:59 +02:00
__init__.py	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
action_dist.py	[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155 )	2020-02-19 12:18:45 -08:00
catalog.py	[RLlib] Deprecate all Model(v1) usage. (#8146 )	2020-04-29 12:12:59 +02:00
extra_spaces.py	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
model.py	[RLlib] Deprecate all Model(v1) usage. (#8146 )	2020-04-29 12:12:59 +02:00
modelv2.py	[rllib] Execute PPO using training workflow (#8206 )	2020-04-30 01:18:09 -07:00
preprocessors.py	[RLlib] Fix broken example: tf-eager with custom-RNN (#6732 ). (#7021 )	2020-02-06 09:44:08 -08:00
README.txt	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00

Shared neural network models for RLlib.