ray/rllib/agents at 14405b90d5457863d71168c613b4961d34f19cc5 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Sven Mika 14405b90d5 [RLlib] Prototype of a DynaTrainer (for env dynamics learning in upcoming MBMPO algo). (#8860 )		2020-06-16 09:01:20 +02:00
..
a3c	[RLlib] Minor `rllib.utils` cleanup. (#8932 )	2020-06-16 08:52:20 +02:00
ars	[RLlib] Add testing `Policy.compute_single_action()` for all agents. (#8903 )	2020-06-13 17:51:50 +02:00
ddpg	[RLlib] Minor `rllib.utils` cleanup. (#8932 )	2020-06-16 08:52:20 +02:00
dqn	[RLlib] Minor `rllib.utils` cleanup. (#8932 )	2020-06-16 08:52:20 +02:00
dyna	[RLlib] Prototype of a DynaTrainer (for env dynamics learning in upcoming MBMPO algo). (#8860 )	2020-06-16 09:01:20 +02:00
es	[RLlib] Add testing `Policy.compute_single_action()` for all agents. (#8903 )	2020-06-13 17:51:50 +02:00
impala	[RLlib] Minor `rllib.utils` cleanup. (#8932 )	2020-06-16 08:52:20 +02:00
marwil	[RLlib] Minor `rllib.utils` cleanup. (#8932 )	2020-06-16 08:52:20 +02:00
pg	[RLlib] Minor `rllib.utils` cleanup. (#8932 )	2020-06-16 08:52:20 +02:00
ppo	[RLlib] Minor `rllib.utils` cleanup. (#8932 )	2020-06-16 08:52:20 +02:00
qmix	[RLlib] Minor `rllib.utils` cleanup. (#8932 )	2020-06-16 08:52:20 +02:00
sac	[RLlib] Minor `rllib.utils` cleanup. (#8932 )	2020-06-16 08:52:20 +02:00
__init__.py	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
agent.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
callbacks.py	[rllib] observation function api for multi-agent (#8236 )	2020-05-04 22:13:49 -07:00
mock.py	[RLlib] Fix broken tune tests in master due to framework=auto errors. (#8672 )	2020-05-29 11:55:47 +02:00
registry.py	[rllib] Deprecate policy optimizers (#8345 )	2020-05-21 10:16:18 -07:00
trainer.py	[rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893 )	2020-06-12 20:17:27 -07:00
trainer_template.py	[rllib] Deprecate policy optimizers (#8345 )	2020-05-21 10:16:18 -07:00