ray/rllib/agents
2020-02-20 19:02:17 -08:00
..
a3c [Tests] determine_tests_to_run.sh has a bug affecting RLlib testing to be skipped sometimes. (#7243) 2020-02-20 19:02:17 -08:00
ars [RLlib] Add torch flag to train.py (#6807) 2020-01-17 18:48:44 -08:00
ddpg [RLlib] Fix erroneous use of LinearSchedule (in DDPG's exploration annealing). (#7125) 2020-02-12 23:46:49 -08:00
dqn Fix old exploration configs. (#7240) 2020-02-20 08:39:16 -08:00
es [RLlib] Add torch flag to train.py (#6807) 2020-01-17 18:48:44 -08:00
impala [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
marwil [rllib] implemented compute_advantages without gae (#6941) 2020-01-31 22:25:45 -08:00
pg [rllib] [experimental] custom RL training pipelines (PG_pl, A2C_pl) (#7213) 2020-02-19 16:07:37 -08:00
ppo [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
qmix [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
sac [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
__init__.py [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
agent.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
mock.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
registry.py [rllib] [experimental] custom RL training pipelines (PG_pl, A2C_pl) (#7213) 2020-02-19 16:07:37 -08:00
trainer.py [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
trainer_template.py [rllib] [experimental] custom RL training pipelines (PG_pl, A2C_pl) (#7213) 2020-02-19 16:07:37 -08:00