.. |
a3c
|
[RLlib] Exploration API: ParamNoise Integration into DQN; working example/test cases. (#7814)
|
2020-04-03 10:44:25 -07:00 |
ars
|
[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)
|
2020-03-14 12:05:04 -07:00 |
ddpg
|
[RLlib] Add testing framework_iterator. (#7852)
|
2020-04-03 12:24:25 -07:00 |
dqn
|
[RLlib] Add testing framework_iterator. (#7852)
|
2020-04-03 12:24:25 -07:00 |
es
|
[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)
|
2020-03-14 12:05:04 -07:00 |
impala
|
[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)
|
2020-04-01 00:43:21 -07:00 |
marwil
|
[rllib] implemented compute_advantages without gae (#6941)
|
2020-01-31 22:25:45 -08:00 |
pg
|
[rllib] Enable distributed exec api for A2C, A3C, PG by default (#7580)
|
2020-03-13 18:48:41 -07:00 |
ppo
|
[RLlib] Add testing framework_iterator. (#7852)
|
2020-04-03 12:24:25 -07:00 |
qmix
|
[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)
|
2020-04-01 00:43:21 -07:00 |
sac
|
[RLlib] Add testing framework_iterator. (#7852)
|
2020-04-03 12:24:25 -07:00 |
__init__.py
|
[rllib] Try moving RLlib to top level dir (#5324)
|
2019-08-05 23:25:49 -07:00 |
agent.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
mock.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
registry.py
|
[rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338)
|
2020-03-02 15:16:37 -08:00 |
trainer.py
|
[RLlib] Minimal ParamNoise PR. (#7772)
|
2020-03-28 16:16:30 -07:00 |
trainer_template.py
|
[RLlib] Bug default policy overrides torch policy. (#7756)
|
2020-03-26 10:03:20 -07:00 |