.. |
a3c
|
[rllib] implemented compute_advantages without gae (#6941)
|
2020-01-31 22:25:45 -08:00 |
ars
|
[RLlib] Add torch flag to train.py (#6807)
|
2020-01-17 18:48:44 -08:00 |
ddpg
|
[RLlib] Policy-classes cleanup and torch/tf unification. (#6770)
|
2020-01-17 22:26:28 -08:00 |
dqn
|
[RLlib] Bug fix: DQN goes into negative epsilon values after reaching explora… (#6971)
|
2020-01-31 09:54:12 -08:00 |
es
|
[RLlib] Add torch flag to train.py (#6807)
|
2020-01-17 18:48:44 -08:00 |
impala
|
[RLlib] Add torch flag to train.py (#6807)
|
2020-01-17 18:48:44 -08:00 |
marwil
|
[rllib] implemented compute_advantages without gae (#6941)
|
2020-01-31 22:25:45 -08:00 |
pg
|
[rllib] implemented compute_advantages without gae (#6941)
|
2020-01-31 22:25:45 -08:00 |
ppo
|
[rllib] implemented compute_advantages without gae (#6941)
|
2020-01-31 22:25:45 -08:00 |
qmix
|
[RLlib] Bug fix: DQN goes into negative epsilon values after reaching explora… (#6971)
|
2020-01-31 09:54:12 -08:00 |
sac
|
[RLlib] Bug fix: DQN goes into negative epsilon values after reaching explora… (#6971)
|
2020-01-31 09:54:12 -08:00 |
__init__.py
|
[rllib] Try moving RLlib to top level dir (#5324)
|
2019-08-05 23:25:49 -07:00 |
agent.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
mock.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
registry.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
trainer.py
|
[rllib] Support parallel, parameterized evaluation (#6981)
|
2020-02-01 22:12:12 -08:00 |
trainer_template.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |