ray/rllib/utils
Eric Liang a644060daa
[rllib] First pass at pipeline implementation of DQN (#7433)
* wip iters

* add test

* speed up

* update docs

* document it

* support serial sampling

* add test

* spacing

* annotate it

* update

* rename to pipeline

* comment

* iter2 wip

* update

* update

* context test

* update

* fix

* fix

* a3c pipeline

* doc

* update

* move timer

* comment

* add pipeline test

* fix

* clean up

* document

* iter s

* wip dqn

* wip

* wip

* metrics

* metrics rename

* metrics ctx

* wip

* constants

* add todo

* support .union

* wip

* support union

* remove prints

* add todo

* remove auto timer

* fix up

* fix pipeline test

* typing

* fix breakage

* remove bad assert

* wip

* fix multiagent example

* fix apply

* update a3c

* remove a2c pl

* 0 workers

* wip

* wip

* share metrics

* wip

* wip

* doc

* fix weight sync and global var updates

* mode

* fix

* fix

* doc

* fix
2020-03-07 14:47:58 -08:00
exploration [rllib] Make timestep a required arg for exploration classes (#7380) 2020-03-04 13:00:37 -08:00
schedules [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) 2020-03-01 11:53:35 -08:00
tests [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
__init__.py [RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124) 2020-02-22 14:19:49 -08:00
actors.py [rllib] - TaskPool.completed_prefetch() no longer returns stale object ids after an error (#7139) 2020-02-13 22:30:44 -08:00
annotations.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
compression.py Stop vendoring pyarrow (#7233) 2020-02-19 19:01:26 -08:00
debug.py [Core/RLlib] Move log_once from rllib to ray.util. (#7273) 2020-02-27 10:40:44 -08:00
deprecation.py [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
error.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
experimental_dsl.py [rllib] First pass at pipeline implementation of DQN (#7433) 2020-03-07 14:47:58 -08:00
explained_variance.py [RLlib] Implement PPO torch version. (#6826) 2020-01-20 23:06:50 -08:00
filter.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
filter_manager.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
framework.py [rllib] Make timestep a required arg for exploration classes (#7380) 2020-03-04 13:00:37 -08:00
from_config.py [rllib] Fix torch GPU / yaml load warning (#7278) 2020-02-23 13:13:43 -08:00
memory.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
numpy.py [RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124) 2020-02-22 14:19:49 -08:00
policy_client.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
policy_server.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
seed.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
sgd.py [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
test_utils.py [RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238) 2020-02-22 11:02:31 -08:00
tf_ops.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
tf_run_builder.py [RLlib] SAC add discrete action support. (#7320) 2020-03-06 10:37:12 -08:00
timer.py [rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299) 2020-02-28 16:44:17 -08:00
torch_ops.py [RLlib] Issue 7421: can't convert cuda tensor to numpy in torch ppo. (#7445) 2020-03-06 12:45:30 -08:00
tracking_dict.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
tuple_actions.py [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 2020-02-19 12:18:45 -08:00
window_stat.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00