ray/rllib/policy
2020-05-30 22:48:34 +02:00
tests [RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520) 2020-05-27 16:19:13 +02:00
__init__.py [rllib] observation function api for multi-agent (#8236) 2020-05-04 22:13:49 -07:00
dynamic_tf_policy.py [RLlib] Add 2 Transformer learning test cases on StatelessCartPole (PPO and IMPALA). (#8624) 2020-05-27 10:19:47 +02:00
eager_tf_policy.py [RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520) 2020-05-27 16:19:13 +02:00
policy.py [RLlib] utils/spaces ... (#8608) 2020-05-27 10:21:30 +02:00
rnn_sequencing.py [Testing] Do not run any non-RLlib/core tests if only RLLib affected (except wheels). (#7892) 2020-04-09 14:36:06 -07:00
sample_batch.py [rllib] Distributed exec workflow for impala (#8321) 2020-05-11 20:24:43 -07:00
tf_policy.py [RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304) 2020-05-04 23:53:38 +02:00
tf_policy_template.py [RLlib] DQN torch version. (#7597) 2020-04-06 11:56:16 -07:00
torch_policy.py [RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590) 2020-05-30 22:48:34 +02:00
torch_policy_template.py [RLlib] Add 2 Transformer learning test cases on StatelessCartPole (PPO and IMPALA). (#8624) 2020-05-27 10:19:47 +02:00