.. |
exploration
|
[RLlib] Issue 9667 DDPG Torch bugs and enhancements. (#9680)
|
2020-07-28 14:15:03 +02:00 |
schedules
|
[RLlib] Add tensor-based tests for Schedules and fix some bugs related to using Schedules with tensor time input. (#9782)
|
2020-07-30 12:49:32 +02:00 |
spaces
|
fixed simplex initialisation seeding bug (#9660)
|
2020-07-24 14:22:41 -07:00 |
tests
|
Change Python's ObjectID to ObjectRef (#9353)
|
2020-07-10 17:49:04 +08:00 |
__init__.py
|
[RLlib] Retire try_import_tree (should be installed along with other requirements). (#9211)
|
2020-07-02 13:06:34 +02:00 |
actors.py
|
Change Python's ObjectID to ObjectRef (#9353)
|
2020-07-10 17:49:04 +08:00 |
annotations.py
|
[rllib] Deprecate policy optimizers (#8345)
|
2020-05-21 10:16:18 -07:00 |
compression.py
|
Stop vendoring pyarrow (#7233)
|
2020-02-19 19:01:26 -08:00 |
debug.py
|
[rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893)
|
2020-06-12 20:17:27 -07:00 |
deprecation.py
|
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
2020-02-19 12:18:45 -08:00 |
error.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
filter.py
|
Remove future imports (#6724)
|
2020-01-09 00:15:48 -08:00 |
filter_manager.py
|
[rllib] Deprecate policy optimizers (#8345)
|
2020-05-21 10:16:18 -07:00 |
framework.py
|
[RLlib] Implement DQN PyTorch distributional head. (#9589)
|
2020-07-25 09:29:24 +02:00 |
from_config.py
|
[RLlib] Make envs specifiable in configs by their class path. (#8750)
|
2020-06-03 08:14:29 +02:00 |
memory.py
|
[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678)
|
2020-07-29 21:15:09 +02:00 |
numpy.py
|
[RLlib] Layout of Trajectory View API (new class: Trajectory; not used yet). (#9269)
|
2020-07-14 04:27:49 +02:00 |
policy_client.py
|
[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590)
|
2020-05-30 22:48:34 +02:00 |
policy_server.py
|
[rllib] Add high-performance external application connector (#7641)
|
2020-03-20 12:43:57 -07:00 |
sgd.py
|
WIP. (#8456)
|
2020-05-15 21:43:27 +02:00 |
test_utils.py
|
[RLlib] Tf2.x native. (#8752)
|
2020-07-11 22:06:35 +02:00 |
tf_ops.py
|
[RLlib] Tf2.x native. (#8752)
|
2020-07-11 22:06:35 +02:00 |
tf_run_builder.py
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
|
2020-06-30 10:13:20 +02:00 |
timer.py
|
[rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299)
|
2020-02-28 16:44:17 -08:00 |
torch_ops.py
|
[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678)
|
2020-07-29 21:15:09 +02:00 |
tracking_dict.py
|
This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970)
|
2020-06-27 20:50:01 +02:00 |
tuple_actions.py
|
[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143)
|
2020-04-28 14:59:16 +02:00 |
types.py
|
[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678)
|
2020-07-29 21:15:09 +02:00 |
window_stat.py
|
[RLLib] WindowStat bug fix (#9213)
|
2020-07-12 23:01:32 +02:00 |