Xuehai Pan
|
6087eda91b
|
[RLlib] Issue 21991: Fix SampleBatch slicing for SampleBatch.INFOS in RNN cases (#22050)
|
2022-04-25 11:40:24 +02:00 |
|
Sven Mika
|
de9e143938
|
[RLlib] Issue 23907: SampleBatch.shuffle does not flush intercepted_values dict (which it should). (#24005)
|
2022-04-19 17:55:59 +02:00 |
|
Sven Mika
|
a8494742a3
|
[RLlib] Memory leak finding toolset using tracemalloc + CI memory leak tests. (#15412)
|
2022-04-12 07:50:09 +02:00 |
|
Sven Mika
|
0b3a79ca41
|
[RLlib] Issue 23639: Error in client/server setup when using LSTMs (#23740)
|
2022-04-07 10:16:22 +02:00 |
|
Max Pumperla
|
60054995e6
|
[docs] fix doctests and activate CI (#23418)
|
2022-03-24 17:04:02 -07:00 |
|
Artur Niederfahrenhorst
|
c0ade5f0b7
|
[RLlib] Issue 22625: MultiAgentBatch.timeslices() does not behave as expected. (#22657)
|
2022-03-08 14:25:48 +01:00 |
|
Sven Mika
|
04a5c72ea3
|
Revert "Revert "[RLlib] Speedup A3C up to 3x (new training_iteration function instead of execution_plan) and re-instate Pong learning test."" (#18708)
|
2022-02-10 13:44:22 +01:00 |
|
Alex Wu
|
b122f093c1
|
Revert "[RLlib] Speedup A3C up to 3x (new training_iteration function instead of execution_plan) and re-instate Pong learning test." (#22250)
Reverts ray-project/ray#22126
Breaks rllib:tests/test_io
|
2022-02-09 09:26:36 -08:00 |
|
Sven Mika
|
ac3e6ab411
|
[RLlib] Speedup A3C up to 3x (new training_iteration function instead of execution_plan) and re-instate Pong learning test. (#22126)
|
2022-02-08 19:04:13 +01:00 |
|
Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Sven Mika
|
92f030331e
|
[RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420)
|
2022-01-10 11:22:55 +01:00 |
|
Sven Mika
|
3a3d0a4a2b
|
[RLlib] Issue 21340: SampleBatch __init__ docstring wrong. (#21447)
|
2022-01-07 15:48:14 +01:00 |
|
Sven Mika
|
9e6b871739
|
[RLlib] Better utils for flattening complex inputs and enable prev-actions for LSTM/attention for complex action spaces. (#21330)
|
2022-01-05 11:29:44 +01:00 |
|
Sven Mika
|
62dbf26394
|
[RLlib] POC: Run PGTrainer w/o the distr. exec API (Trainer's new training_iteration method). (#20984)
|
2021-12-21 08:39:05 +01:00 |
|
Sven Mika
|
f814c2af89
|
[RLlib; Docs] Docs API reference pages: rllib/execution, rllib/evaluation, rllib/models, rllib/offline. (#20538)
|
2021-12-10 09:41:29 +01:00 |
|
Ishant Mrinal
|
2868d1a2cf
|
[RLlib] Support for RE3 exploration algorithm (for tf) (#19551)
|
2021-12-07 13:26:34 +01:00 |
|
mvindiola1
|
eadc7669c5
|
[RLlib] SampleBatch.concat_samples fix incorrect max_seq_len calculation (#20704)
|
2021-11-29 12:01:40 +01:00 |
|
Sven Mika
|
70fe25055a
|
[RLlib] Issue: Get single step input dict incorrect. (#20217)
|
2021-11-12 08:38:51 +01:00 |
|
Sven Mika
|
a931076f59
|
[RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981)
|
2021-11-05 16:10:00 +01:00 |
|
Sven Mika
|
f3397b6f48
|
[RLlib] Minor fixes/cleanups; chop_into_sequences now handles nested data. (#19408)
|
2021-11-05 14:39:28 +01:00 |
|
Sven Mika
|
0b308719f8
|
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
2021-11-01 21:46:02 +01:00 |
|
Sven Mika
|
05a55a9335
|
[RLlib] Issue 18668: Unity3D env client/server example not working (fix + add to test cases). (#18942)
|
2021-09-30 08:30:20 +02:00 |
|
Sven Mika
|
61a1274619
|
[RLlib] No Preprocessors (part 2). (#18468)
|
2021-09-23 12:56:45 +02:00 |
|
Sven Mika
|
a96dbd885b
|
[RLlib] Reinstate trajectory view API tests. (#18809)
|
2021-09-23 08:31:51 +02:00 |
|
Sven Mika
|
08c09737fa
|
[RLlib] Fix R2D2 (torch) multi-GPU issue. (#18550)
|
2021-09-14 19:58:10 +02:00 |
|
Sven Mika
|
8a066474d4
|
[RLlib] No Preprocessors; preparatory PR #1 (#18367)
|
2021-09-09 08:10:42 +02:00 |
|
Sven Mika
|
9a8ca6a69d
|
[RLlib] Fix Atari learning test regressions (2 bugs) and 1 minor attention net bug. (#18306)
|
2021-09-03 13:29:57 +02:00 |
|
Sven Mika
|
9883505e84
|
[RLlib] Add [LSTM=True + multi-GPU]-tests to nightly RLlib testing suite (for all algos supporting RNNs, except R2D2, RNNSAC, and DDPPO). (#18017)
|
2021-08-24 21:55:27 +02:00 |
|
Sven Mika
|
494ddd98c1
|
[RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. (#17928)
|
2021-08-21 17:05:48 +02:00 |
|
Sven Mika
|
f3bbe4ea44
|
[RLlib] Test cases/BUILD cleanup; split "everything else" (longest running one rn) tests in 2. (#17640)
|
2021-08-16 22:01:01 +02:00 |
|
Sven Mika
|
2bd2ee7a73
|
[RLlib] SampleBatch: Docstring- and API cleanups; Add support for nested data. (#17485)
|
2021-08-16 06:08:14 +02:00 |
|
Sven Mika
|
b539560429
|
[RLlib] Remove SampleBatch._get_slice_indices() deprecated logging spam (moderate). (#17567)
|
2021-08-04 04:21:32 -04:00 |
|
Sven Mika
|
5107d16ae5
|
[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530)
|
2021-08-03 18:30:02 -04:00 |
|
Sven Mika
|
924f11cd45
|
[RLlib] Torch algos use now-framework-agnostic MultiGPUTrainOneStep execution op (~33% speedup for PPO-torch + GPU). (#17371)
|
2021-08-03 11:35:49 -04:00 |
|
Sven Mika
|
5a313ba3d6
|
[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)
|
2021-07-20 14:58:13 -04:00 |
|
Sven Mika
|
c95dea51e9
|
[RLlib] External env enhancements + more examples. (#16583)
|
2021-06-23 09:09:01 +02:00 |
|
Michael Luo
|
474f04e322
|
[RLlib] DDPG/TD3 + A3C/A2C + MARWIL/BC Annotation/Comments/Code Cleanup (#14707)
|
2021-05-19 16:32:29 +02:00 |
|
Sven Mika
|
2303851c3c
|
[RLlib] Torch multi-GPU + LSTM/RNN bug fix. (#15492)
|
2021-05-18 11:51:05 +02:00 |
|
Sven Mika
|
e973b726c2
|
[RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273)
|
2021-04-30 19:26:30 +02:00 |
|
Sven Mika
|
bb8a286cbc
|
[RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684)
|
2021-04-27 10:44:54 +02:00 |
|
Sven Mika
|
b9761d7081
|
[RLlib] Discussion 1759: SampleBatch._get_slice_indices stuck for R2D2 when using incorrect Trainer. (#15451)
Thanks @Manuscrit for raising this issue!
|
2021-04-22 19:21:03 +02:00 |
|
Sven Mika
|
cecfc3b43b
|
[RLlib] Multi-GPU support for Torch algorithms. (#14709)
|
2021-04-16 09:16:24 +02:00 |
|
Sven Mika
|
9c5a0cfd7a
|
[RLlib] Issue 14385: Policy.compute_actions_from_input_dict does not properly track accessed fields for Policy's view requirements. (#14386)
|
2021-04-11 18:20:04 +02:00 |
|
Raphael CHEN
|
93d4244d9c
|
[RLlib] Correctly get bytes size of SampleBatch (#14801)
|
2021-03-30 19:24:58 +02:00 |
|
Sven Mika
|
4f66309e19
|
[RLlib] Redo issue 14533 tf enable eager exec (#14984)
|
2021-03-29 20:07:44 +02:00 |
|
SangBin Cho
|
fa5f961d5e
|
Revert "[RLlib] Issue 14533: tf.enable_eager_execution() must be called at beginning. (#14737)" (#14918)
This reverts commit 3e389d5812.
|
2021-03-25 00:42:01 -07:00 |
|
Sven Mika
|
3e389d5812
|
[RLlib] Issue 14533: tf.enable_eager_execution() must be called at beginning. (#14737)
|
2021-03-24 12:54:27 +01:00 |
|
Sven Mika
|
69202c6a7d
|
[RLlib] Obsolete usage tracking dict via sample batch. (#13065)
|
2021-03-17 08:18:15 +01:00 |
|
Sven Mika
|
8000258333
|
[RLlib] R2D2 Implementation. (#13933)
|
2021-02-25 12:18:11 +01:00 |
|
Maltimore
|
3a3e4aed86
|
[RLlib] Add __len__() method to SampleBatch (#13371)
|
2021-01-12 20:15:23 +01:00 |
|