Sven Mika
|
47eb6613b5
|
[RLlib] Remove unnecessary copies in compute_advantages. (#10897)
|
2020-09-29 12:25:20 +02:00 |
|
Eric Liang
|
609c1b8acd
|
Start moving ray internal files to _private module (#10994)
|
2020-09-24 22:46:35 -07:00 |
|
Sven Mika
|
805dad3bc4
|
[RLlib] SAC algo cleanup. (#10825)
|
2020-09-20 11:27:02 +02:00 |
|
Eric Liang
|
f83c588f08
|
[rllib] Remove broken no eager on workers mode (#10745)
* remove no eager
* Update trainer.py
|
2020-09-15 17:25:20 -07:00 |
|
Sven Mika
|
4b278c36fc
|
[RLlib] Behavioral Cloning (from MARWIL). (#10619)
|
2020-09-09 17:33:21 +02:00 |
|
Alex Wu
|
a699f6a4d8
|
[Core] Fix override memory and object_store_memory in decorator (#10563)
|
2020-09-06 20:56:48 -07:00 |
|
Sven Mika
|
244aafdcf8
|
[RLlib] Curiosity enhancements. (#10373)
|
2020-09-05 13:14:24 +02:00 |
|
architkulkarni
|
6ae9e76b81
|
[RLlib] Fix seeding issue (#10589)
|
2020-09-04 17:17:53 -07:00 |
|
Sven Mika
|
715ee8dfc9
|
[RLlib] Issue 10469: Callbacks should receive env idx ... (#10477)
|
2020-09-03 17:27:05 +02:00 |
|
Eric Liang
|
2a204260a8
|
[api] Second round of 1.0 API changes: exceptions, num_return_vals (#10377)
|
2020-08-28 19:57:02 -07:00 |
|
Eric Liang
|
519354a39a
|
[api] Initial API deprecations for Ray 1.0 (#10325)
|
2020-08-28 15:03:50 -07:00 |
|
raoul-khour-ts
|
c8c4832794
|
Prevent Local Worker creation from blocking remote worker creation by creating remote workers before local worker (#10245)
* create remote workers before local worker
* reformatted
|
2020-08-24 12:29:55 -07:00 |
|
Sven Mika
|
e968b52cb7
|
[RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950)
|
2020-08-21 12:35:16 +02:00 |
|
Sven Mika
|
d14b501692
|
[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115)
|
2020-08-20 17:05:57 +02:00 |
|
Sven Mika
|
2cbe29a7fa
|
[RLlib] Curiosity minor fixes, do-overs, and testing. (#10143)
|
2020-08-19 17:49:50 +02:00 |
|
Sven Mika
|
aeb5be7733
|
[RLlib] Trajectory View API (part 2.5): Actual implementations (not used yet) of a SampleCollector. (#10112)
|
2020-08-15 15:09:00 +02:00 |
|
Sven Mika
|
2256047876
|
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114)
|
2020-08-15 13:24:22 +02:00 |
|
yncxcw
|
32cd94b750
|
[Core] Do not convert gpu id to int (#9744)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
|
2020-08-11 12:09:46 -07:00 |
|
Barak Michener
|
8e76796fd0
|
ci: Redo format.sh --all script & backfill lint fixes (#9956)
|
2020-08-07 16:49:49 -07:00 |
|
Sven Mika
|
57690a3a9f
|
[RLlib] Trajectory view API - 02 actual API scaffold (#9753)
|
2020-08-06 10:54:20 +02:00 |
|
Michael Luo
|
4d7bd8c892
|
[RLlib] Implementation of "Model-based Meta Policy Optimization" (MB MPO) (#9409)
|
2020-08-02 18:12:09 +02:00 |
|
Miguel Morales
|
372114b4ed
|
Update sampler.py (#9805)
Minor fix for warning string
|
2020-07-29 22:58:35 -07:00 |
|
Sven Mika
|
b0b0463161
|
[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678)
|
2020-07-29 21:15:09 +02:00 |
|
Sven Mika
|
e4c5d3526f
|
Issue 9631: Tf1.14 does not have tf.config.list_physical_devices. (#9681)
|
2020-07-24 21:48:58 +02:00 |
|
Sven Mika
|
8204717eed
|
[RLlib] Issue 9218: PyTorch Policy places Model on GPU even with num_gpus=0 (#9516)
|
2020-07-17 05:53:25 +02:00 |
|
Sven Mika
|
03ab86567f
|
[RLlib] Layout of Trajectory View API (new class: Trajectory; not used yet). (#9269)
|
2020-07-14 04:27:49 +02:00 |
|
Sven Mika
|
fcdf410ae1
|
[RLlib] Tf2.x native. (#8752)
|
2020-07-11 22:06:35 +02:00 |
|
Hao Chen
|
d49dadf891
|
Change Python's ObjectID to ObjectRef (#9353)
|
2020-07-10 17:49:04 +08:00 |
|
Sven Mika
|
5b2a97597b
|
[RLlib] Retire try_import_tree (should be installed along with other requirements). (#9211)
- Retire try_import_tree.
- Stabilize test_supported_multi_agent.py.
|
2020-07-02 13:06:34 +02:00 |
|
Sven Mika
|
43043ee4d5
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf()). (#9136)
* WIP.
* Fixes.
* LINT.
* WIP.
* WIP.
* Fixes.
* Fixes.
* Fixes.
* Fixes.
* WIP.
* Fixes.
* Test
* Fix.
* Fixes and LINT.
* Fixes and LINT.
* LINT.
|
2020-06-30 10:13:20 +02:00 |
|
Sven Mika
|
5c6d5d4ab1
|
This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970)
|
2020-06-27 20:50:01 +02:00 |
|
Eric Liang
|
1e0e1a45e6
|
[rllib] Add type annotations for evaluation/, env/ packages (#9003)
|
2020-06-19 13:09:05 -07:00 |
|
Ian Rodney
|
2e972c2a77
|
RLLIB and pylintrc (#8995)
|
2020-06-17 18:14:25 +02:00 |
|
Sven Mika
|
7008902cff
|
[RLlib] Minor rllib.utils cleanup. (#8932)
|
2020-06-16 08:52:20 +02:00 |
|
Eric Liang
|
34bae27ac7
|
[rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893)
|
2020-06-12 20:17:27 -07:00 |
|
Sven Mika
|
8d1ccfd0f7
|
[RLlib] Issue 8889: action clipping bug ppo not learning mujoco (#8898)
|
2020-06-11 19:17:43 +02:00 |
|
Sven Mika
|
a90cd0fcbb
|
[RLlib] Unity3d soccer benchmarks (#8834)
|
2020-06-11 14:29:57 +02:00 |
|
mehrdadn
|
f93bb008bb
|
Change os.uname()[1] and socket.gethostname() to the portable and faster platform.node() (#8839)
Co-authored-by: Mehrdad <noreply@github.com>
|
2020-06-08 21:29:46 -07:00 |
|
Sven Mika
|
368088be85
|
[RLlib] Sample batch docs and cleanup. (#8778)
|
2020-06-04 22:47:32 +02:00 |
|
Sven Mika
|
d8a081a185
|
[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590)
|
2020-05-30 22:48:34 +02:00 |
|
Sven Mika
|
2746fc0476
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
|
Sven Mika
|
6d196197bc
|
[RLlib] utils/spaces ... (#8608)
|
2020-05-27 10:21:30 +02:00 |
|
Eric Liang
|
9a83908c46
|
[rllib] Deprecate policy optimizers (#8345)
|
2020-05-21 10:16:18 -07:00 |
|
Sven Mika
|
d76578700d
|
[RLlib] Policy.compute_single_action() broken for nested actions (Issue 8411). (#8514)
|
2020-05-20 22:29:08 +02:00 |
|
Eric Liang
|
9d012626e5
|
[rllib] Distributed exec workflow for impala (#8321)
|
2020-05-11 20:24:43 -07:00 |
|
Eric Liang
|
f48da50e1c
|
[rllib] observation function api for multi-agent (#8236)
|
2020-05-04 22:13:49 -07:00 |
|
Sven Mika
|
b95e28faea
|
[RLlib] APEX_DDPG (PyTorch) test case and docs. (#8288)
|
2020-05-04 09:36:27 +02:00 |
|
Eric Liang
|
2a0ad0b8ce
|
[rllib] [hotfix] Remove assert that trips on pytorch multiagent (#8241)
|
2020-05-01 06:32:54 +02:00 |
|
Eric Liang
|
baadbdf8d4
|
[rllib] Execute PPO using training workflow (#8206)
* wip
* add kl
* kl
* works now
* doc update
* reorg
* add ddppo
* add stats
* fix fetch
* comment
* fix learner stat regression
* test fixes
* fix test
|
2020-04-30 01:18:09 -07:00 |
|
Sven Mika
|
1775e89f26
|
[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143)
Deprecate TupleActions and support arbitrarily nested action spaces.
Closes issue #8143.
|
2020-04-28 14:59:16 +02:00 |
|