hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Sven Mika	c9d220bcda	[RLlib] Upgrade RLlib regression test scripts to new testing tool - RLlib release logs for 1.4. (#16080 )	2021-06-01 17:39:18 +02:00
Chris Bamford	1e3721ef4a	[RLlib] Remove bad spinlocks to allow pytorch GPU scheduler to interrupt. (#16162 )	2021-06-01 16:40:28 +02:00
Sven Mika	5fe34862ce	[RLlib] DDPG torch GPU bug. (#16133 )	2021-05-28 22:09:25 +02:00
Sven Mika	33a69135cb	[RLlib] Issue 16117: DQN/APEX torch not working on GPU. (#16118 )	2021-05-28 09:12:53 +02:00
Sven Mika	f6302d81be	[RLlib] Discussion 2210: BC algo broken, if "advantages" missing in offline data. (#16019 )	2021-05-25 08:47:17 +02:00
Eric Liang	810f5c803a	Disable flaky object spilling test on OSX & adjust test timeouts (#15986 ) * blacklist * move it * adjust according to bazel timeouts * fix build * move to large * Update BUILD	2021-05-24 09:49:59 -07:00
Steven Morad	581d63e607	[RLlib] Fix dnc input shape (#15939 ) Co-authored-by: Steven Morad <sm2558@cam.ac.uk>	2021-05-20 19:06:02 -07:00
Sven Mika	e80095591c	[RLlib] Entropy coeff schedule bug fix and git bisect script. (#15937 )	2021-05-20 18:15:10 +02:00
Sven Mika	03c7c530a9	[RLlib] Issue 15483: Wrong init states (should be non-zero if `ModelV2.get_initial_state` returns non-zero values). (#15733 )	2021-05-20 09:28:09 +02:00
Sven Mika	2d34216660	[RLlib] APEX-DQN: Bug fix for torch and add learning test. (#15762 )	2021-05-20 09:27:03 +02:00
Sven Mika	eaa7f6696d	[RLlib] Issue 15887: MARWIL adv norm update mismatch for tf (static-graph) vs torch versions. (#15898 )	2021-05-19 15:44:11 -07:00
Stefan Schneider	55709bac7a	[RLlib] Examples for training, saving, loading, testing an agent with SB & RLlib (#15897 )	2021-05-19 16:36:59 +02:00
Michael Luo	474f04e322	[RLlib] DDPG/TD3 + A3C/A2C + MARWIL/BC Annotation/Comments/Code Cleanup (#14707 )	2021-05-19 16:32:29 +02:00
Steven Morad	d8eed68af2	[RLlib] Add differentiable neural computer example (#14844 )	2021-05-19 09:15:39 +02:00
Rick Lan	3b1b1d74fe	[rllib] Read "logger_config" first before "prefix". (#15871 )	2021-05-18 10:50:46 -07:00
Sven Mika	7e260edb07	[RLlib] Fix small memory leak in SimpleListCollector (already superseeded by Bam4d's PR + small fix in error message). (#15783 )	2021-05-18 16:02:03 +02:00
Chris Bamford	0be83d9a95	[RLlib] Fixing Memory Leak In Multi-Agent environments. Adding tooling for finding memory leaks in workers. (#15815 )	2021-05-18 13:23:00 +02:00
Sven Mika	d2c755ccef	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
Sven Mika	2303851c3c	[RLlib] Torch multi-GPU + LSTM/RNN bug fix. (#15492 )	2021-05-18 11:51:05 +02:00
Sven Mika	839fc59224	[RLlib] CQL TensorFlow support (#15841 )	2021-05-18 11:10:46 +02:00
Sven Mika	a36b9305d4	[RLlib] Better error message when deep-learning framework not installed. (#15735 )	2021-05-18 11:06:05 +02:00
Sven Mika	6f4d988713	[RLlib] Issue 15556: Fix R2D2 using chunks from previous episodes in the "burn-in" window. (#15737 )	2021-05-18 11:05:42 +02:00
Sven Mika	308ea62430	[RLlib] Fix "seed" setting to work in all frameworks and w/ all CUDA versions. (#15682 )	2021-05-18 11:00:24 +02:00
Sven Mika	f25d58492d	[Testing] Dependabot for RLlib. (#15812 )	2021-05-17 18:24:13 +02:00
Sven Mika	d89fb82bfb	[RLlib] Add simple curriculum learning API and example script. (#15740 )	2021-05-16 17:35:10 +02:00
Sven Mika	ebc6d8692a	[RLlib] Docs: Example scripts and blogs documentation update. (#15763 )	2021-05-16 15:24:38 +02:00
Sven Mika	469f5227da	[RLlib] CQL bug fix: Normalize actions for atanh in BC part of the CQL loss. (#15814 )	2021-05-16 15:21:06 +02:00
Sven Mika	bc09e75b78	[RLlib] Fix 3 flakey test cases. (#15785 )	2021-05-16 12:20:33 +02:00
Ian Rodney	00c913cbc6	[Flaky] Mark `test_nested_observation_spaces` as Flaky (#15794 )	2021-05-14 12:08:52 -07:00
Ian Rodney	82876ecc2a	[rllib] [testing] make kill failure non fatal (#15771 )	2021-05-13 12:24:49 -07:00
Sven Mika	c4a3e1589b	[RLlib] CQL: Bug fixes and OPE example added to test and offline_rl.py example. (#15761 )	2021-05-13 09:17:23 +02:00
Sven Mika	16ddab49f5	[RLlib] Trainer._evaluate -> Trainer.evaluate; Also make evaluation possible w/o evaluation worker set. (#15591 )	2021-05-12 12:16:00 +02:00
Sven Mika	a495759f06	[RLlib] Discussion 2022: PPO should auto-adjust `rollout_fragment_length` if other settings do not align with `train_batch_size`. (#15611 )	2021-05-10 16:16:02 +02:00
Sven Mika	461d73ddf1	[RLlib] `simple_optimizer` should not be used by default for tf+MA. (#15365 )	2021-05-10 16:10:44 +02:00
Sven Mika	46f6fa2361	[RLlib] Example script for restoring 1 agent (out of n) from a checkpoint (multi-agent). (#15540 )	2021-05-10 16:09:05 +02:00
Eric Liang	ff36ae594b	Remove flaky tag from newly unflaky tests (#15639 )	2021-05-05 12:15:46 -07:00
Kai Fricke	1d52ab819f	[release] release 1.3.0 results and test updates (#15366 ) Convert a number of release tests and add logs for release 1.3.0	2021-05-04 22:10:04 +01:00
Sven Mika	c7563a32ed	[RLlib] DD-PPO not supported on Win (add meaningful error message). (#15631 )	2021-05-04 19:26:17 +02:00
Michael Luo	4cbe13cdfd	[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603 ) Co-authored-by: Sven Mika <sven@anyscale.io> Co-authored-by: sven1977 <svenmika1977@gmail.com>	2021-05-04 19:06:19 +02:00
Sven Mika	4b3add0066	[RLlib] Discussion 2021: PPO does not learn vf, iff use_gae=False (ignores use_critic setting). (#15610 )	2021-05-04 14:17:00 +02:00
mvindiola1	170366fbf1	[RLlib] contrib/MADDPG: Make get_weights and set_weights use dictionaries rather than lists. (#14903 ) Co-authored-by: Manny Vindiola <manuel.m.vindiola.civ@mail.mil>	2021-05-04 13:26:39 +02:00
Sertingolix	5a45009ebc	[RLlib] Handle array custom metrics correctly in evaluate (#15190 ) Co-authored-by: Lucas Brunner <lucas.brunner@urb-x.ch>	2021-05-04 13:25:28 +02:00
Antoine Galataud	ce1c001b1d	[RLlib] DQN: Place LearningRateSchedule mixin at the right moment (#15558 )	2021-05-04 13:21:40 +02:00
Yeachan-Heo	0552f6e886	[RLlib] Update alpha_zero_policy.py (#15042 )	2021-05-04 13:20:24 +02:00
Amog Kamsetty	ebc44c3d76	[CI] Upgrade flake8 to 3.9.1 (#15527 ) * formatting * format util * format release * format rllib/agents * format rllib/env * format rllib/execution * format rllib/evaluation * format rllib/examples * format rllib/policy * format rllib utils and tests * format streaming * more formatting * update requirements files * fix rllib type checking * updates * update * fix circular import * Update python/ray/tests/test_runtime_env.py * noqa	2021-05-03 14:23:28 -07:00
Sven Mika	e973b726c2	[RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273 )	2021-04-30 19:26:30 +02:00
Sven Mika	fc3a65f9d4	[RLlib] Split test_checkpoint_restore tests into 3 and make each "large" (from "enormous"). (#15499 )	2021-04-30 12:33:12 +02:00
Sven Mika	78b776942f	[RLlib] Discussion 1928: Initial lr wrong if schedule used that includes ts=0 (both tf and torch). (#15538 )	2021-04-27 17:19:52 +02:00
SebastianBo1995	f5be8d8f74	[Rllib] Offline Learning Bug, different shapes (#15132 )	2021-04-27 17:18:17 +02:00
Sven Mika	bb8a286cbc	[RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684 )	2021-04-27 10:44:54 +02:00

1 2 3 4 5 ...

697 commits