hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Sven Mika	ce3e550c43	[RLlib] Enhance comment in example script multi_agent_custom_policy. (#16740 )	2021-07-01 10:28:38 +02:00
Sven Mika	53206dd440	[RLlib] CQL BC loss fixes; PPO/PG/A2\|3C action normalization fixes (#16531 )	2021-06-30 12:32:11 +02:00
Amog Kamsetty	abd16a8438	[RLlib] Skip two_step_game_qmix test (#16758 )	2021-06-29 14:27:48 -07:00
Travis Addair	e5dfa4cfb9	[tune] Only use TBXLoggerCallback when torch is installed (#16695 ) * [tune] Only use TBXLoggerCallback when torch is installed * Fix lint * fix * Update python/ray/tune/utils/callback.py Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com> Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>	2021-06-28 16:34:20 -07:00
Amog Kamsetty	be1f6d59fa	[CI] Re-try Tag rllib flaky tests (#16680 )	2021-06-28 18:42:54 +02:00
AnnaKosiorek	1e709771b2	[rllib][minor] clarification of the softmax axis in dqn_torch_policy (#16311 ) pytorch nn.functional.softmax (unlike tf.nn.softmax) calculates softmax along zeroth dimension by default	2021-06-26 11:19:54 -07:00
architkulkarni	35039869ee	Revert "[RLlib] Add some learning tests to rllib-flaky (#16604 )" (#16677 ) This reverts commit `d1510911e0`.	2021-06-25 09:37:58 -07:00
Eric Liang	1c709cbeb3	Fix typing (#16668 )	2021-06-24 22:06:33 -07:00
Amog Kamsetty	d1510911e0	[RLlib] Add some learning tests to rllib-flaky (#16604 )	2021-06-25 00:28:54 +02:00
Sven Mika	c95dea51e9	[RLlib] External env enhancements + more examples. (#16583 )	2021-06-23 09:09:01 +02:00
mvindiola1	82a3ff795c	[RLlib] ensure curiosity exploration actions are passed in as tf tens… (#15704 )	2021-06-21 10:03:17 -07:00
Benjamin D. Killeen	50049f86d0	[rllib] check if `self.env is not None` explicitly (#15634 ) Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-06-21 10:02:13 -07:00
Sven Mika	be6db06485	[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569 )	2021-06-21 13:46:01 +02:00
Sven Mika	169ddabae7	[RLlib] Issue 15973: Trainer.with_updates(validate_config=...) behaves confusingly. (#16429 )	2021-06-19 22:42:00 +02:00
Sven Mika	79a9d6d517	[RLlib] Issues 16287 and 16200: RLlib not rendering custom multi-agent Envs. (#16428 )	2021-06-19 08:57:53 +02:00
Amog Kamsetty	bd3cbfc56a	Revert "[RLlib] Allow policies to be added/deleted on the fly. (#16359 )" (#16543 ) This reverts commit `e78ec370a9`.	2021-06-18 12:21:49 -07:00
Sven Mika	2900a06dd7	[RLlib] Issue 14503: SAC not allowing custom action distributions. (#16427 )	2021-06-18 17:27:29 +02:00
Sven Mika	e78ec370a9	[RLlib] Allow policies to be added/deleted on the fly. (#16359 )	2021-06-18 10:31:30 +02:00
Sven Mika	a5831f9429	[RLlib] Fix bandit example scripts and add all scripts to CI testing suite.	2021-06-15 13:30:31 +02:00
Sven Mika	d0014cd351	[RLlib] Policies get/set_state fixes and enhancements. (#16354 )	2021-06-15 13:08:43 +02:00
Chris Bamford	fd1a97e39f	[RLlib] Memory leak docs (#15908 )	2021-06-10 18:10:21 +02:00
Sven Mika	3d4dc60e2e	[RLlib] CQL iteration count fixes: Remove dummy buffer and unnecessary store op from exec_plan. (#16332 )	2021-06-10 07:49:17 +02:00
matthewdeng	138b273136	[rllib] Add tests for examples using ray client (#16271 ) * [rllib] add tests for examples using ray client * rename test_client to test_ray_client	2021-06-09 10:39:14 -07:00
Sven Mika	4b8dadccbd	[RLlib] Fix PR 16162: Having added sleep to `_NextValueNotReady` causes TD3 tests to become flakey. (#16309 )	2021-06-08 07:27:02 -07:00
Gerges Dib	f8cf4a1985	[RLlib] Fixed import tensorflow when module not available (#16171 )	2021-06-04 10:07:59 +02:00
Sven Mika	e2be41b407	[RLlib] MARWIL + BC: Various fixes and enhancements. (#16218 )	2021-06-03 22:29:00 +02:00
Sven Mika	c9d220bcda	[RLlib] Upgrade RLlib regression test scripts to new testing tool - RLlib release logs for 1.4. (#16080 )	2021-06-01 17:39:18 +02:00
Chris Bamford	1e3721ef4a	[RLlib] Remove bad spinlocks to allow pytorch GPU scheduler to interrupt. (#16162 )	2021-06-01 16:40:28 +02:00
Sven Mika	5fe34862ce	[RLlib] DDPG torch GPU bug. (#16133 )	2021-05-28 22:09:25 +02:00
Sven Mika	33a69135cb	[RLlib] Issue 16117: DQN/APEX torch not working on GPU. (#16118 )	2021-05-28 09:12:53 +02:00
Sven Mika	f6302d81be	[RLlib] Discussion 2210: BC algo broken, if "advantages" missing in offline data. (#16019 )	2021-05-25 08:47:17 +02:00
Eric Liang	810f5c803a	Disable flaky object spilling test on OSX & adjust test timeouts (#15986 ) * blacklist * move it * adjust according to bazel timeouts * fix build * move to large * Update BUILD	2021-05-24 09:49:59 -07:00
Steven Morad	581d63e607	[RLlib] Fix dnc input shape (#15939 ) Co-authored-by: Steven Morad <sm2558@cam.ac.uk>	2021-05-20 19:06:02 -07:00
Sven Mika	e80095591c	[RLlib] Entropy coeff schedule bug fix and git bisect script. (#15937 )	2021-05-20 18:15:10 +02:00
Sven Mika	03c7c530a9	[RLlib] Issue 15483: Wrong init states (should be non-zero if `ModelV2.get_initial_state` returns non-zero values). (#15733 )	2021-05-20 09:28:09 +02:00
Sven Mika	2d34216660	[RLlib] APEX-DQN: Bug fix for torch and add learning test. (#15762 )	2021-05-20 09:27:03 +02:00
Sven Mika	eaa7f6696d	[RLlib] Issue 15887: MARWIL adv norm update mismatch for tf (static-graph) vs torch versions. (#15898 )	2021-05-19 15:44:11 -07:00
Stefan Schneider	55709bac7a	[RLlib] Examples for training, saving, loading, testing an agent with SB & RLlib (#15897 )	2021-05-19 16:36:59 +02:00
Michael Luo	474f04e322	[RLlib] DDPG/TD3 + A3C/A2C + MARWIL/BC Annotation/Comments/Code Cleanup (#14707 )	2021-05-19 16:32:29 +02:00
Steven Morad	d8eed68af2	[RLlib] Add differentiable neural computer example (#14844 )	2021-05-19 09:15:39 +02:00
Rick Lan	3b1b1d74fe	[rllib] Read "logger_config" first before "prefix". (#15871 )	2021-05-18 10:50:46 -07:00
Sven Mika	7e260edb07	[RLlib] Fix small memory leak in SimpleListCollector (already superseeded by Bam4d's PR + small fix in error message). (#15783 )	2021-05-18 16:02:03 +02:00
Chris Bamford	0be83d9a95	[RLlib] Fixing Memory Leak In Multi-Agent environments. Adding tooling for finding memory leaks in workers. (#15815 )	2021-05-18 13:23:00 +02:00
Sven Mika	d2c755ccef	[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832 )	2021-05-18 13:18:12 +02:00
Sven Mika	2303851c3c	[RLlib] Torch multi-GPU + LSTM/RNN bug fix. (#15492 )	2021-05-18 11:51:05 +02:00
Sven Mika	839fc59224	[RLlib] CQL TensorFlow support (#15841 )	2021-05-18 11:10:46 +02:00
Sven Mika	a36b9305d4	[RLlib] Better error message when deep-learning framework not installed. (#15735 )	2021-05-18 11:06:05 +02:00
Sven Mika	6f4d988713	[RLlib] Issue 15556: Fix R2D2 using chunks from previous episodes in the "burn-in" window. (#15737 )	2021-05-18 11:05:42 +02:00
Sven Mika	308ea62430	[RLlib] Fix "seed" setting to work in all frameworks and w/ all CUDA versions. (#15682 )	2021-05-18 11:00:24 +02:00
Sven Mika	f25d58492d	[Testing] Dependabot for RLlib. (#15812 )	2021-05-17 18:24:13 +02:00

1 2 3 4 5 ...

823 commits