ray/rllib/tuned_examples/cartpole-marwil-torch.yaml
Sven Mika c2cb5c2214
[RLlib] MARWIL torch. (#7836)
* WIP.

* WIP.

* LINT.

* Fix MARWIL so it can run with eager-mode.

* LINT.
2020-04-06 16:38:50 -07:00


# To generate training data, first run:
# $ ./train.py --run=PPO --env=CartPole-v0 \
#   --stop='{"timesteps_total": 50000}' \
#   --config='{"use_pytorch": true, "output": "/tmp/out", "batch_mode": "complete_episodes"}'
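#
# Then train MARWIL on the recorded episodes. This is a sketch, assuming this
# file sits in the current directory and that train.py takes the experiment
# file through its -f (--config-file) option:
# $ ./train.py -f cartpole-marwil-torch.yaml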
cartpole-marwil-torch:
    env: CartPole-v0
    run: MARWIL
    stop:
        timesteps_total: 500000
    config:
        beta:
            grid_search: [0, 1]  # compare IL (beta=0) vs MARWIL
        input: /tmp/out