ray/release/long_running_distributed_tests/compute_tpl.yaml
matthewdeng 86718071fe
[tune] Increase volume size for long running pbt failure (#27163) (#27247)
Cluster startup currently fails with the following error:

Cluster startup Failed. Error: RuntimeError: botocore.exceptions.ClientError: An error occurred (InvalidBlockDeviceMapping) when calling the RunInstances operation: Volume of size 202GB is smaller than  snapshot 'snap-02c4e6a0ad06cf3d6', expect size >= 400GB

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2022-07-29 01:16:40 -07:00

29 lines, no EOL, 588 B, YAML

cloud_id: {{env["ANYSCALE_CLOUD_ID"]}}
region: us-west-2

max_workers: 3

head_node_type:
  name: head_node
  instance_type: g3.8xlarge

worker_node_types:
  - name: worker_node
    instance_type: g3.8xlarge
    min_workers: 2
    max_workers: 2
    use_spot: false

aws:
  TagSpecifications:
    - ResourceType: "instance"
      Tags:
        - Key: anyscale-user
          Value: '{{env["ANYSCALE_USER"]}}'
        - Key: anyscale-expiration
          Value: '{{env["EXPIRATION_2D"]}}'

  BlockDeviceMappings:
    - DeviceName: /dev/sda1
      Ebs:
        VolumeSize: 400
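
The failure in the commit message comes down to one comparison: the requested EBS volume size must be at least the snapshot's minimum size. As a minimal sketch, that check could be run against the `aws` section before launching a cluster. The helper name `undersized_volumes` is hypothetical and not part of Ray or Anyscale. The 202 GB and 400 GB figures come from the error message, and attributing the 202 GB value to `/dev/sda1` is an assumption made for illustration.

```python
def undersized_volumes(aws_config, min_size_gb):
    """Return (device_name, size_gb) pairs whose EBS volume is below min_size_gb.

    Hypothetical helper for illustration; not part of Ray or Anyscale.
    """
    problems = []
    for mapping in aws_config.get("BlockDeviceMappings", []):
        size = mapping.get("Ebs", {}).get("VolumeSize")
        # Mappings without an explicit VolumeSize are skipped, not flagged.
        if size is not None and size < min_size_gb:
            problems.append((mapping.get("DeviceName"), size))
    return problems


# Minimum size reported in the error message ("expect size >= 400GB").
SNAPSHOT_MIN_GB = 400

# Assumed earlier state: 202 GB is from the error message; the device name is a guess.
old_aws = {"BlockDeviceMappings": [{"DeviceName": "/dev/sda1", "Ebs": {"VolumeSize": 202}}]}
# State after this commit, mirroring the template above.
new_aws = {"BlockDeviceMappings": [{"DeviceName": "/dev/sda1", "Ebs": {"VolumeSize": 400}}]}

print(undersized_volumes(old_aws, SNAPSHOT_MIN_GB))  # [('/dev/sda1', 202)]
print(undersized_volumes(new_aws, SNAPSHOT_MIN_GB))  # []
```

The sketch works on plain dictionaries rather than parsing the template directly, because the unquoted `{{env[...]}}` placeholders have to be rendered before the file is valid YAML.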