Storage configurations

Check your docs version

This version of the Anyscale docs is deprecated. Go to the latest version for up-to-date information.

Anyscale provides out-of-the-box storage options, including:

  1. Local storage for a Node
  2. Storage shared across Nodes
  3. Object storage

Configuring local storage for a node

To configure the root disk or volume of a node, modify the compute config where you specify instance types. If the node has NVMe storage available, Anyscale automatically mounts it at /mnt/local_storage.

Root disk modifications

The default disk size for an Anyscale node is 150 GB. To change the default, use the Advanced Configuration section of the compute config.

The following example sets the root drive for all nodes in the cluster to 500 GB:

AWS 500 GB Example
{
  "BlockDeviceMappings": [
    {
      "Ebs": {
        "VolumeSize": 500,
        "VolumeType": "gp3",
        "DeleteOnTermination": true
      },
      "DeviceName": "/dev/sda1"
    }
  ]
}
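
If you generate compute configs from scripts, the fragment above can be built programmatically. The following is a minimal Python sketch, not an Anyscale API: the helper name root_volume_config is hypothetical, and its defaults (gp3, /dev/sda1) are copied from the example above.

```python
import json


def root_volume_config(size_gb, volume_type="gp3", device_name="/dev/sda1"):
    """Build an advanced-config fragment that sets the root volume size.

    Hypothetical helper; the defaults mirror the AWS 500 GB example.
    """
    if size_gb <= 0:
        raise ValueError("size_gb must be positive")
    return {
        "BlockDeviceMappings": [
            {
                "Ebs": {
                    "VolumeSize": size_gb,
                    "VolumeType": volume_type,
                    "DeleteOnTermination": True,
                },
                "DeviceName": device_name,
            }
        ]
    }


# Reproduce the 500 GB example and print it as JSON for pasting into the
# Advanced Configuration section.
config_500 = root_volume_config(500)
print(json.dumps(config_500, indent=2))
```

The printed JSON should match the example above; check the device name against the AMI you use before relying on it.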

NVMe configuration

Anyscale supports the Non-Volatile Memory Express (NVMe) interface for accessing SSD storage volumes, which provide additional temporary storage on an instance. NVMe storage offers higher throughput and lower latency than the root disk. By default, Anyscale exposes /mnt/local_storage as the mount path in the Ray container. For instance types that don't have NVMe, /mnt/local_storage falls back to the root disk.

If you are using AWS, choose EC2 instance types that have NVMe (refer to the AWS instance store documentation for details). Anyscale then automatically detects the devices, formats them, and mounts them when the Ray container starts.

For EC2 instance types that have multiple NVMe devices, Anyscale also combines them into a software RAID 0 array, which maximizes disk performance.
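
Because /mnt/local_storage may be backed by either NVMe or the root disk, it can help to check the available space before writing large temporary files. The following is a minimal Python sketch using only the standard library. The helper names scratch_dir and free_gib are hypothetical, and the fallback to the system temp directory is an assumption so that the sketch also runs outside an Anyscale cluster.

```python
import os
import shutil
import tempfile

LOCAL_STORAGE = "/mnt/local_storage"  # mount path named in the text above


def scratch_dir(preferred=LOCAL_STORAGE):
    """Return `preferred` if it is an existing directory, else the system temp dir."""
    return preferred if os.path.isdir(preferred) else tempfile.gettempdir()


def free_gib(path):
    """Return the free space, in GiB, on the filesystem that holds `path`."""
    return shutil.disk_usage(path).free / 2**30


path = scratch_dir()
print(f"scratch dir: {path}, free: {free_gib(path):.1f} GiB")
```

This reports free space only; it does not tell you whether the path is NVMe-backed.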

EC2 NVMe Example

Configuring storage shared across nodes

Anyscale automatically mounts NFS on Workspace, Job, and Service Clusters at the following paths. To opt out, see Opting out of mounting NFS.

  1. /mnt/shared_storage -- accessible to all the Anyscale users of the same Anyscale Cloud. It's mounted on every Node of all the Clusters in the same Cloud.
  2. /mnt/user_storage -- private to the Anyscale user but accessible from every Node of all their Workspaces, Jobs, and Services Clusters.
  3. /mnt/cluster_storage -- accessible to all Nodes of a single Workspace, Job, or Service Cluster.
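
The mount paths listed above behave like ordinary directories, so standard file I/O works on them. The following is a minimal Python sketch of writing a file on one Node and reading it from another Node of the same Cluster. The helper names write_shared and read_shared are hypothetical, and the demo at the end uses a temporary directory as a stand-in for /mnt/cluster_storage so that it runs anywhere.

```python
import os
import tempfile

CLUSTER_STORAGE = "/mnt/cluster_storage"  # path from the list above


def write_shared(relative_path, text, base_dir=CLUSTER_STORAGE):
    """Write `text` to a file under `base_dir` and return its full path."""
    full_path = os.path.join(base_dir, relative_path)
    os.makedirs(os.path.dirname(full_path), exist_ok=True)
    with open(full_path, "w", encoding="utf-8") as f:
        f.write(text)
    return full_path


def read_shared(relative_path, base_dir=CLUSTER_STORAGE):
    """Read a text file that was written under `base_dir`."""
    with open(os.path.join(base_dir, relative_path), encoding="utf-8") as f:
        return f.read()


# Demo against a temporary directory standing in for the NFS mount.
with tempfile.TemporaryDirectory() as demo_base:
    write_shared("run-1/notes.txt", "epoch 3 done", base_dir=demo_base)
    print(read_shared("run-1/notes.txt", base_dir=demo_base))
```

On a Cluster, omit base_dir to use /mnt/cluster_storage, or pass /mnt/user_storage or /mnt/shared_storage for the wider scopes.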

Limitations

NFS storage is a single file system per Anyscale Cloud, so limits imposed by the underlying cloud provider apply to the whole Cloud.

AWS has a hard limit of 25,000 “connections” for each file system. Within each Anyscale Cloud, at most 25,000 running instances or nodes can be connected to the file system at any moment, counted across all clusters (Workspaces, Services, Jobs) and all users.
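
Because the limit is shared across the whole Cloud, the relevant number is the sum of nodes over every running cluster. The following Python sketch shows that arithmetic; the cluster names and node counts are made-up illustration values, and remaining_nfs_connections is a hypothetical helper, not an Anyscale API.

```python
AWS_NFS_CONNECTION_LIMIT = 25_000  # per-file-system limit stated above


def remaining_nfs_connections(nodes_per_cluster, limit=AWS_NFS_CONNECTION_LIMIT):
    """Return how many more nodes can connect; a negative value means over the limit."""
    return limit - sum(nodes_per_cluster.values())


# Hypothetical node counts for clusters running in one Anyscale Cloud.
planned = {"workspace-a": 40, "job-nightly": 1200, "service-api": 300}
print(remaining_nfs_connections(planned))  # 25000 - 1540 = 23460
```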

Opting out of mounting NFS

You can opt out of mounting NFS for Anyscale Jobs and Services, but not for Workspaces because Workspaces rely on NFS storage to work properly.

To opt out, turn off the NFS Mount flag in the Cluster-wide advanced configuration and use this compute config for the Jobs and Services.

{
  "TagSpecifications": [
    {
      "Tags": [
        {
          "Key": "as-feature-disable-nfs-mount",
          "Value": "true"
        }
      ],
      "ResourceType": "instance"
    }
  ]
}
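
If a compute config already has other advanced settings, the opt-out tag needs to be added without dropping them. The following is a minimal Python sketch of that merge. The helper with_nfs_disabled is hypothetical, and it assumes that the tag can sit alongside other advanced configuration keys in the same JSON object.

```python
import copy
import json

# Tag key and value copied from the configuration above.
DISABLE_NFS_TAG = {"Key": "as-feature-disable-nfs-mount", "Value": "true"}


def with_nfs_disabled(advanced_config=None):
    """Return a copy of `advanced_config` with the NFS opt-out instance tag set."""
    config = copy.deepcopy(advanced_config) if advanced_config else {}
    specs = config.setdefault("TagSpecifications", [])
    # Reuse an existing instance-level tag specification if there is one.
    for spec in specs:
        if spec.get("ResourceType") == "instance":
            tags = spec.setdefault("Tags", [])
            break
    else:
        tags = []
        specs.append({"Tags": tags, "ResourceType": "instance"})
    # Drop any earlier value for the key, then add the opt-out tag once.
    tags[:] = [t for t in tags if t.get("Key") != DISABLE_NFS_TAG["Key"]]
    tags.append(dict(DISABLE_NFS_TAG))
    return config


print(json.dumps(with_nfs_disabled(), indent=2))
```

Called with no arguments, the helper produces the same structure as the configuration above; called with an existing dictionary, it leaves the other keys untouched and does not modify the input.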

Configuring object storage

For every Anyscale Cloud, Anyscale configures a default object storage bucket during Cloud deployment. You don't need any additional configuration in the compute config.