What are Anyscale services?

Anyscale services deploy Ray Serve applications to production endpoints. Anyscale services offer additional benefits on top of Ray Serve, including high availability and zero-downtime upgrades.

Best practices

Anyscale recommends the following best practices when deploying services:

Permission requirements

You can optionally deploy an Anyscale cloud without the permissions required to deploy services. Head node fault tolerance is opt-in for every cloud deployment option that supports it.

The cloud infrastructure that supports Anyscale services uses the networking you configure when you deploy your Anyscale cloud. Deploy a cloud with private networking if you need your load balancer to be private.

Services require additional IAM permissions in your cloud provider account to configure a Redis in-memory store and load balancer. See IAM permissions for AWS or Google Cloud.

The following table provides an overview of support for services with different cloud deployment options:

| Cloud deployment | Deployment method | Details |
| --- | --- | --- |
| Serverless Anyscale cloud (also called Anyscale-hosted cloud) | Deployed by default | Enables services by default. No support for head node fault tolerance or private networking. |
| Anyscale cloud on AWS | anyscale cloud setup | Enables services by default. Opt-in to head node fault tolerance using the --enable-head-node-fault-tolerance flag. |
| Anyscale cloud on AWS | anyscale cloud register | You must configure IAM roles and a MemoryDB instance when deploying your cloud. Contact Anyscale support for assistance customizing the Anyscale Terraform modules for AWS. |
| Anyscale cloud on Google Cloud | anyscale cloud setup | Enables services by default. Opt-in to head node fault tolerance using the --enable-head-node-fault-tolerance flag. |
| Anyscale cloud on Google Cloud | anyscale cloud register | You must configure service account roles and a Memorystore instance when deploying your cloud. Contact Anyscale support for assistance customizing the Anyscale Terraform modules for Google Cloud. |
| Anyscale cloud on Kubernetes | anyscale cloud register | You must configure custom permissions and a Redis in-memory store when deploying your cloud. Contact Anyscale support for assistance customizing the Anyscale Terraform modules for Kubernetes. |

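
If you script cloud provisioning, it can help to encode the table above as data so a pre-flight check can tell you what a given deployment path requires. The following is a minimal sketch, not an Anyscale API: the `ServiceSupport` type, the `SUPPORT` mapping, the short cloud labels, and the `describe` helper are all hypothetical names chosen for illustration. Only the deployment methods, the prerequisites, and the `--enable-head-node-fault-tolerance` flag come from the table.

```python
from typing import NamedTuple, Optional


class ServiceSupport(NamedTuple):
    # Resources the table says you must configure yourself
    # (an empty tuple means the table says services are enabled by default).
    manual_prerequisites: tuple
    # Flag the table names for opting in to head node fault tolerance.
    # None means the table lists no opt-in flag for that row.
    opt_in_flag: Optional[str]


FT_FLAG = "--enable-head-node-fault-tolerance"

# Keys are (cloud, deployment method). The cloud labels are illustrative
# shorthand, not identifiers used by Anyscale.
SUPPORT = {
    ("anyscale-hosted", "default"): ServiceSupport((), None),
    ("aws", "anyscale cloud setup"): ServiceSupport((), FT_FLAG),
    ("aws", "anyscale cloud register"): ServiceSupport(
        ("IAM roles", "MemoryDB instance"), None
    ),
    ("gcp", "anyscale cloud setup"): ServiceSupport((), FT_FLAG),
    ("gcp", "anyscale cloud register"): ServiceSupport(
        ("service account roles", "Memorystore instance"), None
    ),
    ("kubernetes", "anyscale cloud register"): ServiceSupport(
        ("custom permissions", "Redis in-memory store"), None
    ),
}


def describe(cloud: str, method: str) -> str:
    """Summarize what the table says for one cloud and deployment method."""
    support = SUPPORT[(cloud, method)]
    if support.manual_prerequisites:
        return "configure manually: " + ", ".join(support.manual_prerequisites)
    if support.opt_in_flag:
        return "services enabled by default; fault tolerance via " + support.opt_in_flag
    return "services enabled by default; no head node fault tolerance"
```

For example, `describe("aws", "anyscale cloud register")` reports the IAM roles and MemoryDB instance as manual prerequisites, while `describe("gcp", "anyscale cloud setup")` points to the opt-in flag.
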
Important

Anyscale clouds on AWS have changed default behavior for deploying Anyscale services.

Legacy Anyscale clouds on AWS use CloudFormation to configure Elastic Load Balancing for your service. Anyscale now directly configures Elastic Load Balancing for your services.

All new Anyscale clouds on AWS deployed with anyscale cloud setup use this configuration by default. You can run anyscale cloud update to upgrade your legacy AWS clouds deployed with anyscale cloud setup to the new behavior.

Anyscale has updated the Anyscale Terraform modules for AWS to provide the proper IAM permissions for the new default behavior. If you have a legacy Anyscale cloud deployed using anyscale cloud register, contact Anyscale support for assistance updating your cloud IAM permissions.

See Update your IAM role for services on Anyscale clouds on AWS.

Capacity limit

Each Anyscale cloud has a quota of 20 running services. A single service can contain many deployments and can scale to more than 2,000 nodes. To increase your quota, contact Anyscale support.
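
A deployment script can guard against hitting the quota before it tries to start another service. The sketch below assumes you already know how many services are running in the cloud; how you obtain that count is outside its scope. The constant and function names are hypothetical, and the value 20 is the default quota stated above, which may differ if Anyscale support has raised it for your cloud.

```python
# Default quota of running services per Anyscale cloud, as stated above.
SERVICE_QUOTA_PER_CLOUD = 20


def can_start_service(running_services: int, quota: int = SERVICE_QUOTA_PER_CLOUD) -> bool:
    """Return True if one more running service still fits under the quota."""
    if running_services < 0:
        raise ValueError("running_services must be non-negative")
    return running_services < quota
```

With the default quota, `can_start_service(19)` is true and `can_start_service(20)` is false. Pass a different `quota` if your limit has been increased.
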

Pricing

Services use standard Anyscale pricing based on the type of machines used. See the Anyscale pricing page.

In addition to Anyscale costs and virtual machine costs, Anyscale uses load balancer resources and a Redis-compatible in-memory store in your cloud provider account.

Use the following links to learn about pricing details for these services:

| Cloud | Pricing links |
| --- | --- |
| AWS | |
| GCP | |