Anyscale task dashboard

This page provides details on using the Anyscale task dashboard for monitoring Ray tasks in your Anyscale jobs and workspaces.

Important: This feature is in beta release. Using this feature has cost implications.

The Anyscale task dashboard persists task details beyond the lifetime of the cluster for easy offline debugging. Anyscale uses compute and storage in your cloud provider account for the task dashboard, serving the dashboard to the Anyscale console without passing task details through the Anyscale control plane.

Anyscale has tested support for the task dashboard with millions of tasks. If you encounter limitations due to task count, contact Anyscale support.

Use the Anyscale task dashboard

The Anyscale task dashboard updates in near real-time and provides filtering and aggregate counts for tasks based on the following states:

| State | Description |
| --- | --- |
| Finished | Finished tasks. |
| Failed | Failed tasks. |
| Running | Tasks actively running on your cluster. |
| Pending dependencies | Tasks waiting for dependent tasks to complete. |
| Pending scheduling | Submitted tasks waiting for scheduling on your cluster. |

The following table describes the information in each component of the task dashboard.

| Dashboard component | Description |
| --- | --- |
| Task summary | A count of all tasks and tasks aggregated by state. |
| Tasks by function | A view of all tasks by function name. |
| Tasks by errors | A view of all errors raised by tasks. |
| Tasks by jobs | A view of tasks by job ID. |
| Task table | A detailed view of all tasks that includes options for filtering and search. |

The Tasks by function, Tasks by errors, and Tasks by jobs components include summary metrics on total tasks and max, min, and average time. For all time metrics, the reported value measures the time elapsed between when the Ray application requests scheduling for a task and when the task completes.
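As a rough illustration of how these summary metrics relate to individual tasks, the following sketch groups hypothetical task records by function name and computes the total count and the max, min, and average duration. The record layout and the `summarize_by_function` helper are assumptions made for this example and are not part of any Anyscale or Ray API.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical task records: (function name, total duration in seconds).
# Each duration spans from the scheduling request to task completion.
tasks = [
    ("load_shard", 2.0),
    ("load_shard", 4.0),
    ("train_step", 10.0),
    ("train_step", 30.0),
    ("train_step", 20.0),
]


def summarize_by_function(records):
    """Group durations by function name and compute count, max, min, and average."""
    grouped = defaultdict(list)
    for name, duration in records:
        grouped[name].append(duration)
    return {
        name: {
            "total_tasks": len(durations),
            "max": max(durations),
            "min": min(durations),
            "avg": mean(durations),
        }
        for name, durations in grouped.items()
    }


summary = summarize_by_function(tasks)
for name, stats in summary.items():
    print(name, stats)
```

The same grouping applies to the Tasks by errors and Tasks by jobs views if you key the records by error type or job ID instead of function name.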

The Task table displays details for each task using the following fields. Use the search and filter options to narrow the displayed tasks.

| Field | Description |
| --- | --- |
| ID | The unique ID for each task. |
| Task name | The name of the task. Task names use the function name by default unless you set a custom one. |
| Status | The status of the task. This is the main field used for filtering the task table. |
| Status details | Additional details about the status of the task. For most task states, you can click the More details link under Status details in the Task table to show error messages or cluster event logs. |
| Attempt | The attempt number of the task out of the total number of attempts. |
| Profiling | Profiling options for the task, including CPU flame graph, CPU stack trace, and memory profiling. Only available for running tasks in the current session. |
| Execution time | The amount of time elapsed between when the task starts running and when the task completes. |
| Total duration | The amount of time elapsed between when the Ray application requests scheduling for a task and when the task completes. |
| Start time | The date and time the Ray application enqueued the task for scheduling. |
| End time | The date and time the task finished. |
| Required resources | The type and amount of resources required for the task. |
| Runtime environment | The runtime environment configuration for the task. |
| Node ID | The unique ID for the node where the task ran. |
| Worker PID | The ID of the Ray worker process. |
| Worker ID | The ID of the worker that ran the task. |
| Session | The session in which the task ran. |
| Job ID | The ID of the job where the task ran. Workspaces assign job IDs when you run code that triggers compute on your Ray cluster. |
| Type | Indicates whether the task ran on an actor. |
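The difference between Execution time and Total duration can be sketched with three timestamps for a single task. This is a minimal illustration only: the `task_timings` helper, its parameter names, and the sample timestamps are hypothetical, and the sketch assumes that the gap between the two fields is the time the task spent waiting before it started running.

```python
from datetime import datetime, timedelta


def task_timings(enqueued_at, started_running_at, ended_at):
    """Derive duration values for one task from three timestamps.

    Returns (wait_before_running, execution_time, total_duration) as timedeltas.
    """
    if not (enqueued_at <= started_running_at <= ended_at):
        raise ValueError("timestamps must be in chronological order")
    # Execution time: from when the task starts running to completion.
    execution_time = ended_at - started_running_at
    # Total duration: from the scheduling request (Start time) to completion.
    total_duration = ended_at - enqueued_at
    # Assumed here to be the time spent pending before the task ran.
    wait_before_running = total_duration - execution_time
    return wait_before_running, execution_time, total_duration


# Hypothetical timestamps for a single task.
enqueued = datetime(2025, 1, 1, 12, 0, 0)
started = datetime(2025, 1, 1, 12, 0, 5)
ended = datetime(2025, 1, 1, 12, 0, 35)

wait, execution, total = task_timings(enqueued, started, ended)
print(wait, execution, total)
```

A large gap between Total duration and Execution time for a task suggests that it spent most of its lifetime pending rather than running.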

Requirements and limitations

The task dashboard requires the system cluster to be enabled for your Anyscale cloud. An organization admin must enable the system cluster. See Enable the system cluster.

The following requirements and limitations exist:

  • The task dashboard reports metrics for jobs and workspaces that use Ray 2.49.0 or later.
  • The task dashboard only captures metrics when the system cluster is enabled. Jobs or workspaces launched before you enable the system cluster don't report metrics, even if you enable the system cluster while they're running.
  • The task dashboard is available for jobs and workspaces using only the primary cloud resource. Multi-resource configurations are not supported.

Note: Anyscale recommends using Ray 2.51.0 or later when enabling the task dashboard. In earlier Ray versions, overhead from reporting task metrics might cause performance degradation.
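The version requirements above can be checked with a small helper such as the following sketch. The function names and the returned labels are hypothetical and chosen for this example, and the parser assumes a plain `major.minor.patch` release string without suffixes such as `rc1`.

```python
def parse_version(version):
    """Parse a 'major.minor.patch' string into a tuple of ints.

    Assumes a plain release version; pre-release suffixes are not handled.
    """
    return tuple(int(part) for part in version.split(".")[:3])


MINIMUM = (2, 49, 0)      # earliest Ray version that reports task metrics
RECOMMENDED = (2, 51, 0)  # version recommended to avoid reporting overhead


def task_dashboard_support(ray_version):
    """Classify a Ray version against the task dashboard requirements."""
    parsed = parse_version(ray_version)
    if parsed < MINIMUM:
        return "unsupported"
    if parsed < RECOMMENDED:
        return "supported"
    return "recommended"


for candidate in ("2.48.1", "2.49.0", "2.51.0"):
    print(candidate, task_dashboard_support(candidate))
```

In an environment with Ray installed, you could pass `ray.__version__` to `task_dashboard_support`. Comparing integer tuples rather than raw strings avoids misordering versions such as 2.9.0 and 2.49.0.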

Access the task dashboard

You can view the task dashboard in the Anyscale console for any job or workspace.

Complete the following steps to access the Anyscale task dashboard:

  1. Log in to the Anyscale console.
  2. Click Workspaces or Jobs.
  3. Click the name of a workspace or job.
  4. Click Ray Workloads.
  5. Click Tasks. The task dashboard appears.

Important: Anyscale deploys a system cluster to power the task dashboard. If your cloud doesn't have an active system cluster running, a screen with the message Observability service is launching appears as the system cluster starts.