The Overview Dashboard in VESSL provides a unified interface for monitoring and managing both provisioned and serverless model services. It equips ML teams with the tools to oversee operational metrics, worker status, and resource utilization effectively.

Dashboard Layout

The dashboard is designed to provide a comprehensive view at a glance and is structured into several key sections depending on the deployment mode—provisioned or serverless.

Upper Region

This area displays the activated endpoint for the service, which is consistent across both deployment modes.

You can navigate to your service’s Swagger page, if enabled, by clicking the Request button.

Workers

The Workers section provides a hostmap view of the current state of the service’s revisions and workers.

  • Provisioned Mode: Display the details of service revision, view logs, filter metrics on the dashboard graphs, and manage the workers serving each model.
  • Serverless Mode: Note that autoscaling configurations are not modifiable in this mode.

Metrics

Metrics tracking is available in both modes but includes additional metrics for queue in serverless mode:

  • Provisioned Mode: Key service metrics such as autoscaling and request counts.
  • Serverless Mode: Autoscaling and queue length.
    • Request Queue Length: The number of requests waiting to be processed. It is limited to 1000 requests. If you need more capacity, please contact our sales team
    • In Progress Queue Length: The number of requests currently being processed.

You can set up custom dashboards using Grafana to track and visualize performance metrics that are crucial for your serverless services. This tab helps you keep an eye on: For more details on setting up and configuring Grafana dashboards, please refer to Grafana Documentation.