Try out this model on VESSL Hub.

This example runs an app for inference with Mistral-7B, an open-source LLM developed by Mistral AI. The model uses grouped-query attention (GQA) and sliding-window attention (SWA), which speed up inference and let it handle longer sequences at lower cost than comparable models, achieving both efficiency and high performance. Mistral-7B outperforms Llama 2 13B on all benchmarks, and Llama 1 34B on reasoning, mathematics, and code generation benchmarks.
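For intuition, SWA restricts each token to attending only to a fixed window of recent tokens (4,096 in Mistral-7B) rather than the full sequence, so attention cost grows linearly with sequence length. Below is a minimal PyTorch sketch of the resulting attention mask; it is illustrative only, not the model's actual implementation.

import torch

# Illustrative sketch of a sliding-window causal mask (not Mistral's source code).
# True marks the key positions each query token is allowed to attend to.
def sliding_window_mask(seq_len: int, window: int) -> torch.Tensor:
    i = torch.arange(seq_len).unsqueeze(1)  # query positions, shape (seq_len, 1)
    j = torch.arange(seq_len).unsqueeze(0)  # key positions, shape (1, seq_len)
    causal = j <= i                    # never attend to future tokens
    within_window = (i - j) < window   # only the last `window` tokens are visible
    return causal & within_window

print(sliding_window_mask(seq_len=6, window=3).int())  # banded lower-triangular mask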

Running the model

You can run the model with a single command using the VESSL CLI:

vessl run create -f mistral_7b.yaml

Here’s a rundown of the mistral_7b.yaml file:

name: mistral-7b-streamlit
description: A template Run for inference of Mistral-7B with streamlit app
resources:
  cluster: vessl-gcp-oregon   # VESSL-managed GCP cluster in Oregon
  preset: v1.l4-1.mem-42      # resource preset: 1x NVIDIA L4 GPU, 42GB memory
image: quay.io/vessl-ai/hub:torch2.1.0-cuda12.2-202312070053  # PyTorch 2.1.0 + CUDA 12.2 image
import:
  /model/: hf://huggingface.co/VESSL/Mistral-7B  # pull the model weights from Hugging Face
  /code/:                                        # clone the demo code from GitHub
    git:
      url: https://github.com/vessl-ai/hub-model
      ref: main
run:
  - command: |-   # install dependencies, then serve the Streamlit app on port 80
      pip install -r requirements_streamlit.txt
      streamlit run streamlit_demo.py --server.port 80
    workdir: /code/mistral-7B
interactive:
  max_runtime: 24h      # shut the Run down after 24 hours
  jupyter:
    idle_timeout: 120m  # end idle Jupyter sessions after 2 hours
ports:
  - name: streamlit     # expose the Streamlit app over HTTP
    type: http
    port: 80
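
The Run mounts the model weights at /model and serves the Streamlit app from /code/mistral-7B on port 80; once the Run is up, you can open the exposed streamlit endpoint from the Run's detail page. For reference, here is a minimal sketch of what a Streamlit inference app like streamlit_demo.py might look like, assuming the transformers, accelerate, and streamlit packages; the actual implementation lives in the vessl-ai/hub-model repository.

import streamlit as st
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_PATH = "/model"  # the hf:// import above mounts the weights here

@st.cache_resource  # load the model once and reuse it across Streamlit reruns
def load_model():
    tokenizer = AutoTokenizer.from_pretrained(MODEL_PATH)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_PATH,
        torch_dtype=torch.float16,  # fp16 fits a 7B model on a single L4
        device_map="auto",          # requires the accelerate package
    )
    return tokenizer, model

st.title("Mistral-7B Demo")
prompt = st.text_area("Prompt")
if st.button("Generate") and prompt:
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=256)
    st.write(tokenizer.decode(outputs[0], skip_special_tokens=True))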