GPU-accelerated notebook

This example deploys a Jupyter Notebook server. You will also learn how you can connect to the server on VS Code or an IDE of your choice.

Try it on VESSL Hub

Try out the Quickstart example with a single click on VESSL Hub.

Note that if you want to save your credits, remember to Terminate to stop and end the runs.

What you will do

Launch a GPU-accelerated interactive workload
Set up a Jupyter Notebook
Use SSH to connect to the workload

Writing the YAML file

Let’s start by creating notebook.yaml file and fill it in.

Spin up a workload

Let’s repeat the steps from Quickstart and define the compute resource and runtime environment for our workload. Again, we will use the L4 instance from our managed cloud and the latest PyTorch container from NVIDIA NGC.

name: Jupyter notebook
description: A Jupyter Notebook server with an SSH connection
resources:
  cluster: vessl-oci-sanjose
  preset: gpu-l4-small
image: quay.io/vessl-ai/torch:2.3.1-cuda12.1-r5

Configure an interactive run

By default, workloads launched with VESSL Run are batch jobs like the one we launched in our Quickstart example. On the other hand, interactive runs are essentially virtual machines running on GPUs for live interaction with your models and datasets.You can enable this with the interactive key, followed by the jupyter key. Interactive runs come with a default field for idle culler which automatically shuts down user notebook servers when they have not been used for a certain period.max_runtime works with idle_timeout as an additional measure to prevent resource overuse

name: Jupyter notebook
description: A Jupyter Notebook server with an SSH connection
resources:
  cluster: vessl-oci-sanjose
  preset: gpu-l4-small
image: quay.io/vessl-ai/torch:2.3.1-cuda12.1-r5
interactive:
  jupyter:
    idle_timeout: 120m
  max_runtime: 24h

Running the workload

Now that we have a completed YAML, we can once again run the workload with vessl run.

vessl run create -f notebook.yaml

Running the command will verify your YAML and show you the current status of the workload. Click the output link in your terminal to see the full details and realtime logs of the Run on the web. Click Jupyter under Connect to launch a notebook.

Create an SSH connection

An interactive run is essentially a GPU-accelerated workload on a cloud with a port and an endpoint for live interactions. This means you can access the remote workload using SSH.

Get an SSH key pair

First, get an SSH key pair.ssh-keygen -t ed25519 -C "vesslai"

Add the generated key to your account

vessl ssh-key addPress “enter” three times.

Connect via SSH

Use the workload address from the Run Summary page to connect. You are ready to use VS Code or an IDE of your choice for remote development.

Manual access: The below command for manual access may change when you perform a new run due to changes in IP address, user account, port number, SSH key file path, proxy settings, or other options.

ssh -p 22 root@34.127.82.9

Tips & tricks

Keep in mind that GPUs are fully dedicated to a notebook server —and therefore consume credits— even when you are not running compute-intensive cells. To optimize GPU usage, use tools like nbconvert to convert the notebook into a Python file or package it as a Python container and run it as a batch job. You can also mount volumes to interactive workloads by defining import and reference files or datasets from your notebook.

Using our web interface

You can repeat the same process on the web. Head over to your Organization, select a project, and create a New run.

What’s next?

Next, let’s see how you use our interactive workloads to host a web app on the cloud using tools like Streamlit and Gradio.

Stable Diffusion Playground

Launch an interactive web application for Stable Diffusion

Llama 3.2 fine-tuning

Fine-tune Llama 3.2-3B with instruction datasets

Llama 3.1 Deployment

Serve & deploy vLLM-accelerated Llama 3.1-8B

Get Started

Compute

Resource

Admin

Private Hub

Pricing

GPU-accelerated notebook

Try it on VESSL Hub

What you will do

Writing the YAML file

Spin up a workload

Configure an interactive run

Running the workload

Create an SSH connection

Tips & tricks

Using our web interface

What’s next?

Stable Diffusion Playground

Llama 3.2 fine-tuning

Llama 3.1 Deployment

Get Started

Compute

Resource

Admin

Private Hub

Pricing

Try it on VESSL Hub

​What you will do

​Writing the YAML file

Spin up a workload

Configure an interactive run

​Running the workload

​Create an SSH connection

​Tips & tricks

​Using our web interface

​What’s next?

Stable Diffusion Playground

Llama 3.2 fine-tuning

Llama 3.1 Deployment

What you will do

Writing the YAML file

Running the workload

Create an SSH connection

Tips & tricks

Using our web interface

What’s next?