> ## Documentation Index
> Fetch the complete documentation index at: https://docs.vessl.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Dataset (Deprecated)

<Warning>
  The `dataset` functions are deprecated.
  Please use the `vessl.storage` package to manage datasets with VESSL storage volumes.
</Warning>

### read\_dataset

```python theme={null}
vessl.read_dataset(
   dataset_name: str, **kwargs
)
```

Read a dataset in the default organization. If you want to override the
default organization, then pass `organization_name` as `**kwargs`.

**Args**

* `dataset_name` (str) : Dataset name.

**Example**

```python theme={null}
vessl.read_dataset(
   dataset_name="mnist",
)
```

***

## read\_dataset\_version

```python theme={null}
vessl.read_dataset_version(
   dataset_id: int, dataset_version_hash: str, **kwargs
)
```

Read the specific version of dataset in the default organization. If you
want to override the default organization, then pass `organization_name` as
`**kwargs`.

**Args**

* `dataset_id` (int) : Dataset id.
* `dataset_version_hash` (str) : Dataset version hash.

**Example**

```python theme={null}
vessl.read_dataset_version(
    dataset_id=1,
    dataset_version_hash="hash123"
)
```

***

## list\_datasets

```python theme={null}
vessl.list_datasets(
   **kwargs
)
```

List datasets in the default organization. If you want to override the
default organization, then pass `organization_name` as `**kwargs`.

**Example**

```
vessl.list_datasets()
```

***

## create\_dataset

```python theme={null}
vessl.create_dataset(
   dataset_name: str, description: str = None, is_version_enabled: bool = False,
   is_public: bool = False, external_path: str = None, aws_role_arn: str = None,
   version_path: str = None, **kwargs
)
```

Create a dataset in the default organization. If you want to override
the default organization, then pass `organization_name` as `**kwargs`.

**Args**

* `dataset_name` (str) : Dataset name.
* `description` (str) : dataset description. Defaults to None.
* `is_version_enabled` (bool) : True if a dataset versioning is set,
  False otherwise. Defaults to False.
* `is_public` (bool) : True if a dataset is source from a public bucket, False
  otherwise. Defaults to False.
* `external_path` (str) : AWS S3 or Google Cloud Storage bucket URL. Defaults
  to None.
* `aws_role_arn` (str) : AWS Role ARN to access S3. Defaults to None.
* `version_path` (str) : Versioning bucket path. Defaults to None.

**Example**

```python theme={null}
vessl.create_dataset(
    dataset_name="mnist",
    is_public=True,
    external_path="s3://savvihub-public-apne2/mnist"
)
```

***

## list\_dataset\_volume\_files

```python theme={null}
vessl.list_dataset_volume_files(
   dataset_name: str, need_download_url: bool = False, path: str = '',
   recursive: bool = False, **kwargs
)
```

List dataset volume files in the default organization. If you want to
override the default organization, then pass `organization_name` as
`**kwargs`.

**Args**

* `dataset_name` (str) : Dataset name.
* `need_download_url` (bool) : True if you need a download URL, False
  otherwise. Defaults to False.
* `path` (str) : Directory path to list. Defaults to root(""),
* `recursive` (bool) : True if list files recursively, False otherwise.
  Defaults to False.

**Example**

```python theme={null}
vessl.list_dataset_volume_files(
    dataset_name="mnist",
    recursive=True,
)
```

***

## upload\_dataset\_volume\_file

```python theme={null}
vessl.upload_dataset_volume_file(
   dataset_name: str, source_path: str, dest_path: str, **kwargs
)
```

Upload file to the dataset. If you want to override the default
organization, then pass `organization_name` as `**kwargs`.

**Args**

* `dataset_name` (str) : Dataset name.
* `source_path` (str) : Local source path.
* `dest_path` (str) : Destination path within the dataset.

**Example**

```python theme={null}
vessl.upload_dataset_volume_file(
    dataset_name="mnist",
    source_path="test.csv",
    dest_path="train",
)
```

***

## download\_dataset\_volume\_file

```python theme={null}
vessl.download_dataset_volume_file(
   dataset_name: str, source_path: str, dest_path: str, **kwargs
)
```

Download file from the dataset. If you want to override the default
organization, then pass `organization_name` as `**kwargs`.

**Args**

* `dataset_name` (str) : Dataset name.
* `source_path` (str) : Source path within the dataset.
* `dest_path` (str) : Local destination path.

**Example**

```python theme={null}
vessl.download_dataset_volume_file(
    dataset_name="mnist",
    source_path="train/test.csv",
    dest_path=".",
)
```

***

## copy\_dataset\_volume\_file

```python theme={null}
vessl.copy_dataset_volume_file(
   dataset_name: str, source_path: str, dest_path: str, **kwargs
)
```

Copy files within the same dataset. Noted that this is not supported for
externally sourced datasets like S3 or GCS. If you want to override the
default organization, then pass `organization_name` as `**kwargs`.

**Args**

* `dataset_name` (str) : Dataset name.
* `source_path` (str) : Source path within the dataset.
* `dest_path` (str) : Local destination path.

**Example**

```python theme={null}
vessl.download_dataset_volume_file(
    dataset_name="mnist",
    source_path="train/test.csv",
    dest_path="test/test.csv",
)
```

***

## delete\_dataset\_volume\_file

```python theme={null}
vessl.delete_dataset_volume_file(
   dataset_name: str, path: str, **kwargs
)
```

Delete the dataset volume file. Noted that this is not supported for
externally sourced datasets like S3 or GCS. If you want to override the
default organization, then pass `organization_name` as `**kwargs`.

**Args**

* `dataset_name` (str) : Dataset name.
* `path` (str) : File path.

**Example**

```python theme={null}
vessl.delete_dataset_volume_file(
    dataset_name="mnist",
    path="train/test.csv",
)
```
