Get Started
Tutorials
Notebooks
Public Profiles
Projects
Experiments
Jobs
Models
Deployments
Machines
Data
Gradient SDK
Gradient Private Cloud
Instances
Release Notes

Public Datasets Repository

Jobs and notebooks have access to a read-only directory that is mounted at /datasets. This directory includes the following public datasets (with many more to come).

List of Public Datasets

Name & Path

Description

Source

Fast.ai

/datasets/fastai/

Paperspace's Fast.ai template is built for getting up and running with the enormously popular Fast.ai online MOOC called Practical Deep Learning for Coders.

http://files.fast.ai/data/

CelebA

/datasets/celebA/

CelebFaces Attributes Dataset (CelebA) is a large-scale face attributes dataset with more than 200K celebrity images, each with 40 attribute annotations.

http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html

LSUN

/datasets/lsun/

Contains around one million labeled images for each of 10 scene categories and 20 object categories.

http://lsun.cs.princeton.edu/2017/

http://www.yf.io/p/lsun

MNIST

/datasets/mnist/

The MNIST database of handwritten digits, available from this page, has a training set of 60,000 examples, and a test set of 10,000 examples

http://yann.lecun.com/exdb/mnist/

COCO

/datasets/coco

COCO is a large-scale object detection, segmentation, and captioning dataset.

http://cocodataset.org/

Selfie

/datasets/selfie

Selfie dataset contains 46,836 selfie images annotated with 36 different attributes divided into several categories.

http://crcv.ucf.edu/data/Selfie/

StyleGan

/datasets/stylegan

StyleGan is a Style-Based Generator Architecture for Generative Adversarial Networks. This dataset allows for photographs of people to be produced by the generator that allows control over different aspects of the image

https://github.com/NVlabs/stylegan