jeffejefe/aws-open-data

aws-open-data

Introduction

The AWS Open Data program hosts many publicly available datasets. This repo compiles the list of all datasets on AWS as a TSV file and as a JSON file, making them easier to find and use programmatically. The list is updated daily.

A complete list of AWS open datasets as individual YAML files is available here.

Usage

This repo provides the list of AWS open datasets in two formats: TSV and JSON.

The TSV file can be read into a pandas DataFrame with the following code:

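import pandas as pd

# Raw URL of the tab-separated dataset list
url = 'https://github.com/giswqs/aws-open-data/raw/master/aws_open_datasets.tsv'
# The file is tab-delimited, so set the separator explicitly
df = pd.read_csv(url, sep='\t')
df.head()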
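
Once the list is loaded, a common next step is to search it for a topic. The following is a minimal sketch of a keyword filter, not part of this repo: the function name `search_datasets` is hypothetical, and because the column names of the TSV file are not documented here, the sketch searches every column instead of assuming specific ones.

```python
import pandas as pd


def search_datasets(df: pd.DataFrame, keyword: str) -> pd.DataFrame:
    """Return the rows in which any column contains `keyword` (case-insensitive).

    Hypothetical helper: it makes no assumption about column names.
    """
    # Start with an all-False mask and OR in the matches from each column.
    mask = pd.Series(False, index=df.index)
    for col in df.columns:
        # Convert to pandas' string dtype so missing values stay missing
        # instead of turning into the literal text "nan".
        text = df[col].astype("string")
        # regex=False treats the keyword as plain text; na=False skips missing cells.
        hits = text.str.contains(keyword, case=False, regex=False, na=False)
        mask |= hits.astype(bool)
    return df[mask]
```

With the DataFrame loaded above, `search_datasets(df, 'satellite')` would return only the rows that mention that word in some column. Matching is plain substring matching, so short keywords may produce more hits than intended.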

Related Projects
