Skip to content

Download GitHub Archive data and upload it to an Amazon S3 bucket.

Notifications You must be signed in to change notification settings

xmartlabs/gh2s3

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GH2S3

It downloads GitHub Archive 2016 data and uploads it to an Amazon S3 bucket.

It's preferred to run it inside an Amazon EC2 instance, for better bandwidth and latency.

Run locally

With Python 3 and pip:

pip install -r requirements.txt

You need to setup your AWS credentials, the same way it's done with AWS CLI.

Then run:

export S3_BUCKET=YOUR_BUCKET
./gh2s3.py

With Docker

docker build -t gh2s3 .

docker run \
    --rm \
    -e "AWS_ACCESS_KEY_ID=YOUR_ID" \
    -e "AWS_SECRET_ACCESS_KEY=YOUR_KEY" \
    -e "S3_BUCKET=YOUR_BUCKET" \
    gh2s3

About

Download GitHub Archive data and upload it to an Amazon S3 bucket.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages