This is a simple data pipeline that extracts cryptocurrency data from the CoinCap API, using tools such as Docker, cron, AWS EC2, PostgreSQL, and Metabase.
It is based on a project by Joseph Machado. I've made various amendments, including using a different endpoint and extraction script, updating containers to allow for data persistence, and documenting some design decisions.
I wanted to create a simple ETL which would help develop my skills and knowledge further. I also have a passing interest in crypto, and the CoinCap API was a good place to explore this.
- Extract data using CoinCap API
- Load data into PostgreSQL
- Connect with Google Data Studio dashboard
- Orchestrate using cron
- Run with Docker
- Data visualisation with Metabase
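
The extract and load steps listed above can be sketched roughly as follows. This is a minimal illustration, not the project's actual script: the endpoint URL, the response fields (`data`, `timestamp`, `priceUsd`), the `crypto.assets` table, and the column names are all assumptions made for the example, and `psycopg2` is assumed to be installed for the load step.

```python
import json
import urllib.request
from datetime import datetime, timezone

# Assumption: the README does not name the endpoint used, so this URL and
# the response shape ({"data": [...], "timestamp": <ms>}) are illustrative.
ASSETS_URL = "https://api.coincap.io/v2/assets"

# Hypothetical target table and columns.
INSERT_SQL = (
    "INSERT INTO crypto.assets (id, symbol, price_usd, updated_utc) "
    "VALUES (%s, %s, %s, %s)"
)


def fetch_assets(url=ASSETS_URL):
    """Extract: download the JSON payload from the API."""
    with urllib.request.urlopen(url, timeout=10) as resp:
        return json.load(resp)


def to_rows(payload):
    """Transform: flatten the payload into tuples ready for insertion."""
    # The timestamp is assumed to be in milliseconds since the epoch.
    updated = datetime.fromtimestamp(payload["timestamp"] / 1000, tz=timezone.utc)
    rows = []
    for asset in payload["data"]:
        price = asset.get("priceUsd")
        rows.append(
            (
                asset["id"],
                asset["symbol"],
                float(price) if price is not None else None,
                updated,
            )
        )
    return rows


def load_rows(rows, dsn):
    """Load: insert the rows into PostgreSQL in a single transaction."""
    import psycopg2  # third-party; imported here so the rest runs without it

    conn = psycopg2.connect(dsn)
    try:
        with conn:  # commits on success, rolls back on error
            with conn.cursor() as cur:
                cur.executemany(INSERT_SQL, rows)
    finally:
        conn.close()
```

A cron job could then call `load_rows(to_rows(fetch_assets()), dsn)` on a schedule, with `dsn` pointing at the PostgreSQL container.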
- Setup ~ step-by-step instructions to set up the pipeline.
- Design ~ pipeline description and design decisions.
- Improvements ~ listing of improvements I plan to make.