Skip to content

omerkolcak/electric-guitars-de-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Electric Guitars Data Engineering Project in Google Cloud Platform

Description

The goal of the project is to build an end-to-end data engineering project using modern tools.

System Architecture

alt text for screen readers

Technology

  • Web Scraping:
    • Data is scraped from this website using Selenium library.
  • Google Cloud Platform
    • Google storage: Raw csv file is stored.
    • Compute engine: ETL pipeline is runned on compute engine.
    • BigQuery: Structured data is stored in BigQuery database.
    • Looker Studio: Looker studio is connected to the BigQuery database, and simple dashboard is designed.
  • Mage
    • Mage is an open source data pipeline tool. It is initialized in GCP compute instance to run the ETL pipeline.

Data Model

alt text for screen readers

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published