Skip to content

Repository includes programs which was created while studying for google data engineer cert.

Notifications You must be signed in to change notification settings

yuwtennis/google-data-engineer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

google-data-engineer

Overview

Sets of code I have created while studying below book.

Data Science on the Google Cloud Platform
http://shop.oreilly.com/product/0636920057628.do

  1. Ch02. Ingesting Data into the Cloud
  2. Ch03. Creating Compelling Dashboard
  3. Ch04. Streaming Data' Publiation and Ingest
  4. Ch05. Interactive Data Exploration
  5. Ch06. Bayes Classifier on Cloud Dataproc

Instructions

Chapter 6

  1. Activate environment variables
cd Ch06/dataproc
source env.sh
  1. Copy bootstrap script to google storage
./copy_to_gs.sh
  1. Start dataproc
./init_cluster.sh
  1. Remove cluster
./delete_cluster.sh

About

Repository includes programs which was created while studying for google data engineer cert.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published