Sets of code I have created while studying below book.
Data Science on the Google Cloud Platform
http://shop.oreilly.com/product/0636920057628.do
- Ch02. Ingesting Data into the Cloud
- Ch03. Creating Compelling Dashboard
- Ch04. Streaming Data' Publiation and Ingest
- Ch05. Interactive Data Exploration
- Ch06. Bayes Classifier on Cloud Dataproc
- Activate environment variables
cd Ch06/dataproc
source env.sh
- Copy bootstrap script to google storage
./copy_to_gs.sh
- Start dataproc
./init_cluster.sh
- Remove cluster
./delete_cluster.sh