On 2025.02.02 I finished working through the book to the end.
However, some topics were not implemented because the 1st edition of the book is outdated and
it was not worth the cost to implement them. This repository will be archived.
I will start a new repository for the 2nd edition; if you are interested, please follow that one.
Thank you. @yuwtennis
Code I wrote while studying the book below.
Data Science on the Google Cloud Platform
http://shop.oreilly.com/product/0636920057628.do
- Ch02. Ingesting Data into the Cloud
- Ch03. Creating Compelling Dashboards
- Ch04. Streaming Data: Publication and Ingest
- Ch05. Interactive Data Exploration
- Ch06. Bayes Classifier on Cloud Dataproc
- Activate environment variables
cd Ch06/dataproc
source env.sh
- Copy the bootstrap script to Google Cloud Storage
./copy_to_gs.sh
- Start the Dataproc cluster
./init_cluster.sh
- Remove cluster
./delete_cluster.sh
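
The Ch06 steps above can also be chained so that the cluster is removed even when a step fails. The following is only a sketch: `with_cluster` is a hypothetical helper that is not part of this repository, and it assumes the four files named above (env.sh, copy_to_gs.sh, init_cluster.sh, delete_cluster.sh) live in the directory passed as the first argument.

```shell
# Hypothetical helper (not in this repository): run the Ch06 steps in order,
# run one command while the cluster is up, then always remove the cluster.
with_cluster() {
  dir="$1"
  shift
  # The subshell keeps cd, the sourced variables and the trap from leaking out.
  (
    set -e                          # stop at the first failing step
    cd "$dir"
    . ./env.sh                      # activate environment variables
    ./copy_to_gs.sh                 # copy the bootstrap script to Cloud Storage
    trap './delete_cluster.sh' EXIT # remove the cluster when the subshell exits
    ./init_cluster.sh               # start the Dataproc cluster
    "$@"                            # command to run while the cluster exists
  )
}
```

For example, `with_cluster Ch06/dataproc echo "cluster is up"` would run the steps, print the message, and then call delete_cluster.sh. The command runs from inside the given directory.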