Python code samples and solutions based on examples and exercises in Doing Data Science by Rachel Schutt and Cathy O'Neil
These code samples and solutions are splitted into different IPython/Jupyter notebooks.
Clone this repo and launch ipython notebook
at the root directory of this
project. You can also launch the Bash script launch.sh
which carries out the
same command with some specific ipython options.
You can also use the NBViewer website to read a notebook.
Note: This repo aims to practice myself in Data Science and to show that's easy to do Data Science in Python.
Data samples can be downloaded at the official Doing Data Science Github project at https://github.com/oreillymedia/doing_data_science
For some exercises, I provide a Python script which downloads needed files.
Chapter 2 - Exploratory Data Analysis
Play with some (simulated) data about ads shown and clicks recorded on the NY Time home page.
The file
down_nyt.py
downloads the 31 CSV files into thenyt-data
directory. Just do:python down_nyt.py
and wait... it can be a quite long.