Skip to content

Dépôt associé au cours Python pour data scientists (ENSAE 2e année)

License

Notifications You must be signed in to change notification settings

lbaudin/python-datascientist

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data science with Python

build-doc Actions Status

Download nbviewer Onyxia
Open In Colab

DOI

Note

This is the English 🇬🇧🇺🇸 version of the README. If you want to see the French 🇫🇷 version, you can click on the link below:

fr

This GitHub repository stores the source files used to build the site https://pythonds.linogaliana.fr/.

It contains the entire course Python for Data Science that I teach in the second year (Master 1) at ENSAE.

Note

A guide to assist potential contributors is available by clicking the button below:

CONTRIBUTING.md

Syllabus

The syllabus is available on the ENSAE website and on the course website.

Overall, it offers a very comprehensive content that can satisfy both beginners in data science and those looking for more advanced content:

  1. Data Manipulation: standard data manipulation (Pandas), geographical data (Geopandas), data retrieval (web scraping, API)...
  2. Data Visualization: classic visualizations (Matplotlib, Seaborn), cartography, interactive visualizations (Plotly, Folium)
  3. Modeling: machine learning (Scikit), econometrics
  4. Text Data Processing (NLP): introduction to tokenization with NLTK and SpaCy, modeling...
  5. Introduction to Modern Data Science: cloud computing, ElasticSearch, continuous integration...

The content of this site is based on open data, whether French data (mainly from the central platform data.gouv or the website of Insee) or American data.

A good complement to the website's content is the course we give with Romain Avouac (@avouacr) in the final year at ENSAE, more focused on the production of data science projects: https://ensae-reproductibilite.github.io/website/

Testing Python examples

You can use a personal installation of Python or shared servers. On the website, a series of buttons are available to easily test the examples on Jupyter notebooks in the configuration that suits you best.

Here are, for example, these buttons for the Numpy tutorial:

Download Onyxia
Open In Colab

About

Dépôt associé au cours Python pour data scientists (ENSAE 2e année)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 48.7%
  • Lua 30.1%
  • TeX 13.1%
  • Shell 4.2%
  • EJS 3.1%
  • Dockerfile 0.8%