Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Logo1/IntroPythonForDS

Open more actions menu
 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 

Repository files navigation

Introduction to Python For Data Science

This repo contains the teaching material for the Introduction to Python (and useful libraries) masterclass at the Data Science Retreat.

Table of Content

About me

Slides for this section can be found here.

The Python Programming Language

Slide deck for this entire section is available here.

Why Python?

Slides on this topic start here

Python for DS Components

Slides on this topic start here

Python 2 vs. Python 3

Slides on this topic start here

A great notebook covering the main differences has been written by Sebastian Raschka.

To keep your code compatible with both Python 2 and Python 3, you might also want to use this Cheat Sheet.

Installing Python and all useful packages

Slides on this topic start here

Running the IPython interpreter and a python file

Slides on this topic start here

Jupyter Notebook

A live demo will be given during the masterclass.

Experiment further with the IPython Notebook environment with this Jupyter Notebook. Try to clone or download it, before opening it, running and modifying its cells.

Many more Jupyter features in this blog post.

Python basics

Times to get your hands dirty. Read and test for yourself the examples provided in: The SciPy Lectures -- The Python Language.

Practice those examples using alternatively python files, the IPython interpreter and an IPython Notebook.

To go further:

Pandas

Intro tutorials on pandas basics

Data munging with pandas

Scikit-learn

Your first data analysis case

A great source of data problems nowadays is the Kaggle platform. We'll be starting today with a simple but representative dataset: Titanic: Machine Learning from Disaster.

  • Guide for orientation to approach the problem

IMPORTANT: you will find plenty of materials to analyze this data, however you'll learn the most if you give the problem some thought and try out several things before resorting to ready-made answers.

License

This repository contains a variety of content: some developed by Amélie Anglade, some derived from or largely inspired by third-parties' work, and some entirely from third-parties.
The third-party content is distributed under the license provided by those parties. Any derivative work respects the original licenses, and credits its initial authors.

Original content developed by Amélie Anglade is distributed under the MIT license.

About

Teaching material for the Introduction to Python at the Data Science Retreat

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
Morty Proxy This is a proxified and sanitized view of the page, visit original site.