In order to show how I understand statistical theories and machine learning argolithms, I applied knowledge to analyse some data sets and also coded some argolithms from textbooks.
Firefox may not work correctly, especially rendering equations. Please browse with Chrome or Safari.
##Data analysis with Spark Data analysis and engineering at PySpark was performed
Click-through rate(CTR) prediction
Movie Recommendation with MLlib
Building a word count application
##Machine learning algorighms In order to understand theoritically and be able to use knowledge practically, algorithms were coded from scratch with Python(Pandas, Scipy, Numpy) and figures on the textbook are reproduced from understanding of equations. Algorithms comes from following book. The numbering indicate the chapter number of the book.
Christopher Bishop. (2007). Pattern Recognition and Machine Learning (Information Science and Statistics). Springer
3.3 Bayesian Linear Regression
4.1.7 The perceptron algorithm
##Statistical Analysis Statistical analysis was conducted with R package and sample data.
Generalized linear mixed model
Comparison GLM vs Bayesian modeling
##Algorithm coding practiec in Python Problems provided by CodingBat(http://codingbat.com/python) were solved.