This a school data analysis project.
It consist on build a model at first with a subset of the covariates(7) and second with all the covariates(19)
i use just a part of the dataset cause it require 10.2GB of free RAM for cross validation with the entire dataset.
the dataset was taken from https://github.com/hadley/nycflights13