This course given in English in the MMMEF master. It starts with an introduction to the Big Data phenomenon and the focuses on the predictive methods of data science (a.k.a. machine learning methods).
Please note that the R scripts below have been extracted automatically from
the knitr sources of the slides. They must be adapted to run properly: paths
to data files must be modified and the opt_chunk
related code must be
removed. The code is developed under GNU/Linux and uses frequently the
doMC
package which is not available under MS Windows. It should be
replaced by the doParallel
package (and the code should be adapted).
Video series by Jennifer Widom, in particular:
Other videos are very interesting and relevant. I recommend to skip everything that is XML related videos, and possibly JSON related ones.
Tom Mitchell's and Nina Balcan's machine learning course: