In this part of the course, I describe solutions for handling medium- to large-scale data in R. Its organisation mirrors that of the course as a whole, starting from basic data management and moving up to advanced modeling.

Topics

  1. Data Management and Querying
  2. Shared-Memory Parallel Programming
  3. Distributed Systems