Corso di laurea magistrale in Data Science
Facoltà di Ingegneria dell'Informazione, Informatica e Statistica, Sapienza Università di Roma

Data Management for Data Science

2015/2016

Prof. Riccardo Rosati


News


Course contents and objectives

The main goal of the course is to present the basic concepts of data management systems. The first part of the course introduces the main aspects of relational database systems, including basic functionalities, file and index organizations, and query processing. The second part of the course aims at presenting the main non-relational approaches to data management, in particular, multidimensional data management, large-scale data management, and open data management.

Course program

  1. Introduction to relational databases
  2. The structure of a Data Base Management System
  3. Physical structures for data
  4. Multidimensional data management
  5. Large-scale data management
  6. Open data management

Lectures

The lectures for a.y. 2015/2016 will be held in the second semester (from February 22, 2016 to May 29, 2016), with the following schedule:


Course material

  1. Introduction to relational databases
  2. SQL
  3. Exercise on SQL
  4. DBMS transaction management and recovery management
  5. DBMS file organization
  6. DBMS query evaluation
  7. Exercise on file organization and query evaluation
  8. Introduction to big data and data warehouses
  9. Data warehouses
  10. Graph databases (UPDATED ON MAY 18, 2016)
  11. NoSQL aggregated databases
  12. Exercises on OLAP
Other useful references:

Homework assignments


Exam

The written exam is a set of exercises and questions about all the course topics.

Students who have completed and presented homework assignments during the course, will not have to pass the written exam on the topics covered by such assignments.

Exam dates:


Lectures