Corso di laurea magistrale in Data Science
Facoltà di Ingegneria dell'Informazione, Informatica e Statistica, Sapienza Università di Roma

Data Management for Data Science


Prof. Domenico Lembo and Prof. Riccardo Rosati


Course contents and objectives

The main goal of the course is to present the basic concepts of data management systems. The first part of the course introduces the main aspects of relational database systems, including basic functionalities, file and index organizations, and query processing. The second part of the course aims at presenting the main non-relational approaches to data management, in particular, multidimensional data management, large-scale data management, and open data management.

Course program

  1. Introduction to relational databases
  2. The structure of a Data Base Management System
  3. Physical structures for data
  4. Multidimensional data management
  5. Large-scale data management
  6. Open data management


The lectures for a.y. 2019/2020 are held in the second semester (from February 24, 2020 to May 30, 2020), with the following schedule:

Course material

(To be completed)
  1. Introduction to relational databases
  2. SQL

Other useful references:

Homework assignments


The written exam is a set of exercises and questions about all the course topics (time to complete the exam: 2 hours).

Exam dates:

As usual, before every exam date, students MUST reserve for the exam on Infostud. The reservation deadline is 3 or 4 days before the exam date.

Schedule and contents of past lectures

Link to the website of the 2018/2019 edition of this course