Corso di laurea magistrale in Data Science
Facoltà di Ingegneria dell'Informazione, Informatica e Statistica, Sapienza Università di Roma

Data Management for Data Science

2018/2019

Prof. Domenico Lembo and Prof. Riccardo Rosati


News


Course contents and objectives

The main goal of the course is to present the basic concepts of data management systems. The first part of the course introduces the main aspects of relational database systems, including basic functionalities, file and index organizations, and query processing. The second part of the course aims at presenting the main non-relational approaches to data management, in particular, multidimensional data management, large-scale data management, and open data management.


Course program

  1. Introduction to relational databases
  2. The structure of a Data Base Management System
  3. Physical structures for data
  4. Multidimensional data management
  5. Large-scale data management
  6. Open data management

Lectures

The lectures for a.y. 2018/2019 are held in the second semester (from February 25, 2019 to May 31, 2019), with the following schedule:


Course material

(To be completed)
  1. Introduction to relational databases
  2. SQL
  3. Exercise on SQL
  4. DBMS transaction management
  5. DBMS file organization
  6. DBMS query evaluation
  7. Exercise on file organization and query evaluation
  8. Introduction to big data
  9. Graph databases (updated on May 14, 2019)
  10. Neo4J (updated on May 14, 2019)
  11. NoSQL aggregate databases
  12. The MongoDB system

Other useful references:


Homework assignments


Exam

The written exam is a set of exercises and questions about all the course topics (time to complete the exam: 2 hours).

Exam dates:

As usual, before every exam date, students MUST reserve for the exam on Infostud. The reservation deadline is 3 or 4 days before the exam date.


Schedule and contents of past lectures


Link to the website of the 2017/2018 edition of this course