Corso di laurea magistrale in Data Science
Facoltà di Ingegneria dell'Informazione, Informatica e Statistica,
Sapienza Università di Roma
Data Management for Data Science
Prof. Riccardo Rosati
Results of the written exam of May 31, 2107.
The results will be uploaded on Infostud by prof. Rosati on June 7, for all the students who have registered for the exam date of May 31.
(The overall evaluation of the students who have completed all homeworks has been communicated during the exam)
Course contents and objectives
The main goal of the course is to present the basic concepts of data management systems. The first part of the course introduces the main aspects of relational database systems, including basic functionalities, file and index organizations, and query processing. The second part of the course aims at presenting the main non-relational approaches to data management, in particular, multidimensional data management, large-scale data management, and open data management.
Introduction to relational databases
- the relational data model, relational algebra, SQL
The structure of a Data Base Management System
- Basic functionalities of data server
Physical structures for data
- file organizations, indexed organizations, query planning and optimization
Multidimensional data management
- OLAP Queries, Structures for multidimensional data, OLAP query evaluation
Large-scale data management
- Distributed query evaluation, NoSQL databases, graph databases
Open data management
- open data, linked open data, RDF databases
The lectures for a.y. 2016/2017 will be held in the second semester (from February 20, 2017 to May 29, 2017), with the following schedule:
Monday, 15-18:30, room II, edificio scienze statistiche, Città Universitaria
Wednesday, 17-18:30, room V, edificio scienze statistiche, Città Universitaria
Other useful references:
- Introduction to relational databases
- Exercise on SQL
- DBMS transaction management and recovery management
- DBMS file organization
- DBMS query evaluation
- Exercise on file organization and query evaluation
- Introduction to big data and data warehouses
- Data warehouses
- Exercises on OLAP
- Graph databases
- NoSQL aggregated databases
- The MongoDB system
R. Ramakrishnan, J. Gehrke. Database Management Systems. McGraw-Hill, 2004.
The written exam is a set of exercises and questions about all the course topics (time to complete the exam: 2 hours).
Text of the written exam of June 7, 2016
Exam rules for students who have completed and presented (some of the) homeworks:
Consequently, the students who have completed and presented all the homework assignments during the course, do not have to take the written exam.
Students who have successfully presented homework 1, do not have to solve exercise 1 of the written exam.
Students who have successfully presented homework 2, do not have to solve exercise 2 of the written exam.
Students who have successfully presented homework 3, do not have to solve exercise 4, 5 and 6 of the written exam.
Students who have successfully presented the extra homework, do not have to solve exercise 3 of the written exam.
- May 31, 2017
- July 13, 2017
- September 11, 2017
- January 2018
- February 2018
As usual, before every exam date, students MUST reserve for the exam on Infostud. Reservations usually close 3 or 4 days before the exam date.