Master's Degree (Laurea Magistrale) in Data Science
Faculty of Information Engineering, Informatics and Statistics,
Sapienza Università di Roma
Data Management for Data Science
2016/2017
Prof. Riccardo Rosati
News
- Results of the written exam of May 31, 2017.
(The overall evaluation of the students who completed all the homework assignments was communicated during the exam.)
The results will be uploaded to Infostud by Prof. Rosati on June 7 for all students who registered for the May 31 exam date.
Course contents and objectives
The main goal of the course is to present the basic concepts of data management systems. The first part of the course introduces the main aspects of relational database systems, including basic functionalities, file and index organizations, and query processing. The second part of the course aims at presenting the main non-relational approaches to data management, in particular, multidimensional data management, large-scale data management, and open data management.
Course program
- Introduction to relational databases
  - the relational data model, relational algebra, SQL
- The structure of a Database Management System
  - basic functionalities of a data server
- Physical structures for data
  - file organizations, indexed organizations, query planning and optimization
- Multidimensional data management
  - OLAP queries, structures for multidimensional data, OLAP query evaluation
- Large-scale data management
  - distributed query evaluation, NoSQL databases, graph databases
- Open data management
  - open data, linked open data, RDF databases
Lectures
The lectures for a.y. 2016/2017 will be held in the second semester (from February 20, 2017 to May 29, 2017), with the following schedule:
- Monday, 15-19, Room II, Scienze Statistiche building, Città Universitaria
- Wednesday, 17-19, Room V, Scienze Statistiche building, Città Universitaria
Course material
- Introduction to relational databases
- SQL
- Exercise on SQL
- DBMS transaction management and recovery management
- DBMS file organization
- DBMS query evaluation
- Exercise on file organization and query evaluation
- Introduction to big data and data warehouses
- Data warehouses
- Exercises on OLAP
- Graph databases
- NoSQL aggregated databases
- The MongoDB system
Other useful references:
- R. Ramakrishnan, J. Gehrke. Database Management Systems. McGraw-Hill, 2004.
Exam
The written exam consists of exercises and questions covering all the course topics (time allowed: 2 hours).
Text of the written exam of June 7, 2016
Exam rules for students who have completed and presented some or all of the homework assignments:
- Students who have successfully presented homework 1 do not have to solve exercise 1 of the written exam.
- Students who have successfully presented homework 2 do not have to solve exercise 2 of the written exam.
- Students who have successfully presented homework 3 do not have to solve exercises 4, 5, and 6 of the written exam.
- Students who have successfully presented the extra homework do not have to solve exercise 3 of the written exam.
Consequently, students who have completed and presented all the homework assignments during the course do not have to take the written exam.
Exam dates:
- May 31, 2017
- July 13, 2017
- September 11, 2017
- January 2018
- February 2018
As usual, students MUST register on Infostud before every exam date. Registration usually closes 3 or 4 days before the exam date.