Corso di laurea magistrale in Data Science
Facoltà di Ingegneria dell'Informazione, Informatica e Statistica,
Sapienza Università di Roma
Data Management for Data Science
2015/2016
Prof. Riccardo Rosati
News
- Exam results - July 5, 2016.
The results of the students who registered for the exam on Infostud will be uploaded to Infostud on July 14. Students who do not want their grade to be uploaded must send an email to Prof. Rosati no later than July 13.
Interested students may review the exam corrections on July 12, 17:00-18:30 (via Ariosto 25, room B216).
Course contents and objectives
The main goal of the course is to present the basic concepts of data management systems. The first part of the course introduces the main aspects of relational database systems, including basic functionalities, file and index organizations, and query processing. The second part of the course aims at presenting the main non-relational approaches to data management, in particular, multidimensional data management, large-scale data management, and open data management.
Course program
- Introduction to relational databases: the relational data model, relational algebra, SQL
- The structure of a Database Management System: basic functionalities of a data server
- Physical structures for data: file organizations, indexed organizations, query planning and optimization
- Multidimensional data management: OLAP queries, structures for multidimensional data, OLAP query evaluation
- Large-scale data management: distributed query evaluation, NoSQL databases, graph databases
- Open data management: open data, linked open data, RDF databases
Lectures
The lectures for a.y. 2015/2016 will be held in the second semester (from February 22, 2016 to May 29, 2016), with the following schedule:
- Monday, 15-18:30, room II, edificio scienze statistiche, Città Universitaria
- Wednesday, 17-18:30, room V, edificio scienze statistiche, Città Universitaria
Course material
- Introduction to relational databases
- SQL
- Exercise on SQL
- DBMS transaction management and recovery management
- DBMS file organization
- DBMS query evaluation
- Exercise on file organization and query evaluation
- Introduction to big data and data warehouses
- Data warehouses
- Graph databases (UPDATED ON MAY 18, 2016)
- NoSQL aggregated databases
- Exercises on OLAP
Other useful references:
- R. Ramakrishnan, J. Gehrke. Database Management Systems. McGraw-Hill, 2004.
Exam
The written exam consists of exercises and questions covering all the course topics.
Students who have completed and presented homework assignments during the course will not have to take the part of the written exam on the topics covered by those assignments.
Exam dates:
- June 7, 2016
- July 5, 2016
- September 13, 2016