Master's Degree Program in Data Science
Facoltà di Ingegneria dell'Informazione, Informatica e Statistica,
Sapienza Università di Roma
Data Management for Data Science
2017/2018
Prof. Riccardo Rosati
News
- Exam results (July 23, 2018).
  The grades of students who registered for the exam on Infostud will be uploaded to Infostud starting August 2. Students who do not want their grade uploaded must send an email to Prof. Rosati no later than August 1.
  Interested students can review their corrected exams during the office hours of August 1 (14:30-16:00, room B216).
- Results of the homework presentations for 2017/2018.
Course contents and objectives
The main goal of the course is to present the basic concepts of data management systems. The first part introduces the main aspects of relational database systems, including basic functionalities, file and index organizations, and query processing. The second part presents the main non-relational approaches to data management, in particular multidimensional data management, large-scale data management, and open data management.
Course program
- Introduction to relational databases
  - the relational data model, relational algebra, SQL
- The structure of a Database Management System
  - basic functionalities of a data server
- Physical structures for data
  - file organizations, indexed organizations, query planning and optimization
- Large-scale data management
  - distributed query evaluation, NoSQL databases, graph databases
- Open data management
  - open data, linked open data, RDF databases
Lectures
The lectures for a.y. 2017/2018 were held in the second semester (from February 26, 2018 to June 1, 2018), with the following schedule:
- Monday, 15:00-19:00, room II, Scienze Statistiche building, Città Universitaria
- Tuesday, 15:00-17:00, room XIII, Tumminelli building, Città Universitaria
Course material
- Introduction to relational databases
- SQL
- Exercise on SQL
- DBMS transaction management and recovery management
- DBMS file organization
- DBMS query evaluation
- Introduction to big data and data warehouses
- Graph databases
- Exercise on file organization and query evaluation
- NoSQL aggregated databases
- The MongoDB system
Other useful references:
- R. Ramakrishnan, J. Gehrke. Database Management Systems. McGraw-Hill, 2004.
Homework
Exam
The written exam consists of exercises and questions covering all the course topics (time allowed: 2 hours).
Students who presented all the homework assignments during the lectures do not have to take the written exam: please see the page on homework results for more details.
Exam dates:
- June 13, 2018
- July 23, 2018
- September 7, 2018
- January 2019
- February 2019
As usual, students MUST register for the exam on Infostud before every exam date. The registration deadline is 3 or 4 days before the exam date.
Students who presented all the homework assignments during the lectures must follow the instructions on the page on homework results to register their grade.