Corso di laurea magistrale in Data Science
Facoltà di Ingegneria dell'Informazione, Informatica e Statistica,
Sapienza Università di Roma
Data Management for Data Science
Prof. Domenico Lembo and Prof. Riccardo Rosati
- THE LECTURE OF MARCH 8, 2021 IS CANCELED DUE TO THE UNAVAILABILITY OF THE COMPUTING AND NETWORK SERVICES OF SAPIENZA
To access online lectures, lecture recordings, course material, and all the announcements about this course, students are asked to access their Google account at studenti.uniroma1.it and access Google Classroom (https://classroom.google.com or you can download the app Classroom for smartphones); then, students must enroll in the Data Management for Data Science course using the following link: https://classroom.google.com/c/Mjc2MDU5OTA0MzQ1?cjc=xjo46a3
The lectures for a.y. 2020/2021 are held in the second semester (from February 22, 2021 to May 28, 2021) in "blended" modality (they can be attended both in-person and online) with the following schedule:
All lectures can be attended online live: instructions appear in the Google Classroom course. Every lecture will be recorded, and the recording will be published right after the lecture in the Google Classroom course.
Monday, 13:00-15:00, room B2, DIAG, via Ariosto 25;
Wednesday, 9:00-13:00, room 15, Scienze Statistiche, Città Universitaria.
Course contents and objectives
The main goal of the course is to present the basic concepts of data management systems. The first part of the course introduces the main aspects of relational database systems, including basic functionalities, file and index organizations, and query processing. The second part of the course aims at presenting the main non-relational approaches to data management, in particular, multidimensional data management, large-scale data management, and open data management.
Introduction to relational databases
- the relational data model, relational algebra, SQL
The structure of a Data Base Management System
- Basic functionalities of data server
Physical structures for data
- file organizations, indexed organizations, query planning and optimization
Multidimensional data management
- OLAP Queries, Structures for multidimensional data, OLAP query evaluation
Large-scale data management
- Distributed query evaluation, NoSQL databases, graph databases
Open data management
- open data, linked open data, RDF databases
The modalities of the exam will be decided for every exam session according to the University guidelines for that exam session.
- June 2021
- July 2021
- September 2021
- January 2022
- February 2022
As usual, before every exam date, students MUST reserve for the exam on Infostud. The reservation deadline is 3 or 4 days before the exam date.