Information Integration (academic year 2017/2018)

This is one of the two sections of the course Large Scale Data Management. The lectures of this section will be held in March-April 2018.

For whom is this course. This 3 credits course is actually one of the two sections of the course Large Scale Data Management for the students of the Master in Computer Engineering (School of Engineering) of Sapienza Università di Roma.
Prerequisites. A good knowledge of the fundamentals of Programming Structures, Programming Languages, Databases (SQL, relational data model, Entity-Relationship data model, conceptual and logical database design) and Database systems, as well as a basic knowledge of Mathematical Logic is required.
Course goals. Information integration is the problem of combining data residing at different sources, and providing the user with a unified view of these data. The problem of designing information integration systems is important in current real world applications, and is characterized by a number of issues that are interesting from both a theoretical and a practical point of view. In the last years, there has been a huge amount of research work on data integration, and a precise, clear picture of a systematic approach to such problem is now available. This section will present an overview of the research work carried out in the area of data integration, with emphasis on the theoretical results that are relevant for the development of information integration solutions. Special attention will be devoted to the following aspects: architectures for information integration, modeling an information integration application, ontology-based data access and integration, processing queries in information integration, data exchange, and reasoning on queries.

  • News
    • April 4, 2018 The material on KARMA is available in the Moodle system.
    • March 15, 2018 The complete set of slides on the theory of data integration are available in the Moodle system.
    • February 19, 2018 With regard to the dates of the exam in April 2018 for the students of the academic year 2016/2017, please see below. The students who must register the whole exam of Large Scale Data Integration during April 2018 session can now book for the exam using the INFOSTUD system until April 10, 2018.
  • Topics covered
    • Architectures for information integration
    • Distributed data management
    • Data federation
    • Data exchange and data warehousing
    • ETL (Extraction, Transformation and Loading), data cleaning and data reconciliation
    • Data integration
    • Ontology-based data integration
  • Teaching material
  • Exams
    The rules for the exam will be posted here.

  • Schedule of exams
    • First exam: June 2018
    • Second exam: July 2018
    • Third exam: September 2018
    • First special session: October 2018
    • Fourth exam: January 2019
    • Fifth exam: February 2019
    • Second special session: April 2019
  • Lectures
    • When: Monday, 9:00am - 11:00am, Thursday, 2:00pm - 5:00pm,
      starting from February 26, 2018.

    • Where: Classroom A5, via Ariosto 25, Roma
    • Schedule

      Week Monday (9:00am - 11:00am)
      classroom A5
      Thursday (2:00pm - 5:00pm)
      classroom A5
      01 (Feb 26)
      02 (Mar 05)
      Lectures 1,2,3
      - Introduction to information integration
      - Propositional logic: syntax and semantics
      03 (Mar 12) Lectures 4,5
      - Predicate: syntax and semantics
      Lectures 6,7,8
      - Relationship between logic and data management
      04 (Mar 19) Lectures 9,10
      - Formalization of data integration
      Lectures 11,12,13
      - Mapping languages
      05 (Mar 26) Lectures 14,15
      - Algorithms for query answering in GAV without axioms
      06 (Apr 02)
      Lectures 16,17,18
      - Tool for mapping specification: KARMA
      07 (Apr 09) Lectures 19,20
      - Algorithms for data exchange in GLAV without axioms
      Lectures 21,22,23
      - Virtual data integration in GLAV without axioms
      - Data integration with axioms in the global schema
      08 (Apr 16) Lectures 24,25
      - Ontology-based data integration

  • Past editions
  • Office hours. Tuesday, 5:00 pm, at the Dipartimento di Informatica e Sistemistica "Antonio Ruberti",
    via Ariosto 25, Roma, second floor, room B203 (if available), or room B217 (otherwise) -- please, look at the last
    minute news for the next office hours