Length: 16 hours - 4 cfu
Abstract
In this course, we will discuss fundamental data warehouse architectures, followed by modeling data warehouse schemas and data warehouse internals. Ultimately it will also be about alternatives to traditional data warehouses, namely data lakes and data meshes. In particular, the course will cover the following topics:
Data integration challenges and architectures
Data warehouse architectures
Challenges in designing data integration processes
Evolution of data sources and its impact on integration processes: research challenges
Optimizing the performance of process integration processes: research and industrial approaches
Data quality issues: data cleansing
Entity matching: Standard processing pipeline
Entity matching: algorithms and techniques
Research challenge: automatic entity matching
Data Lake and data mesh
Dates & Venue
Giorni | Aula | Orario |
13/03/2024 | Lab. Laurea Magistrale 3° floor - Via Celoria 18 - 20133 Milan |
11:00-13:00 |
14/03/2024 | Sala riunioni 5° floor - Via Celoria 18 - 20133 Milan |
11:00-13:00 |
15/03/2024 | Lab. Laurea Magistrale 3° floor - Via Celoria 18 - 20133 Milan | 11:00-13:00 |
18/03/2024 | Lab. Laurea Magistrale 5° floor - Via Celoria 18 - 20133 Milan |
11:00-14:00 |
19/03/2024 | Lab. Laurea Magistrale 3° floor - Via Celoria 18 - 20133 Milan |
11:00-14:00 |
20/03/2024 | Lab. Laurea Magistrale 3° floor - Via Celoria 18 - 20133 Milan | 11:00-13:00 |
21/03/2024 | Sala riunioni 5° floor - Via Celoria 18 - 20133 Milan | 11:00-13:00 |
Lecturer:
Prof. Paolo Ceravolo - Dipartimento di Informatica
Prof. Robert Wrembel - University of Technology, Poznań, Poland
Assessor:
Prof. Paolo Ceravolo - Dipartimento di Informatica
Prof. Robert Wrembel - University of Technology, Poznań, Poland