For data scientists, business users and operational systems wanting to store and access all data and metadata, the data lake is a single source of truth, allowing separation of storage and interrelations between data. Data Lake eliminates dark data, saves money and enables scalability. In this session, we will discuss the why’s and how’s of building a data lake, focusing on the first layer: Ingestion and Meta Data Management.
- What and why of a Data Lake
- Layers of a Data Lake
- Ingestion Patterns
- Meta Data Management- Data Cataloging
- Data Access Patterns