Design a Data Storage Structure :
- Design an Azure Data Lake solution
- Recommend file types for storage
- Recommend file types for analytical queries
- Design for efficient querying
- Design for data pruning
- Design a folder structure that represents the levels of data transformation
- Design a distribution strategy
Design a Partition Strategy :
- Design a partition strategy for files
- Design a partition strategy for analytical workloads
- Design a partition strategy for efficiency/performance
- Design a partition strategy for Azure Synapse Analytics
- Identify when partitioning is needed in Azure Data Lake Storage Gen2
Design the Serving Layer :
- Design star schemas
- Design slowly changing dimensions
- Design a dimensional hierarchy
- Design a solution for temporal data
- Design for incremental loading
- Design analytical stores
- Design meta stores in Azure Synapse Analytics and Azure Data bricks
Implement Physical Data Storage Structures :
- Implement compression
- Implement partitioning
- Implement sharing
- Implement different table geometries with Azure Synapse Analytics pools
- Implement data redundancy
- Implement distributions
- Implement data archiving
Implement Logical Data Structures :
- Build a temporal data solution
- Build a slowly changing dimension
- Build a logical folder structure
- Build external tables
- Implement file and folder structures for efficient querying and data pruning
Implement the Serving Layer :
- Deliver data in a relational star schema
- Deliver data in Parquet files
- Maintain metadata
- Implement a dimensional hierarchy