Explain wide vs. narrow transformations and how they drive shuffle cost, failure domains, and pipeline design. When would you intentionally add a wide transformation, and how do you minimize its impact?
Spark/Big Datahard
2
Data masking scenarios for secure data handling
General/Othereasy
3
Normalization: Various forms and impact on query performance
SQLmedium
4
Optimization: Performance tuning strategies and temporal tables
SQLhard
5
SCDs: Types of Slowly Changing Dimensions and their use cases
SQLeasy
6
Schema Design: Star vs. Snowflake schema differences