Explain wide vs. narrow transformations and how they drive shuffle cost, failure domains, and pipeline design. When would you intentionally add a wide transformation, and how do you minimize its impact?
Spark/Big Datahard
2
Optimization: Performance tuning strategies and temporal tables
SQLhard
3
Schema Design: Star vs. Snowflake schema differences