What is the difference between repartition and coalesce in Apache Spark?
Spark/Big Data (medium)
2
Write an SQL query to find the second-highest salary from an employee table.
SQL (medium)
3
What strategies can you use to handle skewed data in Spark?
Spark/Big Data (medium)
4
Design a Delta table layout for a mixed workload: point lookups by user_id, range scans by date, and full partition scans. Compare partitioning with Z-ordering: when to use each, and the rewrite-cost trade-off.
Spark/Big Data (hard)
5
Describe how to secure sensitive data in cloud storage solutions.
Cloud/Tools (easy)
6
What are the pros and cons of using a data lake on AWS, GCP, or Azure?
Cloud/Tools (hard)
7
Explain how you gather and define requirements for a complex data platform project.
System Design/Architecture (easy)
8
How would you model customer transaction data for both analytical and operational use cases?
General/Other (hard)