Write optimized SQL queries involving window functions, CTEs, and joins.
SQLmedium
2
Write queries combining Joins and Group By operations.
SQLmedium
3
You need to create a workflow where Task B runs only if Task A is successful, and Task C should always run regardless of Task A or B's status. How would you define this dependency using Airflow?
SQLhard
4
You need to design a Kafka topic for a logging service. How would you decide the number of partitions and the key for partitioning to balance throughput and ordering requirements?
SQLhard
5
Your Kafka consumer shows significant lag during peak hours. What strategies would you employ to reduce lag and ensure timely data processing?
SQLmedium
6
map() vs mapPartitions(): Highlight the difference between map (row-level transformation) and mapPartitions (partition-level transformation).
SQLmedium
7
repartition() vs coalesce(): Explain when to use repartition() (increases partitions) vs coalesce() (reduces partitions).
SQLmedium
+7 More Questions with Expert Answers
Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.