Create a Python program to demonstrate the use of set operations (union, intersection).
Python/Coding · medium
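A minimal sketch of what such a program might look like. The set names and contents are made up for illustration. It shows both the operator form and the method form of each operation.

```python
# Two illustrative sets (hypothetical data).
backend_skills = {"python", "sql", "spark"}
data_skills = {"sql", "spark", "airflow"}

# Union: every element that appears in either set.
union = backend_skills | data_skills

# Intersection: only the elements that appear in both sets.
intersection = backend_skills & data_skills

# The method forms accept any iterable, not only sets.
union_from_list = backend_skills.union(["sql", "airflow"])
intersection_from_list = backend_skills.intersection(["sql", "go"])

# Sets are unordered, so sort before printing for a stable display.
print(sorted(union))
print(sorted(intersection))
```

The operators require both operands to be sets, while `union()` and `intersection()` convert their argument for you.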
3
Describe Spark's memory management model. How do you handle heap memory overhead issues?
Python/Coding · medium
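One way to reason about this question is to work through the arithmetic of an executor's memory regions. The sketch below assumes the unified memory model with commonly documented defaults (300 MiB reserved, `spark.memory.fraction` = 0.6, `spark.memory.storageFraction` = 0.5, and an overhead of 10% of the heap with a 384 MiB floor). The function name and the returned keys are hypothetical. Check the configuration reference for your Spark version before relying on the numbers.

```python
RESERVED_MIB = 300  # fixed reservation inside the executor heap


def executor_memory_breakdown(
    heap_mib,
    memory_fraction=0.6,    # assumed default of spark.memory.fraction
    storage_fraction=0.5,   # assumed default of spark.memory.storageFraction
    overhead_factor=0.10,   # assumed default overhead factor
    min_overhead_mib=384,   # assumed minimum overhead
):
    """Estimate how an executor heap of `heap_mib` MiB is divided."""
    usable = heap_mib - RESERVED_MIB
    unified = usable * memory_fraction      # shared by execution and storage
    storage = unified * storage_fraction    # portion protected from eviction
    execution = unified - storage
    user = usable - unified                 # user data structures, UDF objects
    overhead = max(min_overhead_mib, heap_mib * overhead_factor)
    return {
        "storage_mib": storage,
        "execution_mib": execution,
        "user_mib": user,
        "overhead_mib": overhead,
        # The cluster manager must grant heap plus off-heap overhead.
        "container_mib": heap_mib + overhead,
    }


breakdown = executor_memory_breakdown(4096)
print(breakdown)
```

If containers are killed for exceeding their memory limit, the usual first step is to raise `spark.executor.memoryOverhead` explicitly rather than the heap, since the overhead region covers off-heap allocations such as Python worker processes and native buffers.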
4
Implement a recursive query over an employee-manager hierarchy. Explain its termination guarantees and depth limits, and when a recursive CTE becomes a scalability bottleneck. What alternatives exist for graph-scale hierarchies in Spark or a data lake?
SQL · medium
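A possible starting point, sketched with Python's built-in `sqlite3` module so the SQL can be run locally. The `employees` table, its columns, and the sample rows are assumptions made for illustration. The `depth` column doubles as an explicit guard: the recursion stops at `max_depth` even if the data contains a cycle.

```python
import sqlite3


def demo_connection():
    """Build an in-memory database with a small, made-up org chart."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, manager_id INTEGER)"
    )
    conn.executemany(
        "INSERT INTO employees VALUES (?, ?, ?)",
        [(1, "Ada", None), (2, "Ben", 1), (3, "Cy", 1), (4, "Di", 2)],
    )
    return conn


def org_chart(conn, max_depth=10):
    """Return (name, depth) rows, walking down from employees with no manager."""
    query = """
    WITH RECURSIVE chain(id, name, depth) AS (
        -- Anchor member: the roots of the hierarchy.
        SELECT id, name, 0 FROM employees WHERE manager_id IS NULL
        UNION ALL
        -- Recursive member: direct reports of rows found so far.
        SELECT e.id, e.name, c.depth + 1
        FROM employees AS e
        JOIN chain AS c ON e.manager_id = c.id
        WHERE c.depth < :max_depth  -- depth limit guards against cycles
    )
    SELECT name, depth FROM chain ORDER BY depth, id
    """
    return conn.execute(query, {"max_depth": max_depth}).fetchall()


print(org_chart(demo_connection()))
```

On acyclic data the recursion ends on its own once a level produces no new rows. The depth guard only matters for bad data, which is why it is worth stating as a separate guarantee in an answer.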
5
Given a table of sales data, use window functions to calculate a running total.
SQL · medium
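A running total can be sketched with `SUM(...) OVER (ORDER BY ...)`. The example below runs the SQL through Python's `sqlite3` module, which assumes the bundled SQLite is version 3.25 or newer (the first release with window functions). The `sales` table and its rows are hypothetical.

```python
import sqlite3


def demo_sales_connection():
    """Build an in-memory database with a few made-up sales rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE sales (id INTEGER PRIMARY KEY, sale_date TEXT, amount INTEGER)"
    )
    conn.executemany(
        "INSERT INTO sales (sale_date, amount) VALUES (?, ?)",
        [("2024-01-01", 100), ("2024-01-02", 50), ("2024-01-03", 25)],
    )
    return conn


def running_totals(conn):
    """Return (sale_date, amount, running_total) rows in date order."""
    query = """
    SELECT
        sale_date,
        amount,
        SUM(amount) OVER (
            ORDER BY sale_date, id  -- id breaks ties between same-day sales
            ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
        ) AS running_total
    FROM sales
    ORDER BY sale_date, id
    """
    return conn.execute(query).fetchall()


for row in running_totals(demo_sales_connection()):
    print(row)
```

The explicit `ROWS` frame is a deliberate choice: the default `RANGE` frame treats rows with equal sort keys as peers and gives them all the same total, which is rarely what a running total should do.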
6
How would you optimize a query with multiple joins and subqueries?
SQL · medium
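A reasonable first step in any answer is to read the query plan before and after a change. The sketch below illustrates only that one step, using SQLite's `EXPLAIN QUERY PLAN` through Python's `sqlite3` module. The `orders` table, the index name, and the query are hypothetical, and other engines expose plans through their own commands.

```python
import sqlite3


def plan_details(conn, sql):
    """Return the human-readable detail column of SQLite's query plan."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    return [row[-1] for row in rows]  # the last column holds the description


def compare_plans():
    """Show how adding an index on the filter column changes the plan."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total INTEGER)"
    )
    query = "SELECT id, total FROM orders WHERE customer_id = 7"

    before = plan_details(conn, query)  # expected to be a full table scan
    conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")
    after = plan_details(conn, query)   # expected to search via the new index

    conn.close()
    return before, after


before, after = compare_plans()
print("before:", before)
print("after: ", after)
```

The same before-and-after habit carries over to larger queries: check whether each join and subquery is served by an index or a scan, then rewrite or index the most expensive step first.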
7
Code a simple PySpark job to read a JSON file, filter records, and write output in Parquet format.
Spark/Big Data · medium
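A minimal sketch of such a job, assuming PySpark is installed and the input is in JSON Lines format (one object per line, which is what `spark.read.json` expects by default). The function name, the `amount` column, and the threshold are hypothetical placeholders for whatever filter the interviewer specifies.

```python
def run_job(input_path, output_path, min_amount=0.0):
    """Read JSON records, keep rows above a threshold, write Parquet."""
    # Imported inside the function so this module can be loaded
    # on a machine that does not have PySpark installed.
    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    spark = SparkSession.builder.appName("json-to-parquet").getOrCreate()
    try:
        # Schema is inferred from the data; pass an explicit schema for large inputs.
        records = spark.read.json(input_path)

        # Hypothetical filter: keep records whose `amount` exceeds the threshold.
        filtered = records.filter(F.col("amount") > min_amount)

        # Overwrite mode keeps reruns idempotent for the same output path.
        filtered.write.mode("overwrite").parquet(output_path)
    finally:
        spark.stop()
```

A call such as `run_job("events.json", "events_parquet", min_amount=10.0)` would run the job end to end. For a file holding a single multi-line JSON document, the reader needs `.option("multiLine", True)`.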