Convert complex SQL (CTEs, window functions, subqueries) to production-grade PySpark. Discuss when to use spark.sql() vs. DataFrame API, and the implications for testability, partitioning, and execution predictability.
Spark/Big Datamedium
2
Write a query to find the 5th highest salary in an employee table and calculate the number of employees whose salary is greater than that of their manager.
SQLmedium
3
Provide specific examples of challenges faced with PySpark and SQL and solutions implemented.
Spark/Big Datamedium
+3 More Questions with Expert Answers
Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.