Describe how Dataproc integrates with BigQuery for processing large datasets.
SQLmedium
4
Duplicate characters in a string (e.g., '123a!' to '112233aa!!').
SQLmedium
5
Explain Dynamic Partition Pruning error and how to fix it.
SQLmedium
6
Implement a recursive query for hierarchy (employee-manager). Explain the termination guarantees, depth limits, and when a recursive CTE becomes a scalability bottleneck. What alternatives exist for graph-scale hierarchies in Spark or a data lake?
SQLmedium
7
Explain how partitioning and bucketing in Hive/Spark optimize queries. What are the trade-offs in bucket count, partition cardinality, and small-file problem? When does over-partitioning or over-bucketing become counterproductive?
SQLmedium
8
Explain how to implement cumulative sum in SQL.
SQLmedium
+20 More Questions with Expert Answers
Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.