SQL questions from Capgemini data engineering interviews.
These sql questions are sourced from Capgemini data engineering interviews. Each includes an expert-level answer.
Discuss how you handled null values or unstructured data in your previous projects.
How does indexing improve query performance in SQL?
How would you deal with data skewness in a join operation?
How would you deal with data skewness in a large dataset?
Solve a problem using a window function in Spark or SQL.
map() vs mapPartitions(): Highlight the difference between map (row-level transformation) and mapPartitions (partition-level transformation).
repartition() vs coalesce(): Explain when to use repartition() (increases partitions) vs coalesce() (reduces partitions).
Download the complete interview prep bundle with expert answers. Study offline, on your commute, anywhere.