Interview questions · medium
What is the difference between repartition and coalesce in Apache Spark?
What is the difference between narrow and wide transformations in Apache Spark? Explain with examples.
Explain the difference between Spark's map() and flatMap() transformations.
Explain the concept of Broadcast Join in Spark. When should it be used?
Why do you want to join this company?
What is the difference between SQL and NoSQL databases?
When and how do you use Broadcast Join in Spark?
Describe your approach to managing data deduplication.
How do you monitor consumer lag in Kafka, and how can you reduce it?
How do you optimize partitioning when dealing with large datasets?
Optimize a query fetching customer data with a rolling 6-month sales sum.
Write a SQL query to find employees earning the second-highest salary.
Write a SQL query to find the top 5 products by sales per region.
How do you handle out-of-memory errors in Spark jobs?
What is the role of Zookeeper in Kafka?
Type or paste your answer to any of these questions and our AI Coach scores it, highlights gaps, and rewrites it at FAANG quality. Free to try.