JavaScript is required to use this application. Please enable JavaScript in your browser settings or disable any extensions that may be blocking scripts.
Interview questions · medium
What is the difference between repartition and coalesce in Apache Spark?
Write an SQL query to find the second-highest salary from an employee table.
What strategies can you use to handle skewed data in Spark?
Describe strategies for optimizing a slow-running query on a massive dataset.
Explain the difference between Star and Snowflake schemas. When would you choose one over the other?
Explain the use of surrogate keys vs. natural keys in data modeling.
Given an unoptimized query execution plan, how would you diagnose and improve performance?
Kafka Partitioning: How would you ensure even load distribution across Kafka partitions in a high-volume system?
Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.