How does Spark's Catalyst Optimizer work? Explain its stages.
Spark/Big Data | hard
2
Have you worked on Data Warehousing projects?
General/Other | hard
3
What is the difference between OLTP and OLAP?
General/Other | hard
4
How do you optimize a long-running SQL query?
SQL | hard
5
What is Spark's Catalyst Optimizer? Explain its stages.
Spark/Big Data | hard
6
Name the tools and technologies you have worked with to date.
General/Other | hard
7
You need to create a workflow in which Task B runs only if Task A succeeds, and Task C always runs regardless of the status of Task A or Task B. How would you define these dependencies in Airflow?
SQL | hard
8
You need to design a Kafka topic for a logging service. How would you choose the number of partitions and the partitioning key to balance throughput against ordering requirements?
SQL | hard