What is the difference between cache() and persist() in Spark? When would you use each?
Spark/Big Datamedium
2
Can you explain the architecture of Apache Spark and its components?
Spark/Big Datahard
3
Tell me about a time when you faced a challenging situation at work and how you handled it.
Behavioralmedium
4
What is a window function? Explain with an example.
SQLmedium
5
Prioritize Spark optimizations by impact and effort. Discuss partitioning strategy, caching policy, join selection, shuffle reduction, and when each becomes a scalability or cost bottleneck.
Spark/Big Datahard
6
Explain the difference between batch and streaming data processing in Data Fusion.
Spark/Big Datahard
7
Why are you leaving your current role?
Behavioraleasy
8
Explain job bookmarking in AWS Glue. How does it help in incremental data processing?
Cloud/Toolsmedium
+19 More Questions with Expert Answers
Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.