What is the difference between SparkSession and SparkContext in Spark?
Spark/Big Datahard
2
What is the difference between partitioning and bucketing in Spark, and when would you use bucketing?
SQLmedium
3
Write a Python function to check if a string is a palindrome.
Python/Codingmedium
4
When would you architecturally choose Dataset[T] over DataFrame in a Scala Spark pipeline, and what are the scalability and portability trade-offs? Include type-safety benefits vs. operational constraints.
Spark/Big Dataeasy
5
Design a cost-aware resource strategy for a Databricks workload with spiky and batch jobs. Explain Dynamic Resource Allocation, when to disable it, and how min/max executors and spot instances affect cost and SLAs.
Spark/Big Datahard
6
Command to Read JSON Data and Options
General/Otherhard
7
Daily Data Volume - quantify
General/Othereasy
8
Describe a project you worked on, focusing on the data pipeline and your role.
System Design/Architectureeasy
+20 More Questions with Expert Answers
Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.