DataEngPrep.tech
QuestionsBlogStore
Get PDF Bundle

Interview Questions

Real questions from top companies · easy

700+ Easy450+ Medium650+ Hard
All CategoriesBehavioralSpark/Big DataSQLPython/CodingSystem Design/ArchitectureCloud/ToolsGeneral/Othereasymediumhard
1

What is the difference between managed and external tables in Hive or Spark SQL?

Spark/Big Dataeasy
2

What is the difference between map and flatMap in Spark transformations?

Spark/Big Dataeasy
3

What is the purpose of the VACUUM command in Delta Lake?

Spark/Big Dataeasy
4

What limitations do you face when using Delta Tables in a multi-cloud environment?

Spark/Big Dataeasy
5

What metrics do you use to determine whether a Spark job is going well or not?

Spark/Big Dataeasy
6

Which Spark version are you using in your project, and why did you choose it?

Spark/Big Dataeasy
7

Why does Hive use Derby by default, and what alternatives are used in production?

Spark/Big Dataeasy
8

Worked with UDFs - share examples

Spark/Big Dataeasy

+15 More Questions with Expert Answers

Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.

Get PDF Bundle — from $21Try Free Sample
Previous1...343536