DataEngPrep.tech
QuestionsBlogStore
Get PDF Bundle

Interview Questions

Real questions from top companies · easy

700+ Easy450+ Medium650+ Hard
All CategoriesBehavioralSpark/Big DataSQLPython/CodingSystem Design/ArchitectureCloud/ToolsGeneral/Othereasymediumhard
1

What are the different delivery semantics in Kafka (at least-once, at-most-once, exactly-once)?

Spark/Big Dataeasy
2

What are the different modes in which you can submit Spark jobs? Explain each.

Spark/Big Dataeasy
3

What are the performance considerations when using Auto Loader?

Spark/Big Dataeasy
4

What are the steps to connect to Salesforce?

Spark/Big Dataeasy
5

What are the steps to debug a failed workflow in Databricks?

Spark/Big Dataeasy
6

What are the steps to execute a Python file with PySpark code on an EC2 environment?

Spark/Big Dataeasy
7

What are the trade-offs between using Glue Catalog vs. Hive Metastore for metadata management?

Spark/Big Dataeasy
8

What are transient clusters in EMR, and when would you use them?

Spark/Big Dataeasy

+20 More Questions with Expert Answers

Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.

Get PDF Bundle — from $21Try Free Sample
Previous1...33343536Next