DataEngPrep.tech
QuestionsBlogStore
Get PDF BundlePDF Bundle

Interview Questions

Real questions from top companies

700+ Easy450+ Medium650+ Hard
All CategoriesBehavioralSpark/Big DataSQLPython/CodingSystem Design/ArchitectureCloud/ToolsGeneral/Othereasymediumhard
1

A task intermittently fails due to external API limitations. How would you configure Airflow retries and alerts to manage this situation efficiently?

Spark/Big Dataeasy
2

Accumulator and Broadcast Variables - explain

Spark/Big Dataeasy
3

Accumulators - use as shared variable for write-only operations

Spark/Big Datamedium
4

Adaptive Query Execution (AQE): Discuss how AQE optimizes query execution in Spark dynamically based on runtime stats.

Spark/Big Datahard
5

After cleaning, how would you store the transformed data into Delta Lake?

Spark/Big Datahard
6

Alternatives to the Medallion Architecture

Spark/Big Datahard
7

Apache Spark Architecture - RDD, DAG, cluster manager, driver node, worker node

Spark/Big Datahard
8

Apache Spark Fundamentals - discuss

Spark/Big Datahard

+20 More Questions with Expert Answers

Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.

Get PDF Bundle - from $21Try Free Sample
Previous1...6566676869...94Next