DataEngPrep.tech

Interview Questions

Real questions from top companies · medium

700+ Easy · 450+ Medium · 650+ Hard
41

What is a self-join, and when would you use it?

SQL · medium · join · 0.4 min read
Presidio, Swiggy
42

What is normalization and denormalization? When would you use each?

SQL · medium · etl · join · 0.4 min read
Presidio, Swiggy
43

What is the difference between a view and a materialized view?

SQL · medium · 0.4 min read
Presidio, Swiggy
44

Write an SQL query to find duplicate emails in a users table.

SQL · medium · partition · sql · window · 0.5 min read
Daniel Wellington, Goldman Sachs, Swiggy
45

Explain triggers in Azure Data Factory (ADF), especially tumbling window triggers.

SQL · medium · partition · window · 0.5 min read
Accenture, Yash Technologies
46

What is a window function? Explain with an example.

SQL · medium · join · partition · window · 0.5 min read
Citi, Freecharge
47

What is the difference between OLTP and OLAP?

SQL · medium · bigquery · snowflake · 0.4 min read
Chryselys, EY
48

Write an SQL query to find the top 3 earners in each department.

SQL · medium · partition · sql · 0.4 min read
FedEx Dataworks, Incedo
49

Write a query to find the top three highest-paid employees in each department using window functions.

SQL · medium · partition · window · 0.4 min read
Bristol Myers Squibb, Wipro
50

Write complex SQL queries involving multiple joins, subqueries, and data aggregation logic.

SQL · medium · join · partition · sql · 0.7 min read
Apple, Tiger Analytics
51

Convert complex SQL (CTEs, window functions, subqueries) to production-grade PySpark. Discuss when to use spark.sql() vs. DataFrame API, and the implications for testability, partitioning, and execution predictability.

Spark/Big Data · medium · partition · python · spark · 0.8 min read
Datametica, S&P Global
52

Explain how Adaptive Query Execution changes the economics of Spark tuning. What problems does it solve at runtime, and when might you still need manual intervention (e.g., salting, broadcast hints)?

Spark/Big Data · medium · join · partition · spark · 0.6 min read
FedEx Dataworks, PWC
53

Architect incremental load in ADF + Databricks with idempotency, late-arrival handling, and cost/scalability implications of watermark vs. change data capture.

Spark/Big Data · medium · partition · 1 min read
Deloitte, Incedo
54

Explain strategies for managing schema changes in PySpark over time.

Spark/Big Data · medium · partition · spark · 0.8 min read
Accenture, Yash Technologies
55

How do you drop columns with null values in PySpark?

Spark/Big Data · medium · partition · spark · 0.6 min read
Datametica, Globant
56

How do you handle data skewness in Spark?

Spark/Big Data · medium · join · partition · spark · 0.7 min read
Accenture, Bitwise
57

How would you read data from a web API using PySpark?

Spark/Big Data · medium · airflow · partition · spark · 0.7 min read
Altimetrik, Infosys
58

What is Adaptive Query Execution (AQE) in Spark 3.x, and how does it improve performance?

Spark/Big Data · medium · join · partition · spark · 0.6 min read
HashedIn, Snowflake
59

What is the difference between repartition and coalesce in Spark?

Spark/Big Data · medium · partition · spark · 0.6 min read
Accenture, FedEx Dataworks
60

When and how do you use Broadcast Join in Spark?

Spark/Big Data · medium · join · spark · sql · 0.6 min read
Delivery Hero, Fragma Data Systems

© 2026 DataEngPrep.tech. All rights reserved.