DataEngPrep.tech
QuestionsBlogStore
Get PDF Bundle

Interview Questions

Real questions from top companies

700+ Easy450+ Medium650+ Hard
All CategoriesBehavioralSpark/Big DataSQLPython/CodingSystem Design/ArchitectureCloud/ToolsGeneral/Othereasymediumhard
41

How does Spark's Catalyst Optimizer work? Explain its stages.

Spark/Big Datahardjoinoptimizationspark0.5 min read
DunnhumbyFragma Data SystemsHashedIn
→
42

How do you handle late-arriving data in Spark Structured Streaming?

Spark/Big Datahardsparkwindow0.5 min read
BitwiseIncedoSwiggy
→
43

What is the difference between Managed and External tables in Hive/Spark?

Spark/Big Dataeasyspark0.4 min read
CitiDunnhumbyFragma Data Systems
→
44

What is the small-file problem in Spark, and how do you solve it?

Spark/Big Datahardpartitionspark0.5 min read
Daniel WellingtonIncedoSwiggy
→
45

Explain the concept of Broadcast Join in Spark. When should it be used?

Spark/Big Datamediumjoinsparksql0.4 min read
Delivery HeroDunnhumbyFragma Data Systems
→
46

How do you optimize Spark jobs for better performance? Mention at least 5 techniques.

Spark/Big Datahardjoinoptimizationpartition0.5 min read
Fragma Data SystemsPresidioSwiggy
→
47

What is the difference between a list and a tuple in Python?

Python/Codingeasypython0.3 min read
AccentureDelivery HeroFragma Data Systems
→
48

Explain the difference between shallow copy and deep copy in Python.

Python/Codingeasypython0.5 min read
Delivery HeroDunnhumbyFragma Data Systems
→
49

Write a Python function to find the first non-repeating character in a string.

Python/Codingeasypython0.4 min read
Delivery HeroDunnhumbyFragma Data Systems
→
50

What are decorators in Python, and how do they work?

Python/Codingeasypython0.3 min read
Delivery HeroFragma Data SystemsSwiggy
→
51

Explain the difference between args and kwargs in Python.

Python/Codingeasypython0.3 min read
Delivery HeroFragma Data SystemsSwiggy
→
52

How do you ensure smooth communication between data scientists, business teams, and developers?

Behavioraleasy0.7 min read
AccentureYash Technologies
→
53

How do you handle conflicts within a team? Provide an example.

Behavioralhard0.7 min read
EPAMJIO
→
54

How do you handle disagreements within a team?

Behavioraleasy0.6 min read
ExpediaWarner Bros Discovery
→
55

Tell me about a time when you faced a challenging situation at work and how you handled it.

Behavioralmediumairflowpartitionspark0.7 min read
FreechargeWalmart
→
56

What challenges did you face, and how did you tackle them?

Behavioralmediumjoinpartitionspark0.6 min read
Delivery HeroGrover
→
57

What is the most difficult task you've ever worked on?

Behavioraleasylakehousesparksql0.6 min read
CognizantIncedo
→
58

What would you do if a pipeline failed and you couldn't find the reason?

Behavioralmediumpartitionspark0.7 min read
Delivery HeroGrover
→
59

Why are you leaving your current company?

Behavioralhard0.6 min read
AareteIncedo
→
60

Why do you want to join this company?

Behavioralmediumjoin0.5 min read
AccentureDelivery HeroFragma Data Systems
→
Previous12345...94Next