Interview Questions

Real questions from top companies · medium

700+ Easy · 450+ Medium · 650+ Hard

What is the difference between repartition and coalesce in Apache Spark?

Spark/Big Data · medium · partition, python, spark · 1 min read

BCG, Citi, Dunnhumby, Fragma Data Systems +3

Write an SQL query to find the second-highest salary from an employee table.

SQL · medium · partition, sql, window · 0.8 min read

Accenture, BCG, Cognizant, Incedo +2

What is the difference between cache() and persist() in Spark? When would you use each?

Spark/Big Data · medium · partition, spark · 0.7 min read

Accenture, Coforge, Freecharge, Impetus +1

What is the difference between groupByKey and reduceByKey in Spark?

Spark/Big Data · medium · partition, spark · 0.8 min read

Accenture, Capco, Coforge, Nagarro +1

What is the difference between narrow and wide transformations in Apache Spark? Explain with examples.

Spark/Big Data · medium · join, partition, python · 0.9 min read

Coforge, Delivery Hero, Dunnhumby, Fragma Data Systems +1

Demonstrate the difference between DENSE_RANK() and RANK()

SQL · medium · partition, window · 0.5 min read

Capco, Impetus, KPMG, Wipro

Discuss differences between ROW_NUMBER(), RANK(), and DENSE_RANK(), and provide examples from your projects.

SQL · medium · window · 0.5 min read

Aarete, Accenture, Fossil Group, Yash Technologies

Explain the differences between Data Warehouse, Data Lake, and Delta Lake

SQL · medium · bigquery, partition, snowflake · 0.5 min read

Fractal, KPMG, Matrix, Meesho

Explain the differences between Repartition and Coalesce. When would you use each?

SQL · medium · join, partition · 0.5 min read

Datametica, FedEx Dataworks, Nihilent, Presidio

What is the difference between partitioning and bucketing in Spark, and when would you use bucketing?

SQL · medium · join, partition, spark · 0.5 min read

Citi, Coforge, HCL, LTIMindtree

What strategies can you use to handle skewed data in Spark?

Spark/Big Data · medium · join, partition, spark · 0.5 min read

BCG, Bitwise, Citi, HashedIn

Can you explain the difference between OLTP and OLAP?

SQL · medium · bigquery, snowflake, sql · 0.4 min read

Accenture, Cognizant, EPAM, Yash Technologies

Describe a time when you had to optimize a slow SQL query. What steps did you take?

SQL · medium · join, sql · 0.5 min read

Aarete, Accenture, Fossil Group, Yash Technologies

Explain the difference between INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN.

SQL · medium · join · 0.5 min read

Accenture, Cognizant, EPAM, Yash Technologies

How do you handle NULL values in SQL? Mention functions like COALESCE and NULLIF.

SQL · medium · join, sql · 0.4 min read

Accenture, Cognizant, EPAM, Yash Technologies

What is the difference between WHERE and HAVING clauses in SQL?

SQL · medium · sql · 0.3 min read

Accenture, Cognizant, EPAM, Yash Technologies

Write a Python function to check if a string is a palindrome.

Python/Coding · medium · join, python · 0.4 min read

Capco, HashedIn, LTIMindtree

Describe a scenario where partitioning and bucketing would improve query performance.

SQL · medium · join, partition · 0.7 min read

Daniel Wellington, Goldman Sachs, Swiggy

Explain the types of triggers in ADF, including schedule, tumbling window, and event-based triggers.

SQL · medium · partition, window · 0.5 min read

FedEx Dataworks, Nihilent, Virtusa

How do you remove duplicate rows in BigQuery?

SQL · medium · bigquery, partition · 0.6 min read

EY, Incedo, Tech Mahindra
