DataEngPrep.tech

Interview Questions

Real questions from top companies · medium

700+ Easy · 450+ Medium · 650+ Hard

What is a self-join, and when would you use it?

SQL · medium · join · 2 min read
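
One way to illustrate a self-join is the classic employee-to-manager lookup, where a table is joined to itself under two aliases. The sketch below is only an illustration: the `employees` table, its columns, and the sample rows are hypothetical, and SQLite (via Python's standard `sqlite3` module) stands in for whatever database an interviewer has in mind.

```python
import sqlite3

# In-memory database with a hypothetical employees table.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, manager_id INTEGER);
    INSERT INTO employees VALUES
        (1, 'Asha', NULL),
        (2, 'Ben',  1),
        (3, 'Chen', 1),
        (4, 'Dina', 2);
""")

# Self-join: alias e is the employee row, alias m is that employee's manager row.
pairs = conn.execute("""
    SELECT e.name AS employee, m.name AS manager
    FROM employees AS e
    JOIN employees AS m ON e.manager_id = m.id
    ORDER BY e.id
""").fetchall()

for employee, manager in pairs:
    print(f"{employee} reports to {manager}")
```

Because this is an inner join, the row with no manager (`manager_id` is NULL) drops out; a `LEFT JOIN` would keep it with a NULL manager.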

What is normalization and denormalization? When would you use each?

SQL · medium · etl · join · 2 min read

What is the difference between a view and a materialized view?

SQL · medium · 2 min read

Write an SQL query to find duplicate emails in a users table.

SQL · medium · partition · sql · window · 0.5 min read

Daniel Wellington · Goldman Sachs · Swiggy
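
A minimal sketch of one common answer to the duplicate-emails prompt, grouping by email and keeping groups with more than one row. The `users` schema and the sample addresses are assumptions made for the example, and SQLite through Python's `sqlite3` module is used only so the query can be run as-is.

```python
import sqlite3

# Hypothetical users table with some repeated emails.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT);
    INSERT INTO users VALUES
        (1, 'a@example.com'),
        (2, 'b@example.com'),
        (3, 'a@example.com'),
        (4, 'c@example.com'),
        (5, 'a@example.com'),
        (6, 'b@example.com');
""")

# GROUP BY collapses rows per email; HAVING keeps only emails seen more than once.
duplicates = conn.execute("""
    SELECT email, COUNT(*) AS occurrences
    FROM users
    GROUP BY email
    HAVING COUNT(*) > 1
    ORDER BY email
""").fetchall()

print(duplicates)
```

A window-function variant (`COUNT(*) OVER (PARTITION BY email)`) is useful when the full duplicate rows are needed, not only the email values.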

Explain triggers in Azure Data Factory (ADF), especially tumbling window triggers.

SQL · medium · partition · window · 0.5 min read

Accenture · Yash Technologies

What is a window function? Explain with an example.

SQL · medium · join · partition · window · 0.5 min read
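
As a hedged illustration of the window-function prompt, the sketch below computes a per-region running total. Unlike `GROUP BY`, the window keeps every input row and adds a value computed over related rows. The `sales` table and its data are hypothetical, and the example assumes a SQLite build with window-function support (3.25 or later), which recent Python releases typically bundle.

```python
import sqlite3

# Hypothetical daily sales per region.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE sales (region TEXT, day INTEGER, amount INTEGER);
    INSERT INTO sales VALUES
        ('east', 1, 10),
        ('east', 2, 20),
        ('east', 3, 5),
        ('west', 1, 7),
        ('west', 2, 3);
""")

# SUM(...) OVER (...) is evaluated per row: PARTITION BY restarts the total
# for each region, ORDER BY makes it cumulative by day.
running = conn.execute("""
    SELECT region, day, amount,
           SUM(amount) OVER (PARTITION BY region ORDER BY day) AS running_total
    FROM sales
    ORDER BY region, day
""").fetchall()

for row in running:
    print(row)
```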

What is the difference between OLTP and OLAP?

SQL · medium · bigquery · snowflake · 2 min read

Write a SQL query to find top 3 earners in each department.

SQL · medium · partition · sql · 2 min read

FedEx Dataworks · Incedo

Write a query to find the top three highest-paid employees in each department using window functions.

SQL · medium · partition · window · 2 min read

Bristol Myers Squibb · Wipro
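
One possible shape for the top-three-per-department query, sketched with `DENSE_RANK()`. The `employees` table, names, and salaries are invented for the example, and SQLite 3.25 or later is assumed for window-function support. Note that the choice of ranking function is itself an assumption about the requirement: `DENSE_RANK()` keeps everyone in the top three distinct salaries, while `ROW_NUMBER()` would return exactly three rows per department and break ties arbitrarily.

```python
import sqlite3

# Hypothetical employees table; Bo and Cy tie on salary.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE employees (department TEXT, name TEXT, salary INTEGER);
    INSERT INTO employees VALUES
        ('eng', 'Ana', 150),
        ('eng', 'Bo',  140),
        ('eng', 'Cy',  140),
        ('eng', 'Di',  130),
        ('eng', 'Ed',  120),
        ('ops', 'Fay', 90),
        ('ops', 'Gus', 80);
""")

# The rank is computed in a CTE because a window function's result
# cannot be filtered in the WHERE clause of the same SELECT.
top_earners = conn.execute("""
    WITH ranked AS (
        SELECT department, name, salary,
               DENSE_RANK() OVER (
                   PARTITION BY department ORDER BY salary DESC
               ) AS rnk
        FROM employees
    )
    SELECT department, name, salary, rnk
    FROM ranked
    WHERE rnk <= 3
    ORDER BY department, salary DESC, name
""").fetchall()

for row in top_earners:
    print(row)
```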

Write complex SQL queries involving multiple joins, subqueries, and data aggregation logic.

SQL · medium · join · partition · sql · 0.7 min read

Apple · Tiger Analytics

Convert complex SQL (CTEs, window functions, subqueries) to production-grade PySpark. Discuss when to use spark.sql() vs. DataFrame API, and the implications for testability, partitioning, and execution predictability.

Spark/Big Data · medium · partition · python · spark · 0.8 min read

Datametica · S&P Global

Explain how Adaptive Query Execution changes the economics of Spark tuning. What problems does it solve at runtime, and when might you still need manual intervention (e.g., salting, broadcast hints)?

Spark/Big Data · medium · join · partition · spark · 0.6 min read

FedEx Dataworks · PWC

Architect incremental load in ADF + Databricks with idempotency, late-arrival handling, and cost/scalability implications of watermark vs. change data capture.

Spark/Big Data · medium · partition · 1 min read

Explain strategies for managing schema changes in PySpark over time.

Spark/Big Data · medium · partition · spark · 0.8 min read

Accenture · Yash Technologies

How do you drop columns with null values in PySpark?

Spark/Big Data · medium · partition · spark · 0.6 min read

Datametica · Globant

How do you handle data skewness in Spark?

Spark/Big Data · medium · join · partition · spark · 0.7 min read

Accenture · Bitwise
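
Key salting is one technique often mentioned for skew. The sketch below is a plain-Python simulation of the idea, not Spark code: `partition_for` and `partition_for_salted` are hypothetical stand-ins for a hash partitioner, and the partition count, salt count, and record mix are all assumed values. It only shows why spreading a hot key across salt values evens out partition sizes. In an actual Spark join the salt would usually be appended to the key on the skewed side, with the other side replicated once per salt value.

```python
from collections import Counter
from zlib import crc32

NUM_PARTITIONS = 8  # assumed partition count for the illustration
SALT_BUCKETS = 8    # assumed number of distinct salt values


def partition_for(key: str) -> int:
    """Deterministic stand-in for a hash partitioner on the key alone."""
    return crc32(key.encode("utf-8")) % NUM_PARTITIONS


def partition_for_salted(key: str, salt: int) -> int:
    """Partition on the composite (key, salt) so one hot key fans out."""
    return (crc32(key.encode("utf-8")) + salt) % NUM_PARTITIONS


# Hypothetical skewed input: a single hot key dominates the data.
records = ["hot"] * 800 + ["a"] * 10 + ["b"] * 10

# Without salting, every "hot" record lands in the same partition.
plain = Counter(partition_for(key) for key in records)

# With salting, a round-robin salt spreads each key over several partitions.
salted = Counter(
    partition_for_salted(key, i % SALT_BUCKETS) for i, key in enumerate(records)
)

max_plain = max(plain.values())
max_salted = max(salted.values())
print("largest partition without salting:", max_plain)
print("largest partition with salting:", max_salted)
```

The trade-off is an extra aggregation or replication step to recombine results per original key, so salting tends to be worth it only when a few keys dominate.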

How would you read data from a web API using PySpark?

Spark/Big Data · medium · airflow · partition · spark · 0.7 min read

Altimetrik · Infosys

What is Adaptive Query Execution (AQE) in Spark 3.x, and how does it improve performance?

Spark/Big Data · medium · join · partition · spark · 0.6 min read

HashedIn · Snowflake

What is the difference between repartition and coalesce in Spark?

Spark/Big Data · medium · partition · spark · 0.6 min read

Accenture · FedEx Dataworks

When and how do you use Broadcast Join in Spark?

Spark/Big Data · medium · join · spark · sql · 0.6 min read

Delivery Hero · Fragma Data Systems

© 2026 DataEngPrep.tech. All rights reserved.
