Data engineering interview questions
Write an SQL query to find duplicate emails in a users table.
Triggers in ADF, especially tumbling window triggers.
What is a CTE (Common Table Expression)? What are its uses?
What is a window function? Explain with an example.
What is the difference between OLTP and OLAP?
Write a SQL query to find top 3 earners in each department.
Write a query to find the top three highest-paid employees in each department using window functions.
Write complex SQL queries involving multiple joins, subqueries, and data aggregation logic.
Add Row Numbers using window function in PySpark
Add a column to the Employees table that shows the name of the employee with the next higher employee_id.
Add a new column with manager names for each employee using a self-join.
Add a new column with the average salary by department.
Advanced SQL with CTEs and Conditional Joins
Aggregate surface areas and calculate cumulative surface area using the LAG function.
Analyze the output of various joins (LEFT, RIGHT, INNER, CROSS, FULL OUTER) on the given tables.
Are you comfortable with the variable pay structure, and what are your expectations for the base salary?
Building ETL pipelines to capture changes when new records are inserted into source tables?
CSV Without Column Names/Schema - how to read
Calculate the cumulative transaction amount for each month using a transaction table.
Can CASE statements be used in an UPDATE query?
Type or paste your answer to any of these questions and our AI Coach scores it, highlights gaps, and rewrites it at FAANG quality. Free to try.
SQL is the most tested topic in data engineering interviews. Most companies dedicate an entire round to SQL, typically asking 3-5 questions covering window functions, CTEs, joins, optimization, and platform-specific features.
Focus on: window functions (RANK, ROW_NUMBER, LAG/LEAD), CTEs and recursive queries, query optimization and execution plans, indexing strategies, and platform-specific features for BigQuery, Redshift, or Snowflake depending on the company.
Yes. Data engineering SQL rounds emphasize analytical queries (window functions, aggregations), large-scale optimization (partitioning, indexing), and data warehouse concepts (star schema, slowly changing dimensions). Software engineering SQL tends to focus on CRUD operations and basic joins.
For a mid-level data engineering role, plan 2-4 weeks of focused SQL practice. Cover window functions, CTEs, optimization, and practice writing queries under time pressure. Use real interview questions from companies you're targeting.