Data engineering interview questions · easy
Count the number of nulls in each column of a table.
Create a SQL query to identify customers with purchases above a dynamic threshold.
Data Modeling and Airflow Scheduling - star schema, cron, backfill
Database vs Data Warehouse vs Data Mart vs Data Lake
Define cursors and stored procedures and their use cases.
Describe a scenario where you had to collaborate with a cross-functional team to deliver a solution.
Describe a scenario where you had to make trade-offs between data processing speed and accuracy. How did you approach this situation and what was the outcome?
Describe a situation where you made a mistake in a data pipeline. How did you identify and fix it?
Describe a situation where you prioritized business needs over technical elegance. How did you manage trade-offs?
Describe how you would implement Slowly Changing Dimensions (SCD) in an ETL workflow.
Describe the process for migrating data from an on-premises SQL database to AWS. What services and strategies would you use?
Did you review the job description? Why are you interested in this role?
Discuss a situation where you had to balance technical priorities and business goals.
Discuss how you handled null values or unstructured data in your previous projects.
Does a Common Table Expression store data? If not, how does it function in SQL?
Error Handling in T-SQL - TRY...CATCH, THROW, RAISEERROR
Explain Coalesce vs ISNULL. What are the differences in SQL?
Explain ETL process flags and segregation of steps.
Explain Kafka messaging guarantees and Snowflake schema evolution.
Explain Slowly Changing Dimensions (SCD) and its types
Type or paste your answer to any of these questions and our AI Coach scores it, highlights gaps, and rewrites it at FAANG quality. Free to try.
SQL is the most tested topic in data engineering interviews. Most companies dedicate an entire round to SQL, typically asking 3-5 questions covering window functions, CTEs, joins, optimization, and platform-specific features.
Focus on: window functions (RANK, ROW_NUMBER, LAG/LEAD), CTEs and recursive queries, query optimization and execution plans, indexing strategies, and platform-specific features for BigQuery, Redshift, or Snowflake depending on the company.
Yes. Data engineering SQL rounds emphasize analytical queries (window functions, aggregations), large-scale optimization (partitioning, indexing), and data warehouse concepts (star schema, slowly changing dimensions). Software engineering SQL tends to focus on CRUD operations and basic joins.
For a mid-level data engineering role, plan 2-4 weeks of focused SQL practice. Cover window functions, CTEs, optimization, and practice writing queries under time pressure. Use real interview questions from companies you're targeting.