DataEngPrep.tech

JavaScript is required to use this application. Please enable JavaScript in your browser settings or disable any extensions that may be blocking scripts.

DataEngPrep.tech

Questions Practice AI Coach Dashboard Packs Blog

Interview Questions

Real questions from top companies

700+ Easy450+ Medium650+ Hard

All Categories Behavioral Spark/Big Data SQL Python/Coding System Design/Architecture Cloud/Tools General/Othereasy medium hard

How would you handle data type changes for an existing column?

SQLmediumwindow0.8 min read

How would you handle duplicate or corrupted data in a batch ETL job?

SQLmediumetlpartitionspark0.7 min read

How would you handle null values in a dataset, especially in a single column?

SQLeasy0.7 min read

How would you handle nulls in a SQL join? Provide examples using COALESCE.

SQLmediumjoinsql0.7 min read

Bristol Myers Squibb

How would you identify duplicate records based on a composite key in SQL?

SQLmediumpartitionsqlwindow0.8 min read

How would you optimize a SQL query for better performance when working with large datasets?

SQLhardjoinoptimizationpartition0.7 min read

How would you optimize a query fetching sales data across multiple countries with billions of rows?

SQLmediumbigquerypartitionsnowflake0.6 min read

How would you optimize a query with multiple joins and subqueries?

SQLmediumjoin0.7 min read

American Express

How would you prevent small file problems in S3 when loading data into Redshift?

SQLmediumetlpartitionspark0.6 min read

How would you retrieve the first and last order for each customer from a sales table?

SQLmediumpartitionwindow0.7 min read

Identify and remove duplicate records from a table, keeping the most recent record based on a timestamp column.

SQLmediumpartitionsparksql0.6 min read

Identify consecutive numbers in a column (at least 3 consecutive).

SQLeasy0.7 min read

If manual partitions are created in a Hive data-warehouse table directory, and you query records from those partitions, will you see the data? If not, how can this be fixed?

SQLmediumpartition0.6 min read

Implement a CASE WHEN condition - medium difficulty

SQLmedium0.7 min read

In Python, process a large CSV in chunks and remove duplicate records based on email and timestamp.

SQLhardpython0.5 min read

Indexing - True/False question on indexes and query optimization

SQLhardjoinoptimization0.6 min read

Indexing – Types and Benefits?

SQLmediumjoin0.5 min read

Indexing: When to Use and Avoid

SQLmediumjoinsql0.5 min read

Integration of Snowflake with external data sources such as S3, GCS, and Blob Storage?

SQLmediumsnowflakewindow0.4 min read

Joins: Different types and their use cases

SQLmediumjoin0.5 min read

Reading isn't practice. Get AI feedback on your answers.

Type or paste your answer to any of these questions and our AI Coach scores it, highlights gaps, and rewrites it at FAANG quality. Free to try.

Try AI Answer Coach — Free Start a Mock Interview

Previous 1...54 55 56 57 58...94 Next

Categories

All Questions SQL Spark / Big Data Python / Coding System Design Cloud / Tools Behavioral

By Company

Amazon Google Databricks Snowflake Microsoft Netflix Uber TCS

Interview Guides

All Guides Top SQL Questions Top Spark Questions Top Python Questions Top System Design SQL Window Functions ETL Questions Data Modeling

Products

AI Interview Coach Answer Analyzer SQL Playground Resume Analyzer Interview Packs Pricing

Company

About Us Contact Us AI Disclosure Disclaimer Terms of Service Privacy Policy

© 2026 DataEngPrep.tech. All rights reserved.

About Blog Contact Disclaimer