DataEngPrep.tech
QuestionsPracticeAI CoachDashboardPacksBlog
ProLogin

Interview Questions

Real questions from top companies

700+ Easy450+ Medium650+ Hard
All CategoriesBehavioralSpark/Big DataSQLPython/CodingSystem Design/ArchitectureCloud/ToolsGeneral/Othereasymediumhard
1101

How would you handle data type changes for an existing column?

SQLmediumwindow0.8 min read
Capco
β†’
1102

How would you handle duplicate or corrupted data in a batch ETL job?

SQLmediumetlpartitionspark0.7 min read
Adidas
β†’
1103

How would you handle null values in a dataset, especially in a single column?

SQLeasy0.7 min read
Infosys
β†’
1104

How would you handle nulls in a SQL join? Provide examples using COALESCE.

SQLmediumjoinsql0.7 min read
Bristol Myers Squibb
β†’
1105

How would you identify duplicate records based on a composite key in SQL?

SQLmediumpartitionsqlwindow0.8 min read
Amazon
β†’
1106

How would you optimize a SQL query for better performance when working with large datasets?

SQLhardjoinoptimizationpartition0.7 min read
Tredence
β†’
1107

How would you optimize a query fetching sales data across multiple countries with billions of rows?

SQLmediumbigquerypartitionsnowflake0.6 min read
Adidas
β†’
1108

How would you optimize a query with multiple joins and subqueries?

SQLmediumjoin0.7 min read
American Express
β†’
1109

How would you prevent small file problems in S3 when loading data into Redshift?

SQLmediumetlpartitionspark0.6 min read
Capco
β†’
1110

How would you retrieve the first and last order for each customer from a sales table?

SQLmediumpartitionwindow0.7 min read
Wipro
β†’
1111

Identify and remove duplicate records from a table, keeping the most recent record based on a timestamp column.

SQLmediumpartitionsparksql0.6 min read
Goldman Sachs
β†’
1112

Identify consecutive numbers in a column (at least 3 consecutive).

SQLeasy0.7 min read
Incedo
β†’
1113

If manual partitions are created in a Hive data-warehouse table directory, and you query records from those partitions, will you see the data? If not, how can this be fixed?

SQLmediumpartition0.6 min read
Dunnhumby
β†’
1114

Implement a CASE WHEN condition - medium difficulty

SQLmedium0.7 min read
Wolters Kluwer
β†’
1115

In Python, process a large CSV in chunks and remove duplicate records based on email and timestamp.

SQLhardpython0.5 min read
Amazon
β†’
1116

Indexing - True/False question on indexes and query optimization

SQLhardjoinoptimization0.6 min read
Myntra
β†’
1117

Indexing – Types and Benefits?

SQLmediumjoin0.5 min read
Comcast
β†’
1118

Indexing: When to Use and Avoid

SQLmediumjoinsql0.5 min read
NAB
β†’
1119

Integration of Snowflake with external data sources such as S3, GCS, and Blob Storage?

SQLmediumsnowflakewindow0.4 min read
Snowflake
β†’
1120

Joins: Different types and their use cases

SQLmediumjoin0.5 min read
ZS Associates
β†’

Reading isn't practice. Get AI feedback on your answers.

Type or paste your answer to any of these questions and our AI Coach scores it, highlights gaps, and rewrites it at FAANG quality. Free to try.

Try AI Answer Coach β€” FreeStart a Mock Interview
Previous1...5455565758...94Next
Categories
All QuestionsSQLSpark / Big DataPython / CodingSystem DesignCloud / ToolsBehavioral
By Company
AmazonGoogleDatabricksSnowflakeMicrosoftNetflixUberTCS
Interview Guides
All GuidesTop SQL QuestionsTop Spark QuestionsTop Python QuestionsTop System DesignSQL Window FunctionsETL QuestionsData Modeling
Products
AI Interview CoachAnswer AnalyzerSQL PlaygroundResume AnalyzerInterview PacksPricing
Company
About UsContact UsAI DisclosureDisclaimerTerms of ServicePrivacy Policy
Β© 2026 DataEngPrep.tech. All rights reserved.
AboutBlogContactDisclaimer