DataEngPrep.tech

Interview Questions

Real questions from top companies · medium

101

How do you manage data storage in AWS?

Cloud/Tools · medium · partition · 0.2 min read
Wipro
102

How do you merge data from different sources in ADF while maintaining data quality?

Cloud/Tools · medium · join, window · 0.3 min read
Virtusa
103

How would you optimize an ADF pipeline for high performance?

Cloud/Tools · medium · partition · 0.2 min read
Persistent Systems
104

How would you migrate 1TB of data using ADF?

Cloud/Tools · medium · partition · 0.8 min read
Virtusa
105

How would you optimize cost when using AWS for large-scale data processing?

Cloud/Tools · medium · partition · 0.7 min read
Wipro
106

Lambda vs. Glue: Discuss use cases for both services.

Cloud/Tools · medium · etl, join, spark · 0.7 min read
Bitwise
107

What alternatives to Kinesis would you consider for real-time data ingestion?

Cloud/Tools · medium · partition · 0.5 min read
Capco
108

What integration challenges might you face with Glue Catalog in non-AWS environments?

Cloud/Tools · medium · bigquery, partition · 0.4 min read
Capco
109

APPLY Operator - CROSS APPLY and OUTER APPLY

General/Other · medium · join, sql · 0.3 min read
Kaseya
110

An existing job suddenly starts running longer: how would you analyze the issue?

General/Other · medium · partition, spark · 0.4 min read
Citi
111

Calculate a 7-day moving average of clicks for each user_id

General/Other · medium · partition, spark, sql · 0.2 min read
Matrix
112

Calculate a 7-day moving average of orders for each city in the Swiggy database.

General/Other · medium · partition, spark, sql · 0.2 min read
Swiggy
113

Calculate cumulative sales for each product in each store, ordered by sale_date

General/Other · medium · partition, spark, sql · 0.2 min read
Matrix
114

Calculate the total number of transactions (units sold) for each product.

General/Other · medium · join, spark, sql · 0.1 min read
Wayfair
115

Calculate the total sales amount for customers born between 1998-01-15 and 2000-01-15.

General/Other · medium · join, spark, sql · 0.2 min read
Aarete
116

Compute the moving average of daily transactions over a 7-day window.

General/Other · medium · spark, sql, window · 0.3 min read
Goldman Sachs
117

Data Shuffling Causes and Techniques

General/Other · medium · join, partition, spark · 0.2 min read
Nagarro
118

Describe a scenario where you had to optimize a slow-running data pipeline.

System Design/Architecture · medium · join, partition · 0.2 min read
Swiggy
119

Describe a time when you had to deal with a major data quality issue. How did you handle it?

General/Other · medium · join · 0.2 min read
Goldman Sachs
120

Describe the concept of data sharding and when to use it.

General/Other · medium · join, partition · 0.6 min read
Goldman Sachs

© 2026 DataEngPrep.tech. All rights reserved.