DataEngPrep.tech
QuestionsPracticeAI CoachDashboardPacksBlog
ProLogin

Interview Questions

Real questions from top companies in General/Other Β· medium

700+ Easy450+ Medium650+ Hard
All CategoriesBehavioralSpark/Big DataSQLPython/CodingSystem Design/ArchitectureCloud/ToolsGeneral/Othereasymediumhard
1

How would you read data from a web API? What steps would you follow after reading the data?

General/Othermediumetlpartition0.7 min read
AltimetrikInfosys
β†’
2

What is the difference between SQL and NoSQL databases?

General/Othermediumjoinsql0.7 min read
AareteDunnhumbyFragma Data Systems
β†’
3

APPLY Operator - CROSS APPLY and OUTER APPLY

General/Othermediumjoinsql0.3 min read
Kaseya
β†’
4

An existing job running longer suddenly: how to analyze the issue?

General/Othermediumpartitionspark0.4 min read
Citi
β†’
5

Calculate a 7-day moving average of clicks for each user_id

General/Othermediumpartitionsparksql0.2 min read
Matrix
β†’
6

Calculate a 7-day moving average of orders for each city in the Swiggy database.

General/Othermediumpartitionsparksql0.2 min read
Swiggy
β†’
7

Calculate cumulative sales for each product in each store, ordered by sale_date

General/Othermediumpartitionsparksql0.2 min read
Matrix
β†’
8

Calculate the total number of transactions (units sold) for each product.

General/Othermediumjoinsparksql0.1 min read
Wayfair
β†’
9

Calculate the total sales amount for customers born between 1998-01-15 and 2000-01-15.

General/Othermediumjoinsparksql0.2 min read
Aarete
β†’
10

Compute the moving average of daily transactions over a 7-day window.

General/Othermediumsparksqlwindow0.3 min read
Goldman Sachs
β†’
11

Data Shuffling Causes and Techniques

General/Othermediumjoinpartitionspark0.2 min read
Nagarro
β†’
12

Describe a time when you had to deal with a major data quality issue. How did you handle it?

General/Othermediumjoin0.2 min read
Goldman Sachs
β†’
13

Describe the concept of data sharding and when to use it.

General/Othermediumjoinpartition0.6 min read
Goldman Sachs
β†’
14

Describe your approach to managing data deduplication.

General/Othermediumpartitionwindow0.5 min read
Fragma Data Systems
β†’
15

Discuss Primary, Foreign, and Composite Keys.

General/Othermediumjoin0.3 min read
Datametica
β†’
16

Discuss the average data volume handled and strategies used for efficient processing.

General/Othermediumpartitionspark0.2 min read
Yash Technologies
β†’
17

Explain how you would implement a caching mechanism for frequently accessed video metadata.

General/Othermediumpartition0.3 min read
Disney+ Hotstar
β†’
18

Extract insights from given JSON data using your preferred framework.

General/Othermediumpartitionspark0.2 min read
Flipkart
β†’
19

Fetch the rows with the highest scores for each student in a year.

General/Othermediumjoinpartitionsql0.3 min read
Deolite
β†’
20

Find All Numbers that Appear at Least Three Times Consecutively

General/Othermediumpartition0.2 min read
Meesho
β†’

Reading isn't practice. Get AI feedback on your answers.

Type or paste your answer to any of these questions and our AI Coach scores it, highlights gaps, and rewrites it at FAANG quality. Free to try.

Try AI Answer Coach β€” FreeStart a Mock Interview
123Next
Categories
All QuestionsSQLSpark / Big DataPython / CodingSystem DesignCloud / ToolsBehavioral
By Company
AmazonGoogleDatabricksSnowflakeMicrosoftNetflixUberTCS
Interview Guides
All GuidesTop SQL QuestionsTop Spark QuestionsTop Python QuestionsTop System DesignSQL Window FunctionsETL QuestionsData Modeling
Products
AI Interview CoachAnswer AnalyzerSQL PlaygroundResume AnalyzerInterview PacksPricing
Company
About UsContact UsAI DisclosureDisclaimerTerms of ServicePrivacy Policy
Β© 2026 DataEngPrep.tech. All rights reserved.
AboutBlogContactDisclaimer