DataEngPrep.tech
QuestionsPracticeAI CoachDashboardPacksBlog
ProLogin

Interview Questions

Real questions from top companies Β· medium

700+ Easy450+ Medium650+ Hard
All CategoriesBehavioralSpark/Big DataSQLPython/CodingSystem Design/ArchitectureCloud/ToolsGeneral/Othereasymediumhard
141

Match countries in a pairwise format

General/Othermediumjoinpartitionpython0.3 min read
KPMG
β†’
142

Reverse operation for splitting values back to original format

General/Othermediumetljoinpython0.1 min read
Pubmatic
β†’
143

Shell commands for renaming a file?

General/Othermediumwindow0.1 min read
Citi
β†’
144

Solve Minimum Remove to Make Valid Parentheses.

General/Othermediumjoinpython0.3 min read
Disney+ Hotstar
β†’
145

Steps to Verify Source and Target Data Match After Load

General/Othermediumjoinpartition0.2 min read
Verizon
β†’
146

WAQ for Desired Output (Node Parent Relationship)

General/Othermediumjoin0.3 min read
TCS
β†’
147

What are the benefits of the COPY command's MANIFEST option?

General/Othermediumpartition0.2 min read
Capco
β†’
148

What are the best practices for logging and monitoring bad data?

System Design/Architecturemedium0.3 min read
PWC
β†’
149

What are the limitations of Assert Transformations in complex data flows?

System Design/Architecturemediumpartition0.2 min read
Virtusa
β†’
150

What steps do you take to troubleshoot a slow-running Spark job?

General/Othermediumjoinpartitionspark0.2 min read
Dunnhumby
β†’
151

What strategies do you use to handle network bottlenecks?

General/Othermediumpartitionspark0.2 min read
Virtusa
β†’
152

What would you do if a job misses its SLA? How would you handle such situations?

General/Othermediumpartition0.3 min read
Meesho
β†’
153

What would you do if the files are stored in multiple folders with varying retention policies?

General/Othermediumpartitionspark0.2 min read
Virtusa
β†’
154

Create a Python program to demonstrate the use of set operations (union, intersection).

Python/Codingmediumjoinpython0.1 min read
American Express
β†’
155

Describe Spark's memory management model. How do you handle heap memory overhead issues?

Python/Codingmediumjoinpartitionspark0.2 min read
American Express
β†’
156

Differentiate SORT BY, ORDER BY, DISTRIBUTE BY, and CLUSTER BY

Python/Codingmediumpartition0.2 min read
Matrix
β†’
157

Extended the solution to determine the nth largest element in an array.

Python/Codingmediumpartition0.2 min read
Expedia
β†’
158

GeoPandas - definition and features

Python/Codingmediumjoin0.1 min read
NAB
β†’
159

Grouping and aggregation functions?

Python/Codingmediumpartitionsnowflakewindow0.2 min read
Snowflake
β†’
160

How many cities does each department operate in? List the top 3 departments in terms of the most number of cities. In case of a tie, order by dept_id.

Python/Codingmediumsql0.2 min read
Freight Tiger
β†’

Reading isn't practice. Get AI feedback on your answers.

Type or paste your answer to any of these questions and our AI Coach scores it, highlights gaps, and rewrites it at FAANG quality. Free to try.

Try AI Answer Coach β€” FreeStart a Mock Interview
Previous1...678910...24Next
Categories
All QuestionsSQLSpark / Big DataPython / CodingSystem DesignCloud / ToolsBehavioral
By Company
AmazonGoogleDatabricksSnowflakeMicrosoftNetflixUberTCS
Interview Guides
All GuidesTop SQL QuestionsTop Spark QuestionsTop Python QuestionsTop System DesignSQL Window FunctionsETL QuestionsData Modeling
Products
AI Interview CoachAnswer AnalyzerSQL PlaygroundResume AnalyzerInterview PacksPricing
Company
About UsContact UsAI DisclosureDisclaimerTerms of ServicePrivacy Policy
Β© 2026 DataEngPrep.tech. All rights reserved.
AboutBlogContactDisclaimer