General questions from Citi data engineering interviews.
These 9 curated questions are sourced from Citi data engineering interviews, and each includes an expert-level answer. The set leans toward fundamentals, with 7 easy, 2 medium, and no hard questions, which makes it a good way to build confidence before tackling advanced topics. Answers average about 1 minute of reading; plan roughly 1 hour to work through the full set thoughtfully.
The most frequently tagged topics in this set are Python (3), Spark (2), Airflow (2), partition (1), and window (1). Focusing on these topics will give you the highest return on your preparation time.
Start with the easy questions to warm up and solidify fundamentals. Medium-difficulty questions form the bulk of real interviews, so spend the most time there and practice explaining your reasoning out loud. For each question, try answering before reading the solution.
Agile methodologies used?
An existing job suddenly starts running longer: how do you analyze the issue?
How is Oozie called?
How many Oozie workflow files are used?
Shell commands for renaming a file?
Shell: change permissions?
Shell: command to check processes running in the background?
Using shell, how to find the difference between two files?
What type of wrapper is used, or which language is used?