Citi Data Engineer Interview Questions & Answers (2026)
Practice the 39 most asked data engineering questions at Citi. Covers Spark/Big Data, SQL, General/Other and more.
Key Takeaways
- βWhy Citi Tests These Questions
- βTop 5 Most Asked Questions at Citi
- βCategory Breakdown for Citi Interviews
- βHow to Prepare
Why Citi Tests These Questions
Citi is known for rigorous data engineering interviews that focus on practical, production-level knowledge. With 39 questions in our vault, the most common category is Spark/Big Data (16 questions).
Difficulty breakdown: 14 easy, 11 medium, 14 hard. Expect system design and optimization questions at senior levels.
Top 5 Most Asked Questions at Citi
- Q1: What is the difference between repartition and coalesce in Apache Spark?
- Q2: What is the difference between SparkSession and SparkContext in Spark?
- Q3: What is the difference between partitioning and bucketing in Spark, and when would you use bucketing?
- Q4: What strategies can you use to handle skewed data in Spark?
- Q5: What is the difference between Managed and External tables in Hive/Spark?
Category Breakdown for Citi Interviews
- Spark/Big Data: 16 questions
- General/Other: 9 questions
- SQL: 8 questions
- Python/Coding: 3 questions
- System Design/Architecture: 3 questions
How to Prepare
Focus on Spark/Big Data questions first, as they dominate Citi's interview pattern. Practice the top-frequency questions below, then move to adjacent categories. For senior roles, expect 1-2 system design rounds.
Reviewed by Aditya Kumar Β· DataEngPrep Editorial Team
Drafted by the editorial team and signed off by Aditya Kumar, founder and lead editor at DataEngPrep. Questions are sourced from real interviews, initial answers are drafted with AI assistance, and every section is human-edited for technical accuracy, relevance to current FAANG hiring rubrics, and clarity. Articles are reviewed periodically as interview patterns evolve.
Related Articles
Practice These Questions
Think you can answer these questions? Find out in 30 seconds
Paste your answer and get instant AI feedback β see exactly where your answer is weak and how a FAANG-level candidate would respond.