Interview questions
Preparing for a data engineering interview at Fragma Data Systems? This page contains 65 real interview questions sourced from verified Fragma Data Systems interview experiences. Questions are sorted by frequency, with the most frequently asked ones first.
Fragma Data Systems data engineering interviews typically focus on Spark/Big Data, SQL, and System Design/Architecture. The interview bar skews toward harder problems (31 hard vs. 19 easy), suggesting emphasis on depth and system-level thinking.
For each question, attempt your own answer first, then compare it with the expert solution.
What is the difference between repartition and coalesce in Apache Spark?
What is the difference between narrow and wide transformations in Apache Spark? Explain with examples.
What are your salary expectations for this role?
Describe the difference between Spark RDDs, DataFrames, and Datasets.
Explain the difference between Spark's map() and flatMap() transformations.
How does Spark's Catalyst Optimizer work? Explain its stages.
What is the difference between Managed and External tables in Hive/Spark?
Explain the concept of Broadcast Join in Spark. When should it be used?
How do you optimize Spark jobs for better performance? Mention at least 5 techniques.
What is the difference between a list and a tuple in Python?
Explain the difference between shallow copy and deep copy in Python.
Write a Python function to find the first non-repeating character in a string.
What are decorators in Python, and how do they work?
Explain the difference between *args and **kwargs in Python.
Why do you want to join this company?
Describe the data pipeline architecture you've worked with.
What is the difference between OLTP and OLAP?
What is the difference between SQL and NoSQL databases?
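For the coding question above on finding the first non-repeating character in a string, one possible answer is sketched below. It is a minimal sketch, not an official solution: the function name first_non_repeating_char is hypothetical, and it assumes the comparison is case-sensitive and that None is an acceptable result when every character repeats.

```python
from collections import Counter
from typing import Optional


def first_non_repeating_char(text: str) -> Optional[str]:
    """Return the first character that occurs exactly once, or None.

    Assumption: matching is case-sensitive, so 'A' and 'a' are distinct.
    """
    # First pass: count how many times each character appears.
    counts = Counter(text)

    # Second pass: scan in the original order and return the first
    # character whose count is exactly one.
    for ch in text:
        if counts[ch] == 1:
            return ch

    # Every character repeats (or the string is empty).
    return None


if __name__ == "__main__":
    print(first_non_repeating_char("swiss"))
    print(first_non_repeating_char("aabb"))
```

The two-pass approach runs in O(n) time and uses extra space proportional to the number of distinct characters. In an interview it is worth stating the case-sensitivity and empty-input assumptions out loud before writing the code.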