Design a fault-tolerant Spark Streaming checkpoint strategy: what to persist, recovery semantics, and cost/scalability trade-offs with checkpoint frequency.
Spark/Big Datahard
3
Given a streaming dataset from Kafka, how would you ingest the data in real-time using Spark?
Spark/Big Datahard
4
Describe the ZS projects you worked on
General/Otherhard
5
Can you explain the concept of polymorphism and inheritance in Java with examples?
Python/Codinghard
6
Design a Custom API that can query a backend server and return customer data such as the number of orders placed by a user based on their user ID
SQLhard
7
Design the data model for an ETL pipeline that ingests data from a database and loads it into Snowflake
SQLhard
8
After cleaning, how would you store the transformed data into Delta Lake?
Spark/Big Datahard
+20 More Questions with Expert Answers
Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.