DataEngPrep.tech

Interview Questions

Real questions from top companies · medium

700+ Easy · 450+ Medium · 650+ Hard

Write a PySpark script to process data stored in Delta format and transform it into Parquet.

Spark/Big Data · medium · partition, spark · 0.7 min read

Write a PySpark script to read a CSV file, filter rows where the age column is less than 18, and write the result to a new CSV file.

Spark/Big Data · medium · partition, spark · 0.6 min read

Write a complete PySpark program from import statements to the stop statement, covering transformations and actions.

Spark/Big Data · medium · join, partition, python · 0.6 min read

Write a transformation in PySpark to join and clean multiple raw input sources.

Spark/Big Data · medium · join, partition, python · 0.7 min read

Write code to read data from Delta Lake in S3 and perform an upsert based on the primary key.

Spark/Big Data · medium · partition, spark · 0.6 min read

Write maintainable, efficient Pandas or PySpark code.

Spark/Big Data · medium · join, partition, python · 0.6 min read

Your Kafka producer schema has changed, and the new data includes additional fields. How would you ensure backward compatibility using Schema Registry while consuming data from the same topic?

Spark/Big Data · medium · partition · 0.6 min read

Z-Ordering: use cases for partitioned Delta tables.

Spark/Big Data · medium · join, partition · 0.7 min read

How do you ensure the scalability of a data pipeline handling rapidly growing data volumes?

System Design/Architecture · medium · partition, snowflake, spark · 2.6 min read

How do you handle pipeline failures or delays?

System Design/Architecture · medium · airflow, window · 2.1 min read

© 2026 DataEngPrep.tech. All rights reserved.
