DataEngPrep.tech

Interview Questions

Real questions from top companies · easy

700+ Easy · 450+ Medium · 650+ Hard

Describe your approach to managing offsets in Kafka.

Spark/Big Data · easy · spark · 0.3 min read

Fragma Data Systems

Discuss Delta Logs file format and its significance.

Spark/Big Data · easy · 0.4 min read

Discuss the process of moving files in Databricks File System (DBFS).

Spark/Big Data · easy · spark · 0.3 min read

Executor vs Driver in Spark

Spark/Big Data · easy · spark · 0.4 min read

Explain Bronze/Silver/Gold Layers.

Spark/Big Data · easy · 0.4 min read

Explain your approach to monitoring and logging Spark jobs in AWS. What tools would you use to identify performance bottlenecks?

Spark/Big Data · easy · spark · 0.6 min read

How do you compare the time investment and value of a task?

Spark/Big Data · easy · 0.5 min read

How do you handle bad data in Databricks?

Spark/Big Data · easy · 0.5 min read

How do you handle failures in Airflow tasks, and what retry strategies can you use?

Spark/Big Data · easy · airflow · python · 0.5 min read

How do you handle schema evolution in Spark, especially when reading data from sources like Parquet or Avro?

Spark/Big Data · easy · spark · 0.5 min read

How do you prioritize your tasks in a multi-project environment?

Spark/Big Data · easy · 0.5 min read

Sqoop Incremental Import?

Spark/Big Data · easy · sql · 0.6 min read

Sqoop command for importing multiple tables

Spark/Big Data · easy · airflow · sql · 0.5 min read

Suppose you have a DAG that ingests data from multiple databases. How would you increase task parallelism in Airflow to improve performance without overloading the system?

Spark/Big Data · easy · airflow · sql · 0.6 min read

Suppose you need to import 5 tables from an external RDBMS (like MySQL) into Hadoop HDFS. Write the Sqoop command.

Spark/Big Data · easy · airflow · sql · 0.6 min read

Task Dependencies in DAG

Spark/Big Data · easy · airflow · 0.5 min read

What are Hadoop commands for Get and Merge?

Spark/Big Data · easy · spark · 0.4 min read

What are the advantages of using Dataproc over a traditional Hadoop setup?

Spark/Big Data · easy · spark · 0.5 min read

What are the advantages of using Delta Lake over Parquet?

Spark/Big Data · easy · 0.5 min read

What are the differences between %pip and %conda commands in Databricks?

Spark/Big Data · easy · python · 0.6 min read

© 2026 DataEngPrep.tech. All rights reserved.