DataEngPrep.tech
QuestionsPracticeAI CoachDashboardPacksBlog
ProLogin

Interview Questions

Real questions from top companies Β· easy

700+ Easy450+ Medium650+ Hard
All CategoriesBehavioralSpark/Big DataSQLPython/CodingSystem Design/ArchitectureCloud/ToolsGeneral/Othereasymediumhard
661

Describe your approach to managing offsets in Kafka.

Spark/Big Dataeasyspark0.3 min read
Fragma Data Systems
β†’
662

Discuss Delta Logs file format and its significance.

Spark/Big Dataeasy0.4 min read
Hexaware
β†’
663

Discuss the process of moving files in Databricks File System (DBFS).

Spark/Big Dataeasyspark0.3 min read
Capgemini
β†’
664

Executor vs Driver in Spark

Spark/Big Dataeasyspark0.4 min read
Presidio
β†’
665

Explain Bronze/Silver/Gold Layers.

Spark/Big Dataeasy0.4 min read
Altimetrik
β†’
666

Explain your approach to monitoring and logging Spark jobs in AWS. What tools would you use to identify performance bottlenecks?

Spark/Big Dataeasyspark0.6 min read
EPAM
β†’
667

How do you compare the time investment and value of a task?

Spark/Big Dataeasy0.5 min read
Delivery Hero
β†’
668

How do you handle bad data in Databricks?

Spark/Big Dataeasy0.5 min read
PWC
β†’
669

How do you handle failures in Airflow tasks, and what retry strategies can you use?

Spark/Big Dataeasyairflowpython0.5 min read
Citi
β†’
670

How do you handle schema evolution in Spark, especially when reading data from sources like Parquet or Avro?

Spark/Big Dataeasyspark0.5 min read
Coforge
β†’
671

How do you prioritize your tasks in a multi-project environment?

Spark/Big Dataeasy0.5 min read
PLEO
β†’
672

Sqoop Incremental Import?

Spark/Big Dataeasysql0.6 min read
Altimetrik
β†’
673

Sqoop command for importing multiple tables

Spark/Big Dataeasyairflowsql0.5 min read
Meesho
β†’
674

Suppose you have a DAG that ingests data from multiple databases. How would you increase task parallelism in Airflow to improve performance without overloading the system?

Spark/Big Dataeasyairflowsql0.6 min read
Dunnhumby
β†’
675

Suppose you need to import 5 tables from an external RDBMS (like MySQL) into Hadoop HDFS. Write the Sqoop command

Spark/Big Dataeasyairflowsql0.6 min read
Meesho
β†’
676

Task Dependencies in DAG

Spark/Big Dataeasyairflow0.5 min read
Verizon
β†’
677

What are Hadoop commands for Get and Merge?

Spark/Big Dataeasyspark0.4 min read
Altimetrik
β†’
678

What are the advantages of using Dataproc over a traditional Hadoop setup?

Spark/Big Dataeasyspark0.5 min read
Aarete
β†’
679

What are the advantages of using Delta Lake over Parquet?

Spark/Big Dataeasy0.5 min read
Puma
β†’
680

What are the differences between %pip and %conda commands in Databricks?

Spark/Big Dataeasypython0.6 min read
TCS
β†’

Reading isn't practice. Get AI feedback on your answers.

Type or paste your answer to any of these questions and our AI Coach scores it, highlights gaps, and rewrites it at FAANG quality. Free to try.

Try AI Answer Coach β€” FreeStart a Mock Interview
Previous1...3233343536Next
Categories
All QuestionsSQLSpark / Big DataPython / CodingSystem DesignCloud / ToolsBehavioral
By Company
AmazonGoogleDatabricksSnowflakeMicrosoftNetflixUberTCS
Interview Guides
All GuidesTop SQL QuestionsTop Spark QuestionsTop Python QuestionsTop System DesignSQL Window FunctionsETL QuestionsData Modeling
Products
AI Interview CoachAnswer AnalyzerSQL PlaygroundResume AnalyzerInterview PacksPricing
Company
About UsContact UsAI DisclosureDisclaimerTerms of ServicePrivacy Policy
Β© 2026 DataEngPrep.tech. All rights reserved.
AboutBlogContactDisclaimer