DataEngPrep.tech

Interview Questions

Real questions from top companies

700+ Easy · 450+ Medium · 650+ Hard
1

PySpark Coding Challenge: given a dataset with 4-5 columns, solve a data processing problem on paper

Spark/Big Data · hard
2

PySpark Coding Challenge: Transform input dataset with columns id, dob, name to add age, firstname, lastname

Spark/Big Data · hard
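As a rough illustration of the transformation this question asks for, here is a minimal sketch of the row-level logic in plain Python rather than PySpark, so it runs without a cluster. The ISO `dob` format, the "split on the first space" naming rule, the sample rows, and the helper names `split_name`, `age_on`, and `transform` are all assumptions made for this example, not part of the original question.

```python
from datetime import date


def split_name(full_name):
    """Split on the first space; the remainder is treated as the last name (assumed rule)."""
    first, _, last = full_name.strip().partition(" ")
    return first, last


def age_on(dob, today):
    """Whole years elapsed between dob and today."""
    years = today.year - dob.year
    # Subtract one year if this year's birthday has not happened yet.
    if (today.month, today.day) < (dob.month, dob.day):
        years -= 1
    return years


def transform(rows, today):
    """Return new rows with age, firstname, and lastname added."""
    out = []
    for row in rows:
        dob = date.fromisoformat(row["dob"])  # assumes dob is 'YYYY-MM-DD'
        first, last = split_name(row["name"])
        out.append({**row, "age": age_on(dob, today),
                    "firstname": first, "lastname": last})
    return out


# Hypothetical sample input with the three columns named in the question.
rows = [
    {"id": 1, "dob": "1990-06-15", "name": "Ada Lovelace"},
    {"id": 2, "dob": "2000-12-31", "name": "Grace Hopper"},
]
result = transform(rows, today=date(2024, 6, 14))
```

In PySpark the same steps would usually be expressed as `withColumn` calls, for example using `split` on the name column and `months_between` with `current_date` for the age, but the exact approach depends on how `dob` and `name` are actually formatted in the input.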
3

Read CSV, filter, and write to table using PySpark

Spark/Big Data · hard
4

Running Tasks in Parallel

Spark/Big Data · hard
5

Salting Implementation: provide an example

Spark/Big Data · hard
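One way to picture salting is as a two-stage aggregation: a random salt is attached to each key so that a single hot key is spread over several buckets, partial results are computed per salted key, and the salt is then dropped to combine them. The sketch below shows that idea in plain Python rather than PySpark; the bucket count `NUM_SALTS`, the function name `salted_sum`, and the sample records are assumptions chosen for illustration.

```python
import random
from collections import defaultdict

NUM_SALTS = 4  # assumed number of salt buckets per key


def salted_sum(records, num_salts=NUM_SALTS, seed=0):
    """Sum values per key in two stages, using a random salt to spread hot keys."""
    rng = random.Random(seed)

    # Stage 1: aggregate on (key, salt), so one hot key becomes up to
    # num_salts smaller groups that could be processed independently.
    partial = defaultdict(int)
    for key, value in records:
        salt = rng.randrange(num_salts)
        partial[(key, salt)] += value

    # Stage 2: drop the salt and combine the partial sums per original key.
    totals = defaultdict(int)
    for (key, _salt), subtotal in partial.items():
        totals[key] += subtotal

    return dict(totals), dict(partial)


# Hypothetical skewed input: one key dominates the data.
records = [("hot", 1)] * 1000 + [("cold", 1)] * 10
totals, partial = salted_sum(records)
```

In Spark the first stage would correspond to grouping on the key plus a salt column, and the second to grouping on the key alone; for a skewed join, the smaller side is typically replicated once per salt value so that every salted key still finds its match.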
6

Schema evolution: techniques for handling schema changes in PySpark

Spark/Big Data · hard
7

Setting Dependencies for Tasks in DAG

Spark/Big Data · hard
8

Share your experience in working with big data technologies such as Hadoop, Spark, or AWS EMR. How have you leveraged these tools in your previous projects?

Spark/Big Data · hard
