JavaScript is required to use this application. Please enable JavaScript in your browser settings or disable any extensions that may be blocking scripts.
Data engineering interview questions · medium
What work is done by the executor memory in Spark?
When and how do you use Broadcast Join?
Write a Python script to find the count of each word in a text file using Spark.
Write the PySpark code to find the second highest salary in each department.
Accumulators - use as shared variable for write-only operations
Broadcast Joins and Shuffle Merge Joins?
Broadcast join - how it optimizes joins
Can you explain the concept of mappers in Spark, and how are they used in data transformations?
Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.