Questions tagged spark
Explain the role of DAGs (Directed Acyclic Graphs) in Spark.
Explain your approach to monitoring and logging Spark jobs in AWS. What tools would you use to identify performance bottlenecks?
Explain your choice of streaming framework (Kafka, Spark Streaming, etc.).
Fault Tolerance in Spark vs. Hadoop?
Given a DataFrame with columns id and name, add a new column department: assign HR if id < 100, and admin if id >= 100 and id < 200.
Handling Skewness in Data - salting, broadcast join
Handling custom data types in Spark
Have you worked with UDFs in Spark? When do you use them, and how do they differ from built-in functions?