JavaScript is required to use this application. Please enable JavaScript in your browser settings or disable any extensions that may be blocking scripts.
Questions tagged spark
How do you pass global variables between pipelines?
How do you use dependency tracing to identify root causes in pipeline failures?
How does HDFS handle fault tolerance?
How does Presto fetch data from a data catalog?
How does Spark handle distributed computing, and what challenges have you faced while working on distributed systems?
How does data flow through the system? From ingestion to processing and storage?
How to adapt the same pipeline to a cloud environment?
How to capture data lineage for Spark code, using a DataHub-based example?
Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.