Describe how to pass data between tasks in Airflow using XComs.
Spark/Big Dataeasy
2
Describe the role of a workflow orchestrator like Airflow in a data pipeline.
Spark/Big Dataeasy
3
How do you handle failures in Airflow tasks, and what retry strategies can you use?
Spark/Big Dataeasy
4
Sqoop command for importing multiple tables
Spark/Big Dataeasy
5
Suppose you have a DAG that ingests data from multiple databases. How would you increase task parallelism in Airflow to improve performance without overloading the system?
Spark/Big Dataeasy
6
Suppose you need to import 5 tables from an external RDBMS (like MySQL) into Hadoop HDFS. Write the Sqoop command
Spark/Big Dataeasy
7
Task Dependencies in DAG
Spark/Big Dataeasy
8
What is a DAG in Apache Airflow, and how is it used for scheduling workflows?
Spark/Big Dataeasy
+8 More Questions with Expert Answers
Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.