Interview questions · easy
What is the difference between Managed and External tables in Hive/Spark?
Agile methodologies used?
How is Oozie called?
Oozie workflow files (how many used)?
Shell: change permissions?
Shell: command to check processes running in the background?
Using shell, how to find the difference between two files?
What type of wrapper is used, or which language is used?
Amazon Deequ usage and what sort of quality checks are done using it?
Shell: how to run jobs/scripts in the background?
How to view Oozie jobs?
Describe how to pass data between tasks in Airflow using XComs.
How do you handle failures in Airflow tasks, and what retry strategies can you use?
What is a DAG in Apache Airflow, and how is it used for scheduling workflows?
Type or paste your answer to any of these questions and our AI Coach scores it, highlights gaps, and rewrites it at FAANG quality. Free to try.