Easy-level cloud & tools questions from real data engineering interviews.
These easy cloud & tools questions are selected from real interviews at top companies. Each question includes a detailed expert answer and pro tip to help you nail your interview.
What are Airflow Operators? Give examples.
Explain the difference between Azure Data Factory (ADF) and Databricks.
How do you handle data security and compliance in a cloud environment?
What is Azure Data Factory (ADF), and what are its main components?
What is the role of the Integration Runtime (IR) in ADF?
API calling with Airflow?
Airflow operators, hooks, and scheduler functionality?
Azure Functions vs. Logic Apps?
Can you explain your experience with Docker and Kubernetes?
Can you explain your experience with Jenkins in your project?
Cloud Composer Overview
Compare ADF vs. Databricks.
Data Factory vs. Databricks: When to use which?
Describe a real-world use case for using Step Functions with Lambda in a data workflow.
Describe a scenario where AWS Data Pipeline is preferred over Glue. Why?
Describe an AWS EC2 instance and how IAM roles/policies enhance security.
Describe how to secure sensitive data in cloud storage solutions.
Describe how to set up retries and timeout for tasks in Cloud Composer.
Describe how you deploy code to a production environment using Jenkins
Describe how you would use AWS Glue to schedule and manage Spark jobs.
Describe step scaling policies vs. target tracking policies in AWS Auto Scaling.
Describe the process and use cases of implementing Azure Data Factory pipelines.
Describe using Step Functions to handle retries and error notifications.
Difference between linked services and datasets in ADF.
Differentiate between global and local variables in ADF.
Discuss how versioning works in S3 and its use cases, such as data recovery and auditing.
Discuss the key differences between AWS Glue, Lambda, and Data Pipeline for orchestrating data workflows.
Discuss versioning in S3.
Docker - purpose and handling dependencies
Error Handling in ADF?
Explain AWS Lake Formation and its benefits.
Explain GetMetadata, ForEach, and Copy Data in Azure Data Factory.
Explain Microsoft Fabric and its use in data integration.
Explain Step Functions for orchestration of workflows.
Explain a linked service and how to create one.
Explain how Access Control Lists (ACLs) can affect IAM role permissions.
Explain how Infrastructure as Code (IaC) works in AWS and its advantages
Explain how Step Functions integrate with other AWS services.
Explain how you would configure an S3 bucket policy to allow access only from a specific EC2 instance
Explain linked services and how they are created.
Explain the Terraform lifecycle for deploying a new cluster on AWS
Explain the difference between S3 One Zone-IA and S3 Standard-IA.
Explain the difference between Service Principal and Managed Identity in Azure.
Explain the differences between Azure IR, Self-hosted IR, and Azure-SSIS IR
Explain the differences between Azure SQL Database, Azure SQL Managed Instance, and Azure Synapse.
Explain the role of Airflow DAGs in Cloud Composer.
Explain the use of Web Activity in ADF.
Explain using IAM roles for secure cross-account access to an S3 bucket.
Explain when you would use Glue instead of Lambda for a data ingestion use case.
Fabric dataflows vs. ADF dataflows
Fabric pipelines vs. ADF pipelines
GCP Authentication with Jenkins
How Airflow operates in a Kubernetes environment
How Airflow stores logs and the role of its backend database
How are Logic Apps used in ADF projects?
How do Logic Apps enhance notification workflows for monitoring pipelines?
How do you copy all files from one source path to target in ADF?
How do you delete files older than 30 days using ADF?
How do you handle API rate limits in ADF?
How do you monitor and log data pipelines in AWS?
Download the complete interview prep bundle with expert answers. Study offline, on your commute, anywhere.