#etl

Questions tagged etl · hard

All easy (50+)medium (10+)hard (60+)

Design an ETL pipeline using Kafka and Spark Streaming

Spark/Big Datahard

Difference between Presto vs. Spark underlying architecture

Spark/Big Datahard

Explain Hive, its purpose, and its default metadata storage.

Spark/Big Datahard

Explain how Glue's Spark-based architecture handles data parallelism.

Spark/Big Datahard

How do you set up CI/CD for a PySpark ETL workflow?

Spark/Big Datahard

How does Databricks create clusters for running Spark jobs?

Spark/Big Datahard

How would you optimize your Spark Streaming ETL pipeline for high throughput and low latency?

Spark/Big Datahard

List all the technologies you have worked on in your project (e.g., Spark, Hadoop, Hive, Databricks).

Spark/Big Datahard

+20 More Questions with Expert Answers

Unlock all 1,800+ expert answers, AI mock interviews, resume analyzer, SQL playground, and personalized progress tracking.

Unlock Full Access Try AI Coach Free

Previous 1 2 3 4 Next

Other Tags

#join #partition #python #spark #optimization #sql #window #airflow #bigquery #snowflake #lakehouse

#etl

Questions tagged etl · hard

All easy (50+)medium (10+)hard (60+)

Design an ETL pipeline using Kafka and Spark Streaming

Spark/Big Datahard

Difference between Presto vs. Spark underlying architecture

Spark/Big Datahard

Explain Hive, its purpose, and its default metadata storage.

Spark/Big Datahard

Explain how Glue's Spark-based architecture handles data parallelism.

Spark/Big Datahard

How do you set up CI/CD for a PySpark ETL workflow?

Spark/Big Datahard

How does Databricks create clusters for running Spark jobs?

Spark/Big Datahard

How would you optimize your Spark Streaming ETL pipeline for high throughput and low latency?

Spark/Big Datahard

List all the technologies you have worked on in your project (e.g., Spark, Hadoop, Hive, Databricks).

Spark/Big Datahard

+20 More Questions with Expert Answers

Unlock all 1,800+ expert answers, AI mock interviews, resume analyzer, SQL playground, and personalized progress tracking.

Unlock Full Access Try AI Coach Free

Previous 1 2 3 4 Next

Other Tags

#join #partition #python #spark #optimization #sql #window #airflow #bigquery #snowflake #lakehouse