#spark

Questions tagged spark · medium

All easy (120+)medium (130+)hard (410+)

121

What is the difference between partitions and repartitions in Spark, and when do you use each?

Spark/Big Datamedium

122

What is the most common performance bottleneck in Spark jobs, and how would you resolve it?

Spark/Big Datamedium

123

What performance tuning techniques do you apply in both Sqoop and Spark to optimize their execution?

Spark/Big Datamedium

124

What strategies would you use to optimize Spark jobs for both performance and cost on AWS?

Spark/Big Datamedium

125

When would you choose a broadcast join over a shuffle join? Any memory risks?

Spark/Big Datamedium

126

Which Spark property controls the number of shuffle partitions?

Spark/Big Datamedium

127

Write PySpark code to extract data from a CSV and create a table.

Spark/Big Datamedium

128

Write PySpark code to save a DataFrame in Parquet format to an S3 bucket.

Spark/Big Datamedium

+17 More Questions with Expert Answers

Unlock all 1,800+ expert answers, AI mock interviews, resume analyzer, SQL playground, and personalized progress tracking.

Unlock Full Access Try AI Coach Free

Previous 1...5 6 7

Other Tags

#join #partition #python #optimization #sql #window #airflow #etl #bigquery #snowflake #lakehouse

#spark

Questions tagged spark · medium

All easy (120+)medium (130+)hard (410+)

121

What is the difference between partitions and repartitions in Spark, and when do you use each?

Spark/Big Datamedium

122

What is the most common performance bottleneck in Spark jobs, and how would you resolve it?

Spark/Big Datamedium

123

What performance tuning techniques do you apply in both Sqoop and Spark to optimize their execution?

Spark/Big Datamedium

124

What strategies would you use to optimize Spark jobs for both performance and cost on AWS?

Spark/Big Datamedium

125

When would you choose a broadcast join over a shuffle join? Any memory risks?

Spark/Big Datamedium

126

Which Spark property controls the number of shuffle partitions?

Spark/Big Datamedium

127

Write PySpark code to extract data from a CSV and create a table.

Spark/Big Datamedium

128

Write PySpark code to save a DataFrame in Parquet format to an S3 bucket.

Spark/Big Datamedium

+17 More Questions with Expert Answers

Unlock all 1,800+ expert answers, AI mock interviews, resume analyzer, SQL playground, and personalized progress tracking.

Unlock Full Access Try AI Coach Free

Previous 1...5 6 7

Other Tags

#join #partition #python #optimization #sql #window #airflow #etl #bigquery #snowflake #lakehouse