DataEngPrep.tech

Spark/Big Datamediumjoinpartitionpython0.9 min read

What is the difference between narrow and wide transformations in Apache Spark? Explain with examples.

CoforgeDelivery HeroDunnhumbyFragma Data Systems+1

Spark/Big Datamediumpartitionspark2 min read

Explain the difference between Spark's map() and flatMap() transformations.

Spark/Big Datahardjoinoptimizationspark3 min read

How does Spark's Catalyst Optimizer work? Explain its stages.

DunnhumbyFragma Data SystemsHashedIn

Spark/Big Dataeasyspark2 min read

What is the difference between Managed and External tables in Hive/Spark?

CitiDunnhumbyFragma Data Systems

Spark/Big Datamediumjoinsparksql2 min read

Explain the concept of Broadcast Join in Spark. When should it be used?

Python/Codingeasypython2 min read

Explain the difference between shallow copy and deep copy in Python.

Python/Codingeasypython2 min read

Write a Python function to find the first non-repeating character in a string.

General/Otherhardbigqueryetloptimization0.7 min read

Have you worked on Data Warehousing projects?

AareteDunnhumby

General/Otherhardbigqueryetljoin0.7 min read

What is the difference between OLTP and OLAP?

AareteDunnhumbyFragma Data Systems

General/Othermediumjoinsql0.7 min read

What is the difference between SQL and NoSQL databases?

AareteDunnhumbyFragma Data Systems

SQLeasybigquery0.5 min read

Explain Common Table Expressions (CTEs) and their benefits.

SQLmediumjoinpartitionsql0.5 min read

Explain SQL Window Functions with examples.

SQLmediumbigqueryjoinpartition0.5 min read

Explain the use of the MERGE statement in SQL.

SQLmediumjoinsql0.5 min read

How do you handle NULL values in SQL? Mention functions like COALESCE and ISNULL.

SQLhardjoinoptimizationpartition0.6 min read

How do you optimize a long-running SQL query?

SQLmediumpartitionsql0.6 min read

How would you handle duplicate records in an SQL table?

Spark/Big Datahardjoinoptimizationspark0.7 min read

What is Spark's Catalyst Optimizer? Explain its stages.

DunnhumbyFragma Data Systems

Python/Codingeasypython0.7 min read

Write a Python function to find the first non-repeating character in a string.

Delivery HeroDunnhumby

SQLmediumpartition0.6 min read

If manual partitions are created in a Hive data-warehouse table directory, and you query records from those partitions, will you see the data? If not, how can this be fixed?

Dunnhumby