Real questions from top companies
Given two dataframes (df1: id, name and df2: id, country, address, city, count), join them, filter for rows where country = 'Singapore', and pivot the output. Sort cities in descending order of population count
Given two tables, calculate the row count for different types of joins (inner, left, right, and full outer)
HAVING vs WHERE - explain
Handle nulls, duplicates, and inconsistent timestamp formats in data.
Hierarchical Employee-Manager Query - navigate hierarchy with head of company as employee_id = 1
How can you automate data insertion into BigQuery using Python?
How can you delete partitions from a table in Hive using a command?
How did you manage a situation where you lacked knowledge for a task?
How do partitioning strategies differ between source and sink?
How do partitions improve query performance in fact tables?
How do quarantine tables ensure data quality in downstream pipelines?
How do these policies affect query performance?
How do tumbling window triggers ensure data consistency in batch processing?
How do you convert 3 rows into one column in SQL?
How do you count occurrences in a column in SQL?
How do you create a new table with the same structure as an existing one?
How do you design a scalable and fault-tolerant data warehouse on a cloud platform?
How do you find duplicates in a table based on one or two columns?
How do you get new records from a table/file without a modified column? Discuss approaches like hashing or row comparison.
How do you grant access for views?
Type or paste your answer to any of these questions and our AI Coach scores it, highlights gaps, and rewrites it at FAANG quality. Free to try.