SQL questions from American Express data engineering interviews.
These sql questions are sourced from American Express data engineering interviews. Each includes an expert-level answer.
Describe a scenario where you used Databricks for real-time data processing.
Describe a cross-team data project where you had to align architectural boundaries, ownership, and SLAs. How did you handle conflicting priorities, technical debt, and the scalability of communication as the number of stakeholders grew?
Implement a recursive query for hierarchy (employee-manager). Explain the termination guarantees, depth limits, and when a recursive CTE becomes a scalability bottleneck. What alternatives exist for graph-scale hierarchies in Spark or a data lake?
Explain bloom filters in Spark: how they reduce I/O and when they introduce false positives that hurt performance. What are the scalability and cost implications of enabling dynamic partition pruning and bloom filter pushdown at petabyte scale?
Given a table of sales data, use window functions to calculate a running total.
How do you handle schema evolution in data lakes or data warehouses?
How would you optimize a query with multiple joins and subqueries?
Write a query to find the first number repeating consecutively three times in a sequence.
Download the complete interview prep bundle with expert answers. Study offline, on your commute, anywhere.