Real interview questions asked at S&P Global. Practice the most frequently asked questions and land your next role.
S&P Global data engineering interviews test your ability across multiple domains. These questions are sourced from real S&P Global interview experiences and sorted by frequency. Practice the ones that matter most.
Convert complex SQL (CTEs, window functions, subqueries) to production-grade PySpark. Discuss when to use spark.sql() vs. DataFrame API, and the implications for testability, partitioning, and execution predictability.
What is the size of the teams I've worked with and how we handled sprints during the project?
Why are you considering leaving your current company?
Given the input string "AAABBBCCCDDDAAA," compress it to output "A3B3C3D3A3."
Explain the differences between Redshift and Snowflake, and how I've used them in previous projects.
Explain the scalability, performance, and cost-efficiency of both Redshift and Snowflake in different use cases.
Write a query to find the 5th highest salary in an employee table and calculate the number of employees whose salary is greater than that of their manager.
Explain how I handle performance optimizations, scheduling tasks, and monitoring DAGs in Airflow.
Provide specific examples of challenges faced with PySpark and SQL and solutions implemented.
How does data flow through the system? From ingestion to processing and storage?
Download the complete interview prep bundle with expert answers. Study offline, on your commute, anywhere.