JavaScript is required to use this application. Please enable JavaScript in your browser settings or disable any extensions that may be blocking scripts.
Questions tagged spark · medium
Lead and Lag in SQL Using PySpark DataFrame API
Optimizing Spark Jobs when they take longer than expected
Solve a coding question related to window functions using SQL and PySpark.
Solve a problem using a window function in Spark or SQL.
Teradata to Hadoop migration and handling data with SCD Type 2?
What are Slowly Changing Dimensions (SCD), and how would you implement them for tracking customer data changes?
What factors determine the optimal number of partitions for a large file?
What is dynamic partition pruning, and how does it optimize query execution?
Get the complete 1,800+ question library with detailed, expert-level answers covering SQL, Spark, System Design, Python, Cloud, and Behavioral topics.