Intelligent Scaling in Amazon Redshift

Published in SIGMOD 2024, 2024

Vikram Nathan, Vikramank Singh, Zhengchun Liu, Mohammad Rahman, Andreas Kipf, Dominik Horn, Davide Pagano, Gaurav Saxena, Balakrishnan (Murali) Narayanaswamy, Tim Kraska

Cloud-based data warehouses are built to be easy to use, requiring minimal intervention from customers as their workloads scale. However, there are still many dimensions of a workload that they do not scale with automatically. For example, in cloud-managed clusters, large ad-hoc queries and ETL workloads must use the same cluster size provisioned for the rest of the workload, and warehouse size does not automatically grow as the underlying data grows in size, causing queries to slow down.In this paper,we describe SageMode, the latest collection of AI-powered scaling and optimization techniques in Amazon Redshift, released in preview at re:Invent 2023, which enable it to scale both vertically and horizontally to adapt to all types of workload variability. Sage Mode dynamically provisions compute resources to run heavy queries efficiently and automatically optimizes warehouse size for the customer’s workload, even as it shifts over time. We show that, depending on the workload, Sage Mode improves either cost or average query execution time by up to 7.6× and 14.2×, respectively, over existing baselines.

Download paper here