Scaling With Presto on Spark

This blog was co-written with Shradha Ambekar, Staff Software Engineer at Intuit and Ariel Weisberg, Software Engineer at Facebook.

Overview

Presto was originally designed to run interactive queries against data warehouses, but now it has evolved into a unified SQL engine on top of open data lake analytics for both interactive and batch workloads. Popular workloads on data lakes include: