Apache Airflow

Data Lake Analytics Platform

Starburst activates the data in and around your lake

Our platform includes the capabilities needed to discover, organize, and consume data on a data lake without the need for time-consuming and costly migrations. Trusted by companies like Comcast, Grubhub, and Priceline, Starburst helps companies make better decisions faster.

Learn More

What is Apache Airflow?

Apache Airflow is a widely adopted orchestration engine that allows you to schedule and run complex data pipelines. Airflow provides many plug-and-play operators and hooks to integrate with many third-party services like the Starburst Galaxy engine (Trino).

By integrating Apache Airflow with Starburst Data, you can leverage the powerful scheduling, monitoring, and task execution capabilities of Airflow while utilizing the distributed SQL query capabilities of Starburst Data for efficient data processing.

Continue Learning

How to Use Starburst and Airflow to Create Resilient Data Pipelines

Start Free with
Starburst Galaxy

Up to $500 in usage credits included

  • Query your data lake fast with Starburst's best-in-class MPP SQL query engine
  • Get up and running in less than 5 minutes
  • Easily deploy clusters in AWS, Azure and Google Cloud
For more deployment options:
Download Starburst Enterprise

Please fill in all required fields and ensure you are using a valid email address.