Starburst vs. Athena

Starburst Galaxy offers up to 10.2x faster SQL at a fraction of the cost while providing a simple, open, and highly scalable end-to-end data and analytics platform to power your open data lakehouse.

What is Starburst Galaxy?

Starburst Galaxy is a price-performant, fully managed, multi-cloud data and analytics platform powered by Trino, a leading open-source distributed MPP SQL query engine. Starburst Galaxy is used for both interactive ad-hoc analytics and long-running workloads like batch and ETL/ELT, and offers high scalability and query completion rates even as the amount of data, query volume, and query complexity increases. The service runs federated queries across data lakes, cloud data warehouses, on-premises databases, and relational data management systems like PostgreSQL and MySQL. Galaxy also supports fault-tolerant execution, smart indexing and caching, Data Products, and universal search and schema discovery.

What is Amazon Athena?

Amazon Athena, available in serverless and dedicated versions, is a query service that analyzes data in Amazon Web Services (primarily Amazon S3) using standard SQL for ad-hoc analytics. Amazon Athena serverless has no infrastructure for customers to manage, and they only pay for queries that run. Amazon Athena was originally built on a fork of Presto (PrestoDB version .217), originally released in January 2019.

Starburst is a Leader in Enterprise Big Data Analytics

Don’t take our word for it. Starburst is named #1 for Quality of Support and Ease of Use in G2 Crowd’s Grid Report based on real customer reviews. Additionally, customers said this about Starburst:

100% of users rated Starburst 4+ stars
100% of users believe Starburst is headed in the right direction
96% meets users requirements
93% of users would recommend

G2 Winter Report

Simplicity

Going beyond key platform governance and management capabilities, a modern data and analytics platform empowers data teams with easy-to-use functionality that increases productivity without adding complexity. It allows businesses to use a range of existing investments in just a few clicks. It enables data teams to easily discover, create, govern, analyze and share federated data products from distributed data sets across the organization.

Starburst Galaxy

Amazon Athena (Serverless)

Automated AWS compute plane set-up

Automated data maintenance

Limited, only for partition evolution and table stat collection

Multi-cloud platform

Built-in data security

Requires use of Lakeformation

Data Products

Automated cluster management

Built-in real-time usage monitoring

Built-in query scheduler

Requires Lambda functions

Built-in Natural Language Processing

Automated data lake optimization

Enables automated data maintenance across data compaction, profiling and statistics, vacuuming, and data retention

Predictable pricing

Comparison based on publicly available information as of July 8, 2024. * In preview. Contact us to learn more.

Access

True data access empowers organizations with the ability to use all their data, no matter where it lives, across data lakes, data warehouses, and databases while having confidence in security and governance controls. True access is about meeting business needs on time while adhering to regulatory data sovereignty requirements. Your open lakehouse should free your data sources for analytics and AI, not confine them in another way.

Starburst Galaxy

Amazon Athena (Serverles)

Cloud data federation

On-premise data federation

AWS service account

Time-based policies

RBAC

ABAC

Column/Row masking

Does not offer dynamic data masking or attribute based.

SSO via AWS IAM, Okta, Azure AD, and Google

Universal Search and schema discovery

Limited to only metadata stored in AWS Glue Data Catalog. Lacks automatic schema evolution, and no advanced search features

Uses Trino connectors for federation

Athena uses data source connectors that run on AWS Lambda to run federated queries.

In platform universal search and schema discovery

Glue crawler available at a additional charge

Optimized first party connectors - parallelism, cached views, dynamic filtering, security, and authentication

Query sharing

Data Products sharing

Data profiling

Data lineage

Streaming ingest

Must use Amazon MSK for streaming data, Amazon Athena can then be used to query that data

Comparison based on publicly available information as of July 8, 2024. * In preview. Contact us to learn more.

Scalability

Internet scale matters in an internet-powered world but not every workload needs that power and performance. Your open data lakehouse, powers modern data and analytics and puts control of performance and costs in your hands. It ensures high-performance scalability is available at a click of a button or automatically when you need it most while optimizing price-to-performance for all analytics workloads. It also instills confidences that queries will execute as scheduled, even at high concurrencies.

Starburst Galaxy

Amazon Athena (Serverless)

Works with S3 Express One Zone

Ad-hoc and interactive queries

Results and repeated subquery caching

High concurrency

Limited with default of 20 concurrent queries per account per region

Control over concurrency and prioritization

Limited through query queuing

Fault Tolerant Execution

Starburst offers Enhanced FTE above FTE in OS Trino and lets you run queries up to ~60 TB of query memory

See Amazon Athena docs under limitations

Built-in data catalog

Autoscales by adding more nodes per cluster

Adds more clusters

Customizable scaling for cost and performance optimization

Consistently executes long-running batch queries

Smart indexing and caching

Fine-grained resource management

Comparison based on publicly available information as of July 8, 2024. * In preview. Contact us to learn more.

Optionality

Open file and table formats are table stakes in providing optionality. Your open lakehouse goes beyond the fundamentals to ensure your business has full control over your data by accessing data where it lives across hybrid and multi-cloud data architectures, by allowing choice in cloud providers, security, and BI tools, and ensuring expert Trino support is available if and when your teams need it most.

Starburst Galaxy

Amazon Athena (Serverles)

OS Trino query engine

Supports popular open file formats

Supports Python

Supports hybrid and cloud data architectures

Supports data catalogs beyond AWS Glue

Runs on multiple clouds

Expert in-house Trino support

Natively run SQL on Iceberg, Delta Lake, Hudi, and Hive table formats

Only supports Iceberg, Delta Lake, and Hive

In platform capability to migrate Hive to Delta or Iceberg tables

Comparison based on publicly available information as of July 8, 2024. * In preview. Contact us to learn more.

Athena laid our foundation, but growth demands prompted a shift to Starburst. With its Warp Speed indexing and caching, costs were cut by 70%, seamlessly aligning with our expansion. Starburst not only caters to our growth but elevates performance and optimization in our data engineering landscape.

Pankaj Arora

Associate Director of Data Engineering, Junglee Games

Learn Morechevron_right

The bottom line is that Starburst Galaxy is a huge force multiplier for us. Based on my experience in previous roles, I’ve been able to accomplish what would’ve taken two to three engineers in half the time and one tenth of the cost [compared to Athena].

Director of Software and Engineering, Cybersecurity Solutions Provider

Learn Morechevron_right

Fortune 100 Cloud Computing Provider

With Starburst, we can maximize the value of our data. We are now able to run queries on tables with terabytes of data in just a few seconds.

Staff Engineer, A Fortune 100 Cloud Computing Company

Learn Morechevron_right

We were using data in the way we could. It was getting more expensive, slower, and feeble. We had to change our approach and look for other ways of enabling our users without infrastructure penalties. We were over-run by the limitations of our latest solution [Athena]... Starburst gives us a single platform to explore more data through connectivity, maintain data quality and governance, and provide the data to all of our employees using their visualization tools of choice.

André Gortari

Data Engineering Manager at Banco Inter

Learn Morechevron_right

Free test drive | Watch | Contact us

Access and analyze your data with elastic scale and high performance your business demands. Take Starburst Galaxy for a free test drive, watch the on-demand demo (no form fill needed), or contact us.

Start your Galaxy Trial

More resources

2024 GigaOm Radar- Data Lakes and Lakehouses

How To Migrate Queries From Amazon Athena To Starburst Galaxy

Hive vs. Iceberg

Start for Free with Starburst

Up to $500 in usage credits included

Discover
Easily search across data sources and clouds to find the data you need.
Govern
Streamline data governance with built-in RBAC and ABAC.
Analyze
Run internet-scale workloads with the power of Trino.
Fast
Accelerate queries with smart indexing and caching technologies like Warp Speed.