Cookie Notice
This site uses cookies for performance, analytics, personalization and advertising purposes.
For more information about how we use cookies please see our Cookie Policy.
Manage Consent Preferences
These cookies are essential in order to enable you to move around the website and use its features, such as accessing secure areas of the website.
These are analytics cookies that allow us to collect information about how visitors use a website, for instance which pages visitors go to most often, and if they get error messages from web pages. This helps us to improve the way the website works and allows us to test different ideas on the site.
These cookies allow our website to properly function and in particular will allow you to use its more personal features.
These cookies are used by third parties to build a profile of your interests and show you relevant adverts on other sites. You should check the relevant third party website for more information and how to opt out, as described below.
Fully managed in the cloud
Self-managed anywhere
Starburst Galaxy brings the open data lakehouse architecture to life using open file and table formats and an optimized open MPP SQL query engine with OS Trino. Starburst Galaxy is used for both interactive ad-hoc analytics and long-running workloads like batch and ETL/ELT, and offers high scalability and query completion rates even as the amount of data (terabytes of data), query volume, and query complexity increases. It runs federated queries across data lakes, cloud data warehouses, on-premises databases, and relational data management systems like PostgreSQL and MySQL. Galaxy also supports enhanced fault-tolerant execution, smart indexing and caching, Data Products, universal search and schema discovery while truly separating compute and storage between Starburst and your cloud storage.
Databricks, traditionally known for its Apache Spark origins and data science and machine learning capabilities, introduced Databricks SQL, available as a serverless and non-serverless data warehouse in the Databricks Data Intelligence Platform (formerly known as the Lakehouse Platform) to run SQL and BI applications using its proprietary Photon engine. It offers a governance model via a proprietary data catalog, use of open formats (Delta Lake being the preferred for a first-class experience), APIs, and integrations with third party tools. It provides general compute resources for SQL queries, visualizations, and dashboards that are executed against the tables in data in cloud object stores. It also enables find and share insights with the built-in SQL editor, visualizations and dashboards.
“While other solutions, such as Databricks, were considered, none were as seamless and performant as Starburst, the fully supported, production-tested and enterprise-grade distribution of open source Trino.”
— Patrice Linel, Senior Manager of Data Science & Data Engineering
Learn more
“Databricks would work with our data lake platform, but if we chose Databricks, we would have to cancel all the initiatives that we had in our big data platform using AWS. So we are betting on the AWS stack with Starburst and we think we’re going to be successful in this journey.”
— Carlos Pedrosa, Superintendent of Information Technology Governance
Learn More
“After our acquisition, we needed to efficiently integrate the new business. We couldn’t achieve this with Databricks because there was no way to federate across data sources. Starburst is the only SQL engine that provides access to AWS and GCP data, which saves us years in data migrations and thousands of dollars in cost over Databricks. Starburst will change the way companies look to integrate acquisitions, providing near instant ROI.”
— Anonymous, SVP Data & Analytics
“We chose Starburst [over Databricks] because their open-stack approach aligns with our commitment to avoiding vendor lock-in. It also gives us a lower TCO with similar performance and ease of use.”
— Anonymous, Engineering Manager
Don’t take our word for it. Starburst is named #1 for Quality of Support and Ease of Use in G2 Crowd’s Grid Report based on real customer reviews. Additionally, customers said Starburst beat out Databricks in all of these categories:
A simple and intuitive experience includes everything from simple pricing models, ease of governance and management, minimal overhead and time to get started, a simplified customization and configuration experience, and most importantly just easy to use by any SQL user — a PhD should be optional.
Starburst Galaxy
Databricks SQL
Built-in data security
Built-in data security
Automated cluster management
Automated cluster management
Built-in real-time usage monitoring
Built-in real-time usage monitoring
Automated data maintenance
Automated data maintenance
*
GenAI text-to-SQL
GenAI text-to-SQL
*
Get started within 10 minutes from first sign-on
Get started within 10 minutes from first sign-on
Simple pricing
Simple pricing
Comparison based on publicly available information as of November 30, 2023
*In preview. Contact us to learn more.
Empower data teams with the ability to securely use all their data assets, no matter where they live, across data lakes, data warehouses, and databases. With your modern data lake analytics platform easily discover, create, govern, share, and collaborate on curated data sets by connecting your data silos before, during, and after your modernization journey.
Starburst Galaxy
Databricks SQL
Universal search
Universal search
Schema discovery
Schema discovery
Role and attribute-based access controls
Role and attribute-based access controls
Data lineage
Data lineage
Column/row masking
Column/row masking
Automatic end-to-end encryption
Automatic end-to-end encryption
Private link for AWS, Azure, and Google
Private link for AWS, Azure, and Google
*
SSO for client connectivity
SSO for client connectivity
*
Automatic data classification
Automatic data classification
*
Streaming ingest
Streaming ingest
Integration with AWS LakeFormation
Integration with AWS LakeFormation
Supports popular data sources for federation
Supports popular data sources for federation
Supports hybrid data architectures
Supports hybrid data architectures
Optimized connectors for federation
Optimized connectors for federation
First-class federation for multiple data catalogs
First-class federation for multiple data catalogs
Federated data products
Federated data products
Data product sharing
Data product sharing
Time-based access control policies
Time-based access control policies
Data observability
Data observability
Data profiling
Data profiling
Metadata tagging
Metadata tagging
Comparison based on publicly available information as of November 30, 2023
*In preview. Contact us to learn more.
A modern data lake analytics platform with high concurrency puts the control in your hands to ensure performant scalability is available when you need it most while optimizing price-to-performance for all analytics workloads.
Starburst Galaxy
Databricks SQL
High concurrency
High concurrency
Fault tolerant execution (FTE)
Fault tolerant execution (FTE)
Ad-hoc and interactive queries
Ad-hoc and interactive queries
Materialized views
Materialized views
Autoscaling by adding/removing incremental nodes
Autoscaling by adding/removing incremental nodes
Customizable scaling for cost and performance optimization
Customizable scaling for cost and performance optimization
Smart caching
Smart caching
Results and repeated subquery caching
Results and repeated subquery caching
*
Smart indexing
Smart indexing
Index and cache resilience
Index and cache resilience
Comparison based on publicly available information as of November 30, 2023
*In preview. Contact us to learn more.
A modern data lake analytics platform takes the lakehouse architecture beyond the basics of open file and table formats by providing choice in hybrid or cloud environments, more data federation, seamless cross-cloud and cross-region analytics, choice in data catalogs without compromising the user experience and offering an enhanced MPP SQL query engine based on open standards and supported by the largest internet companies in the world.
Starburst Galaxy
Databricks SQL
Run on multiple clouds
Run on multiple clouds
Supports popular open file formats
Supports popular open file formats
Standard ASNI SQL
Standard ASNI SQL
Supports Python
Supports Python
Dataframe API for Python
Dataframe API for Python
*
OS MPP SQL query engine
OS MPP SQL query engine
First-class data federation with first and third-party data catalogs
First-class data federation with first and third-party data catalogs
Natively run SQL on Iceberg, Delta Lake, Hudi, and Hive table formats
Natively run SQL on Iceberg, Delta Lake, Hudi, and Hive table formats
Built-in cross-cloud and cross-region querying
Built-in cross-cloud and cross-region querying
Supports hybrid cloud architecture
Supports hybrid cloud architecture
*
Supports Apache Ranger
Supports Apache Ranger
In platform capability to migrate Hive and Delta tables to Iceberg
In platform capability to migrate Hive and Delta tables to Iceberg
Comparison based on publicly available information as of November 30, 2023
*In preview. Contact us to learn more.
Access and analyze your data with elastic scale and high performance your business demands. Get started with a free Galaxy trial, watch this tutorial on Starburst Galaxy, or contact us.
Unlike traditional SQL warehouses, Databricks SQL is a platform that provides SQL data warehousing capabilities on data stored in a cloud data lake. It is primarily used for data exploration, ad hoc analytics (without the need of data pipelines) and interactive big data analytics. The platform allows users to query data from a limited number of federated data sources using non-optimized connectors and requiring the use of their proprietary data catalog, Unity Catalog, for an optimal data federation experience.
Databricks SQL also offers its own visualization tooling and integration with various visualization BI tools such as Tableau, PowerBI, ThoughtSpot, Looker, and others to make it easier for data analyst to consume the data.
A workspace within the Databricks platform is an environment for data engineers, data science teams and others to access Databricks resources with the option of running a single or multiple workspaces within your Databricks account.
Similar to Starburst, Databricks SQL uses the American National Standards Institute (ANSI) for SQL, which is a common standard for most SQL databases.
Yes. Similar to Starburst Galaxy, within Databricks SQL there is an environment via SQL endpoints for running SQL queries, creating dashboards with visualization tools, and sharing query results.
Yes, similar to Starburst Galaxy, you can write your SQL queries or scripts in any text editor of your choice and then copy and paste them into Databricks SQL or a Databricks notebook for execution.
No, Databricks SQL is not a program. It is a SQL tool within the Databricks Lakehouse platform.
Databricks on Azure or Azure Databricks is the result of a collaboration between Microsoft and Databricks to bring Databricks on as a first party service to Microsoft Azure. The core functionality of Databricks remains the same across the three hyperscale cloud providers but there are some differences in how it is integrated with each cloud provider’s services.
We’ll send you a free download of Starburst, and a Starburst expert will reach out to schedule a call.
© Starburst Data, Inc. Starburst and Starburst Data are registered trademarks of Starburst Data, Inc. All rights reserved. Presto®, the Presto logo, Delta Lake, and the Delta Lake logo are trademarks of LF Projects, LLC
Up to $500 in usage credits included