
Tag: SQL

Showing 51 results

Run optimized geospatial queries with Trino

March 23, 2023

The Trino open source distributed query engine is known as a choice for running ad-hoc analysis where there’s no need to model the data and...

Lie #3 — You’re ready for the AI + ML deep end

February 3, 2023

You’ve hired pedigreed data scientists and engineers, invested in shiny new software, and perhaps even reorganized your entire business, all in the hopes of...

Trino for Large-Scale ETL @ Lyft

January 25, 2023

Lyft operates one of the largest transportation networks in the world. A business like ours depends on data on so many levels. Data relating...

Over 80 Data & Analytics Statistics, Data, Trends, and Facts

December 28, 2022

Most organizations have data and continue to generate and collect it on a daily basis, but have a far more difficult time in getting...

Reliving the Hype: Highlights from Trino Summit 2022

November 18, 2022

Last week in San Francisco was one for the Trino history books. After three years of planning, rescheduling, planning, and rescheduling some more, Starburst...

Introducing Full Query Passthrough For Faster Query Federation

November 15, 2022

Best-in-class SQL query functionality has always been and remains a fundamental principle that defines Starburst’s query engine. With the recent implementation of full query...

Second Edition of Trino: The Definitive Guide

October 5, 2022

Starburst has played a key role in the Trino community for a long time now. We contribute to the success of Trino every day....

4 Key Things You Should Know About Indexing

September 22, 2022

Data indexing radically accelerates query run time and concurrency without the need for massive compute resources. But before expecting indexing to solve all your...

The Difference Between Micro-Partitioning vs. Indexing and a Better Way

September 8, 2022

When optimizing your analytics database performance, one of the most important decisions is to choose how data is stored and accessed. There are two...

Scaling Up: When to Migrate from PostgreSQL to a Data Lake

July 13, 2022

One of the true pillars of the tech revolution, PostgreSQL is an OLTP database designed primarily to handle transactional workloads. The technology has been...

Confessions of a Space Quest League Advocate

July 6, 2022

Mission 2 Wrap and Mission 3 Launch: We all know at least one pandemic puzzler, a devoted crossworder, or a religious wordler who finds...

Employee Perspective: Accelerating Data-Driven Insights in AdTech

June 16, 2022

Before I joined Starburst, I worked in the AdTech industry where companies buy and sell user data for online targeting advertisement campaigns or ML/AI-based...

The Benefit Of Using An Externally-Audited Data Analytics Solution

June 2, 2022

As a business begins to see the challenges of distributed data access, the selection of a query engine becomes critical for business operations. For...

The Past, Present, and Future of Trino

May 24, 2022

Recently, I had the pleasure of chatting with Ravit Jain on his show “The Ravit Show” to discuss the evolution of Trino and where...

ETL vs Interactive Queries: The Case for Both

May 5, 2022

This is Part 1 of a 2-part blog about how Trino can support both interactive and batch use cases. In Part 1, we will...

Faster Query Processing: CPU Time

March 25, 2022

A key engineering responsibility at Starburst is performance enhancement. One is to reduce the amount of time that a CPU has to work...

What A SQL Query Engine Can Do For Big Data

February 16, 2022

Nod with me if you’ve suffered from the following problems with processing and analyzing Big Data via a centralized approach: different query languages, niche...

Top 6 Reasons to Migrate to the Cloud

January 25, 2022

Starburst released the 2021 State of Data market research report, conducted by Enterprise Management Associates (EMA), in collaboration with Red Hat, early last year....

The Right Way to Query Across Data Sources in Tableau (or, The Cross-Database Join Is Not Always Your Friend)

January 13, 2022

Summary: Use the right tool for the right job. Not doing so means the difference between your Tableau viz rendering in seconds vs. minutes...

Achieving Lightning-Fast Analytics on the Salesforce Customer 360

January 6, 2022

Over the past twenty or so years, companies have experienced a Cambrian explosion of where their customer data resides. Cloud and on-premises enterprise applications aim...

Enabling Data Sovereignty with Starburst Stargate

December 29, 2021

In the data analytics and compliance world, data sovereignty is a concept that has our attention. Policy makers suggest that the best way to...

How Data Access Helps with ML/AI Projects

October 29, 2021

Today in the data space, when you peruse technology solutions, it's very difficult to put your finger on just exactly what each firm's product...

The Analytics Engine for Distributed Data

October 1, 2021

The idea of a single source of truth has been around since the beginning of big data. However, over the years, through the data...

Dynamic Filtering: Supporting High Speed Access to Data

September 20, 2021

Analysts are often tasked with deriving insights for business units where the data can span multiple locations. This is increasingly true today when the...

How Assurance Unlocked More Business Value with Starburst

September 9, 2021

By leveraging Starburst, Assurance was able to improve conversion rates, reduce costs, and enable robust modeling. Read the full case study here. ...

Accelerating Data Science with Trino

August 31, 2021

At our Datanova for Data Scientists conference on July 14, I held a discussion with Dain Sundstrom and David Phillips, CTOs of Starburst, about...

Why Performance Matters: Parquet, Delta Lake, Dynamic Filtering

August 26, 2021

My fascination with SQL query performance started quite some time ago and I contributed a paper on efficient processing of data warehousing during my...

Kafka and Starburst: 3 Considerations for Accelerating Time to Value

July 27, 2021

What is Kafka? Apache Kafka was created at LinkedIn and open sourced into the Apache Software Foundation in early 2011. Kafka was developed to...

Data Federation and Data Virtualization Never Worked in the Past But Now it’s Different

July 13, 2021

Thirty years ago it was already commonplace for large businesses to have hundreds, even thousands, of different database instances managing data from the...

The State of Data Analysts

June 28, 2021

The world of data analysis is constantly changing and evolving, and sometimes it can be hard to keep up with. I had the pleasure...

Query Federation Made Simple at Comcast

June 24, 2021

The media and telecommunications provider now known as Comcast began as a regional operator with just five channels and 12,000 customers. Today, Comcast has...

Redefine Your Analytics Without ETL Using Starburst and Amazon EKS

June 14, 2021

As more and more organizations are looking to the cloud to help fulfill their operational and analytics needs, so is the data center...

Data Mesh: The Answer to the Data Warehouse Hypocrisy

March 25, 2021

Note: I start this piece with some technical background that has nothing to do with the data mesh, and is only relevant to data...

Top 10 Reasons to Migrate from OS Presto on EMR to Starburst Enterprise Presto

November 13, 2020

In today’s data architecture economy, there are no shortages of options when it comes to choosing various distributions and deployment strategies for a given...

Presto Turns Eight Years Old!

August 26, 2020

Presto turned eight years old just a few weeks ago, and it's just getting started...

The Death of Apache Drill

August 6, 2020

One of the things that really drew me to and got me excited about Presto over 4 years ago was that it wasn’t tied...

Presto & Data Science: Getting Data Into the Hands of Data Scientists (Faster)

June 26, 2020

A few days ago I read a Gartner report stating that data scientists spend 23% of their time on data collection and preparation. I...

Free Presto Book to Support the Community

April 14, 2020

As you probably know, Starburst is one of the main contributors and sponsors of the Presto open source project and the community around Presto....

Presto at Pinterest

March 12, 2020

Article reposted from Medium with permission from the author, Ashish Singh | Pinterest Engineer, Data Engineering. As a data-driven company, many critical business decisions...

Presto on Kubernetes

August 2, 2019

Kubernetes (K8s) eases the burden and complexity of configuring, deploying, managing, and monitoring containerized applications. We are excited to announce the availability and support...

The 4 Stages to Big Data Nirvana (In the Cloud)

July 18, 2019

Nirvana - a state of perfect happiness; an ideal or idyllic place. In big data “Nirvana” is a wishlist of items: The ability to...

Presto Summit 2019 Recap

June 25, 2019

*If you are looking for the 2019 NYC Presto Summit, event info and registration can be found here.* Presentation Slides | Presentation Videos. The...

How Storage Compute Separation is Changing the Way Enterprises Interact with Their Data

June 4, 2019

I’m sure you know the difference between storage and compute, and why the separation of these two layers is such a critical piece of...

General SQL Features in Presto

January 1, 2019

Welcome to the Advanced SQL Features in Presto series. In this series you are going to cover a set of SQL features that expands...

Querying data in S3 using Presto and Looker

July 10, 2018

With more and more companies using AWS for their many data processing and storage needs, it’s never been easier to query this data with...

Presto Available on AWS Marketplace!

June 19, 2018

Today I am excited to announce the availability of Presto on AWS Marketplace by Starburst. The Presto AWS Marketplace offering is based on...

Presto Memory Connector

May 1, 2018

Originally posted at http://prestodb.rocks/news/presto-memory. There is a highly efficient connector for Presto! It works by storing all data in memory on Presto Worker nodes, which...

Presto Join Enumeration

April 17, 2018

Karol Sobczak, Co-founder and Software Engineer at Starburst. Welcome back to the series of blog posts (check out our previous post!) about Presto's first Cost-Based...

Presto Cost-Based Optimizer rocks the TPC benchmarks!

April 3, 2018

Wojciech Biela, Co-founder at Starburst. Introduction: As mentioned in our previous blog about the Starburst Presto release and its hottest addition - the Cost...

Presto gets EVEN FASTER, with a 10-15x performance boost in upcoming release!

March 20, 2018

Next week, we will be releasing the Starburst Distribution of Presto 195e. Based on prestosql/presto 0.195, Starburst’s 195e will ship with Presto’s first cost-based...

Presto – Next Chapter

December 13, 2017

As you may have learned from our first press release, we have announced the creation of Starburst, a new independent company solely focused on...
