Tag: How-To Guides

Showing 22 results

Automated maintenance for Apache Iceberg tables in Starburst Galaxy

February 15, 2023

This post is part of the Iceberg blog series. Read the entire series: Introduction to Apache Iceberg in Trino Iceberg Partitioning and Performance Optimizations...

How To Migrate Queries From Amazon Athena To Starburst Galaxy

November 3, 2022

As a data enthusiast, one of my goals is to understand how organizations attempt to create a trusted and accurate single source of truth...

Build a Data Lakehouse Reporting Structure with dbt and Starburst Galaxy

October 18, 2022

Since my first introduction to dbt, I was intrigued to say the least. Working as a data engineer, I was attempting to manage complicated...

Building Reporting Structures on S3 using Starburst Galaxy and Apache Iceberg

October 4, 2022

AWS S3 has become one of the most widely used storage platforms in the world. Companies store a variety of data on S3 from...

Bleeding edge Java

September 20, 2022

In honor of the release of Java 19, we present this series of blog posts on how to use the latest _bleeding edge_ features...

Practical Security And Policy-Based Governance In A Data Mesh

August 11, 2022

Proponents of Data Mesh understand its many game-changing benefits for large scale organizations. For those who are new to this reimagined framework, Data Mesh...

AWS Dev Day Recap: Data Lake Analytics with Starburst Galaxy

August 5, 2022

On Wednesday, August 3rd, I had the opportunity to share a hands-on lab exploring Data Lake reporting structures with my AWS partner in crime,...

Near Real-Time Ingestion For Trino

August 4, 2022

It is quite popular in today's data climate for modern data architectures to have some sort of batch processing system to move data into...

Part 2: How to Run Batch Processes Using Starburst Galaxy

May 19, 2022

This is Part 2 of a 2-part blog about how Trino can support both interactive and batch use cases. In Part 1, we explored...

Achieving Lightning-Fast Analytics on the Salesforce Customer 360

January 6, 2022

Over the past twenty or so years, companies have experienced a Cambrian explosion of where their customer data resides.Cloud and on-premises enterprise applications aim...

Kafka and Starburst: 3 Considerations for Accelerating Time to Value

July 27, 2021

What is Kafka? Apache Kafka was created at LinkedIn and open sourced into the Apache Software foundation in early 2011. Kafka was developed to...

Rapid Controlled Access to Data with Starburst and Immuta

June 16, 2021

A growing number of enterprises are experiencing the benefits of the Starburst single point of access to all of their data that allows them...

Redefine Your Analytics Without ETL Using Starburst and Amazon EKS

June 14, 2021

+ As more and more organizations are looking to the cloud to help fulfill their operational and analytics needs; so is the data center...

Managing Secrets in Trino

June 3, 2021

Most companies want to follow good security practices. With the number of security breaches coming out daily, it almost feels like a matter of...

A Gentle Introduction to the Hive Connector

February 12, 2021

One of the most confusing aspects when starting with the Hive connector comes from the complex Hive model and overlapping use cases of this...

Presto & Data Science: Getting Data Into the Hands of Data Scientists (Faster)

June 26, 2020

A few days ago I read a Gartner report stating that data scientists spend 23% of their time on data collection and preparation. I...

Presto at Pinterest

March 12, 2020

Article reposted from Medium with permission from the author,  Ashish Singh | Pinterest Engineer, Data Engineering As a data-driven company, many critical business decisions...

Consumption Layers 101

July 12, 2019

The typical big data infrastructure is a Frankenstein’s monster of legacy hardware, cloud connections, and storage environments. Data exists in different silos, in every...

Querying data in S3 using Presto and Looker

July 10, 2018

With more and more companies using AWS for their many data processing and storage needs,  it’s never been easier to query this data with...

Presto Memory Connector

May 1, 2018

Originally posted http://prestodb.rocks/news/presto-memory There is a highly efficient connector for Presto! It works by storing all data in memory on Presto Worker nodes, which...

Presto Join Enumeration

April 17, 2018

Karol Sobczak, Co-founder and Software Engineer at Starburst Welcome back to the series of blog posts (checkout our previous post!) about Presto's first Cost-Based...

Presto Cost-Based Optimizer rocks the TPC benchmarks!

April 3, 2018

Wojciech Biela, Co-founder at Starburst Introduction As mentioned in our previous blog about the Starburst Presto release and its hottest addition - the Cost...

Start Free with
Starburst Galaxy

Up to $500 in usage credits included

  • Query your data lake fast with Starburst's best-in-class MPP SQL query engine
  • Get up and running in less than 5 minutes
  • Easily deploy clusters in AWS, Azure and Google Cloud
For more deployment options:
Download Starburst Enterprise

Please fill in all required fields and ensure you are using a valid email address.