Tag: Data Pipelines

Showing 18 results

How to use Starburst and Airflow to create resilient data pipelines

July 25, 2023

Build resilient data pipelines with Airflow and Starburst Galaxy (the fastest and easiest way to use Trino) by leveraging Docker

Starburst data lake certification and training

July 24, 2023

Data analytics certification program to learn about topics such as data lakes and data lakehouses, and modern table formats like Apache Iceberg.

Data pipelines and data lakes: Transforming raw data into actionable insights

July 20, 2023

ETL operates as the engine behind the data pipeline process, moving data from a raw state to a consumable one. Let’s unpack the way in which this typically operates in a modern data lake or data lakehouse. Later, we’ll take a tour to see how Starburst Galaxy fits in this picture and how it can be used to construct the Land, Structure and Consume layers typical of a modern data lake.

How fast access to data and quality ML code can enable competitive differentiation and innovation

February 16, 2023

2022 ended with many successful AI models being deployed, including OpenAI’s ChatGPT. There’s no doubt that there will be plenty more successes in 2023....

Trino for Large-Scale ETL @ Lyft

January 25, 2023

Lyft operates one of the largest transportation networks in the world. A business like ours depends on data on so many levels. Data relating...

Over 80 Data & Analytics Statistics, Data, Trends, and Facts

December 28, 2022

Most organizations have data and continue to generate and collect it on a daily basis, but have a far more difficult time in getting...

Building lakehouse with dbt and Trino

November 30, 2022

In this series, we demonstrate how to build data pipelines using dbt and Trino with data directly from your operational systems. They can use...

Reliving the Hype: Highlights from Trino Summit 2022

November 18, 2022

Last week in San Francisco was one for the Trino history books. After three years of planning, rescheduling, planning, and rescheduling some more, Starburst...

Second Edition of Trino: The Definitive Guide

October 5, 2022

Starburst has played a key role in the Trino community for a long time now. We contribute  to the success of Trino every day....

Building Reporting Structures on S3 using Starburst Galaxy and Apache Iceberg

October 4, 2022

AWS S3 has become one of the most widely used storage platforms in the world. Companies store a variety of data on S3 from...

Rethinking SIEM Solutions

September 13, 2022

As organizations strive to become more agile, there has been a mass movement jumping headfirst into what is called a security data lake. Gartner...

A Better Solution For Managing and Maintaining Data Pipelines, Now In Public Preview

July 6, 2022

Customers who want a single, super fast and easy-to-use solution for both interactive and longer-running data pipeline queries now have a solution: take advantage...

Employee Perspective: Accelerating Data-Driven Insights in AdTech

June 16, 2022

Before I joined Starburst, I worked in the AdTech industry where companies buy and sell user data for online targeting advertisement campaigns or ML/AI-based...

Transforming Your Data Pipelines with Starburst

June 9, 2022

Current State of ETL/ELT Extract-transform-load, more commonly known by its street name “ETL”, has been around since the early days of computing. Bringing together...

New release of the dbt-trino adapter

May 9, 2022

dbt labs released version 1.1 of the dbt-core project in late April, 2022. This did not catch the maintainers of the dbt-trino project by...

ETL vs Interactive Queries: The Case for Both

May 5, 2022

This is Part 1 of a 2-part blog about how Trino can support both interactive and batch use cases.  In Part 1, we will...

Redefine Your Analytics Without ETL Using Starburst and Amazon EKS

June 14, 2021

+ As more and more organizations are looking to the cloud to help fulfill their operational and analytics needs; so is the data center...

Presto Memory Connector

May 1, 2018

Originally posted http://prestodb.rocks/news/presto-memory There is a highly efficient connector for Presto! It works by storing all data in memory on Presto Worker nodes, which...

Start for Free with Starburst Galaxy

Up to $500 in usage credits included

Please fill in all required fields and ensure you are using a valid email address.

Start Free with
Starburst Galaxy

Up to $500 in usage credits included

  • Query your data lake fast with Starburst's best-in-class MPP SQL query engine
  • Get up and running in less than 5 minutes
  • Easily deploy clusters in AWS, Azure and Google Cloud
For more deployment options:
Download Starburst Enterprise

Please fill in all required fields and ensure you are using a valid email address.