Cookie Notice
This site uses cookies for performance, analytics, personalization and advertising purposes.
For more information about how we use cookies please see our Cookie Policy.
Manage Consent Preferences
These cookies are essential in order to enable you to move around the website and use its features, such as accessing secure areas of the website.
These are analytics cookies that allow us to collect information about how visitors use a website, for instance which pages visitors go to most often, and if they get error messages from web pages. This helps us to improve the way the website works and allows us to test different ideas on the site.
These cookies allow our website to properly function and in particular will allow you to use its more personal features.
These cookies are used by third parties to build a profile of your interests and show you relevant adverts on other sites. You should check the relevant third party website for more information and how to opt out, as described below.
Showing 100 results
Data analytics certification program to learn about topics such as data lakes and data lakehouses, and modern table formats like Apache Iceberg.
Data engineers have typically functioned as a central hub for engineering tasks. They work with multiple departments and business units across the enterprise. But as the decentralized, data-product-driven architecture of the data mesh approach becomes more popular, and more organizations find themselves on this decentralization journey, what happens to that centralized data team?
In our last post, we discussed two methods for running geospatial analysis with Trino and the Hive connector and explored a few optimization techniques...
The Trino open source distributed query engine is known as a choice for running ad-hoc analysis where there’s no need to model the data and...
More than any other industry, Financial Services is likely to only partially realize the elusive utopian state of 'the single source of truth' for...
We are eleven days into the new year, and I have spent the past two weeks exerting unreasonable amounts of effort trying to make...
Over the past few weeks, we’ve shared a few examples of what it means to be a data rebel. Hopefully you’ve recognized yourself in...
Accessing data in cloud storage has been an ongoing challenge for analysts, data engineers, and organizations as a whole. Additional work is required to...
Most organizations have data and continue to generate and collect it on a daily basis, but have a far more difficult time in getting...
The shift to cloud-based software-as-a-service platforms is accelerating in just about every tech industry. So it wasn’t much of a surprise to the analytics...
As we’ve gone from Data Mesh theory to practice, organizations have been shifting their focus towards the central tenet of Data Mesh — building...
This post is part of the Iceberg blog series. Read the entire series: Introduction to Apache Iceberg in Trino Iceberg Partitioning and Performance Optimizations...
In this series, we demonstrate how to build data pipelines using dbt and Trino with data directly from your operational systems. They can use...
This post is part of the Iceberg blog series. Read the entire series: Introduction to Apache Iceberg in Trino Iceberg Partitioning and Performance Optimizations...
Last week in San Francisco was one for the Trino history books. After three years of planning, rescheduling, planning, and rescheduling some more, Starburst...
This post is part of the Iceberg blog series. Read the entire series: Introduction to Apache Iceberg in Trino Iceberg Partitioning and Performance Optimizations...
A data lakehouse combines the principles of a data lake and a data warehouse to include the best of both worlds. Data lakehouses are...
This post is part of the Iceberg blog series. Read the entire series: Introduction to Apache Iceberg in Trino Iceberg Partitioning and Performance Optimizations...
I have been in and around data since my days with Microsoft Access, Excel, and SQL Server circa 2000, and was fortunate to witness...
It’s finally here! We are closing in on the final countdown to Trino Summit 2022, and I can feel myself getting more excited with...
Since my first introduction to dbt, I was intrigued to say the least. Working as a data engineer, I was attempting to manage complicated...
Since Datanova: The Data Mesh Summit and our in-person executive discussions on data products and Data Mesh, we’ve been validating the data product approach...
Corporate data is no doubt a valuable asset. Except it’s an open secret that data alone isn’t inherently valuable, nor will it produce valuable...
Starburst has played a key role in the Trino community for a long time now. We contribute to the success of Trino every day....
AWS S3 has become one of the most widely used storage platforms in the world. Companies store a variety of data on S3 from...
Data virtualization revolutionized the data infrastructure space by serving data consumers directly on top of data stores, without the need to move data elsewhere....
In the big data analytics world, enabling analytics on unstructured text is a powerful capability. For that reason, it would be of use that...
Since Datanova: The Data Mesh Summit and our in-person executive discussions on data products and Data Mesh, we’ve been validating the data product approach...
As organizations strive to become more agile, there has been a mass movement jumping headfirst into what is called a security data lake. Gartner...
How to choose the right solution for your big data analytics engine When optimizing your analytics database performance, one of the most important decisions...
Since Datanova: The Data Mesh Summit and our in-person executive discussions on data products and Data Mesh, we’ve been validating the data product approach...
The glory days of SIEM are over. Security teams are not only measured by their ability to collect as much data as possible, but...
Proponents of Data Mesh understand its many game-changing benefits for large scale organizations. For those who are new to this reimagined framework, Data Mesh...
On Wednesday, August 3rd, I had the opportunity to share a hands-on lab exploring Data Lake reporting structures with my AWS partner in crime,...
It is quite popular in today's data climate for modern data architectures to have some sort of batch processing system to move data into...
Metabase can now be used to connect to SEP, Starburst Galaxy, and Trino as a BI tool and client.Metabase excels at providing BI insights...
Next-Gen data management and analytics strategies We’ve all lived it. Heard it. Adapted to it. The next analytics strategy with numerous ‘modern’ technologies to...
One of the true pillars of the tech revolution, PostgreSQL is an OLTP database designed primarily to handle transactional workloads. The technology has been...
Customers who want a single, super fast and easy-to-use solution for both interactive and longer-running data pipeline queries now have a solution: take advantage...
Mission 2 Wrap and Mission 3 Launch We all know at least one pandemic puzzler, a devoted crossworder, or a religious wordler who finds...
I’m excited to announce the acquisition of Varada, a data analytics accelerator, based out of Tel Aviv, Israel. Varada offers a data lake analytics...
Before I joined Starburst, I worked in the AdTech industry where companies buy and sell user data for online targeting advertisement campaigns or ML/AI-based...
About a month into my first job I finished building my first data pipeline ever. I soaked in the “I Made THAT!” moment, and...
Current State of ETL/ELT Extract-transform-load, more commonly known by its street name “ETL”, has been around since the early days of computing. Bringing together...
Best-in-class organizations need fast, reliable data analytics that enable business leadership to identify patterns and key insights that will help them predict the best...
Recently, I had the pleasure of chatting with Ravit Jain on his show “The Ravit Show” to discuss the evolution of Trino and where...
This is Part 2 of a 2-part blog about how Trino can support both interactive and batch use cases. In Part 1, we explored...
dbt labs released version 1.1 of the dbt-core project in late April, 2022. This did not catch the maintainers of the dbt-trino project by...
This is Part 1 of a 2-part blog about how Trino can support both interactive and batch use cases. In Part 1, we will...
The key to success for any company is deriving business value from data in a robust, scalable, and timely fashion. A huge part of...
Calling all data pros! Are you ready for a $20k payday? Yes, you heard it right – you could be walking away with $20,000...
A key engineering responsibility at Starburst is on performance enhancements. One is to reduce the amount of time that a CPU has to work...
This blog was co-authored by Alex Breshears, Product Manager at Starburst In today’s global economy, it’s impossible to understate the importance of being able...
I participated in a panel discussion with Karl Eklund, Principal Architect at Red Hat, and William Schnoeppner, Director of Research and Consulting at EMA...
Nod with me if you’ve suffered from the following problems with processing and analyzing Big Data via a centralized approach: different query languages, niche...
Let this summit be the one that will pull you out of your PJs and into the data products lab with your lab coat...
Start off the year right by registering for our two-day virtual conference, Datanova: The Data Mesh Summit. As of late, with the rise of...
Starburst released the 2021 State of Data market research report, conducted by Enterprise Management Associates (EMA), in collaboration with Red Hat, early last year....
So far, we’ve highlighted a few reasons why you should attend Datanova: The Data Mesh Summit: The Woz and Justin Borgman. The next reason...
Summary Use the right tool for the right job. Not doing so means the difference between your Tableau viz rendering in seconds vs. minutes...
The self-professed “troublemaker” Zhamak Dehghani, who coined Data Mesh will join us for not one but two sessions! ...
The original vision of Starburst was to make querying distributed data as simple, fast, and painless as possible. Starburst Galaxy, our serverless, fully-managed SaaS...
I think of Starburst Stargate as the Lord of the Rings feature. Or the galactic empire feature. In a prior blog post, I introduced...
As companies shift their analytical ecosystems from on-premise to cloud and try to avoid “data lock-in”, we’re noticing some very interesting data patterns. This...
I am increasingly getting asked about the difference between the Data Fabric and the Data Mesh. They are both emerging paradigms designed to solve...
Over the past few years the “modern data stack” has entered the vernacular of the data world, describing a standardized, cloud-based data and analytics...
I’m one of those strange people who has always enjoyed doing performance testing. The thought of spinning up lots of machines to do my...
If you haven’t heard of Trino before, it is a query engine that speaks the language of many genres of databases. As such, Trino...
Data Mesh is based on four central concepts, the second of which is data as a product. In this blog, we’ll explore what that...
Insane in the domain! Insane in the brain! Crazy insane, got no domain! - Cypress Hill, sort of Data Mesh is based on four...
The idea of a single source of truth has been around since the beginning of big data. However, over the years, through the data...
You might have heard about Data Mesh recently, which is a modern approach to managing data and analytics in a distributed, domain-driven fashion. At...
Despite the investments and effort poured into next-generation data storage systems, monolithic, centralized data warehouses and data lakes have failed to provide the line...
Every five years, a small group of leaders in the data management research community get together to do a self assessment --- what are...
Today’s digital world is an expanding frontier of emerging technologies. There are endless innovations, inspired by data, informed by data, enabled by data, and...
By leveraging Starburst, Assurance was able to improve conversion rates, reduce costs, and enable robust modeling. Read the full case study here. ...
My fascination with SQL query performance started quite some time ago and I contributed a paper on efficient processing of data warehousing during my...
As companies shift their analytical ecosystems from on-premise to cloud and try to avoid “data lock-in”, we’re noticing some very interesting data patterns. This...
Kafka was created at LinkedIn and open sourced into the Apache Software foundation in early 2011. It was developed to optimize writes especially for...
Thirty years ago it was already commonplace for large businesses to have hundreds --- even thousands of different database instances managing data from the...
This is the fifth episode in our video series, Starburst Elements, focused around anything and everything Starburst. In this episode, our Product Manager Vishal...
The media and telecommunications provider now known as Comcast began as a regional operator with just five channels and 12,000 customers. Today, Comcast has...
A growing number of enterprises are experiencing the benefits of the Starburst single point of access to all of their data that allows them...
Amazon EKS and Starburst help new users easily adopt, manage, and operate Kubernetes ...
Today we announced Starburst Stargate, the industry’s first gateway for global cross-cloud analytics. I’m excited to share more behind why we built this and...
Welcome back to the Trino on Ice blog series that has so far covered some very interesting high level concepts of the Iceberg model,...
Most companies want to follow good security practices. With the number of security breaches coming out daily, it almost feels like a matter of...
At Starburst, we believe in building optionality into your data architecture & strategy. To us, optionality means building for flexibility so that you don’t...
Welcome back to this blog series discussing the amazing features of Apache Iceberg. In the last two blog posts, we’ve covered a lot of...
This is the fourth episode in our video series, Starburst Elements, focused around anything and everything Starburst. In this episode, our Product Manager Vishal...
In-place table evolution and cloud compatibility with Iceberg ...
This is the third episode in our video series, Starburst Elements, focused around anything and everything Starburst. In this episode, our Product Manager Vishal...
After debuting our blog series on data pandemic stories with a story from Tableau and a perspective from Privacera, we are excited to bring...
We’re excited to debut this blog series ‘Trino on Ice’ with a gentle introduction to Iceberg. Stay tuned for future posts from the Trino...
After debuting our blog series on data pandemic stories with a story from Tableau, we’re excited to bring you a viewpoint from Syed Mahmood,...
Note: I start this piece with some technical background that has nothing to do with the data mesh, and is only relevant to data...
TL;DR: The Hive connector is what you use in Starburst Enterprise for reading data from object storage that is organized according to the rules...
Datanova is just next week. More than 2,000 data and analytics leaders will join us to learn more about how to unlock the value...
We love data engineers at Starburst. They are our people, even when their Starburst Data equivalents try to trick Marketing into pronouncing the data...
Just like Linux, there are multiple derivatives of Presto. Trino is the one maintained by the creators of Presto. “What's in a name? That...
© Starburst Data, Inc. Starburst and Starburst Data are registered trademarks of Starburst Data, Inc. All rights reserved. Presto®, the Presto logo, Delta Lake, and the Delta Lake logo are trademarks of LF Projects, LLC
Up to $500 in usage credits included