Showing 12 of 232 results
Governing data products at scale: Promoting collaboration between producers and consumers
July 25, 2024
At the recent Governing data products to scale session, Laurent Dresse, the Chief Evangelist at DataGalaxy, promotes collaboration between data product producers and consumers. ...
Upgrading your SQL engine: The first migration pathway from Hadoop to Starburst
July 16, 2024
Enterprises face significant challenges with their Hadoop infrastructures, driving the need for modernization. Upgrading the SQL engine is a highly effective solution, providing data...
3 Data Ingestion Best Practices: The Trends to Drive Success
July 11, 2024
It’s time to talk about data pipelines, specifically data ingestion best practices. Typically, a data engineering pipeline has three stages: Data Ingestion Data Transformation...
What’s next for Trino
July 3, 2024
It seems like only yesterday that Trino celebrated being around for a decade. Born out of Facebook to address the need for improved performance...
5 Ways to solve the disparate data problem and drive business outcomes
July 1, 2024
Enterprise data architectures are not pristine. They evolve to create a patchwork of different data systems, structures, and formats. Somehow, engineers must stitch everything...
Transitioning from Hadoop to modern lakehouses
June 20, 2024
As organizations strive to harness the full potential of their data, the limitations of legacy Hadoop systems become increasingly apparent. Hadoop's architecture has been...
Real-world insights from Asurion: Data quality in practice
June 13, 2024
Why should organizations pay attention to data quality is the heart of the question. Moreover, how does big data and inconsistent data impact data-driven...
Why Apache Iceberg will accelerate competition for compute engines
June 13, 2024
Apache Iceberg emerged last week triumphant, having won the race to become king of the data lakehouse. In many ways, this was a long...
Advanced Data Management: Trino, Hadoop, and AWS for a Robust Lakehouse
June 12, 2024
Apache Hadoop revolutionized enterprise data management by offering an open-source alternative to expensive proprietary data systems. Companies could process massive datasets using the commodity...
Snowflake, Databricks, Tabular, Iceberg, what does it all mean?
June 11, 2024
What happened last week? Snowflake Summit ran from Tuesday (June 4, 2024) through Thursday. This year, the conference was overshadowed by two significant announcements:...
3 Iceberg partitioning best practices to improve performance
June 5, 2024
Imagine that your desk resembled the above image. Now you need to find all the invoices for a particular month to calculate your average...
Enhancing Apache Hadoop Data Management with Trino and Starburst
June 1, 2024
For almost two decades, companies have built big data processing architectures based on the Hadoop ecosystem. To extend the Hadoop project beyond its core...