×
×

What is an Icehouse?

A New Hope for Big Data

Upgrade your data architecture with Trino and Apache Iceberg

Leader

Leader in Big Data Processing & Distribution

Leader

Highest User Adoption for Enterprise Big Data Analytics

Explore Icehouse With Starburst

Are you interested in learning more about Icehouse? A member of our team will be in touch soon to show you the architecture.

Understanding the Benefits of an Icehouse

Data Warehouse vs. Data Lakehouse

Why a data lakehouse gives you warehouse-like performance for a lower cost.

Data Warehouse

Data Lakehouse

Data Volume

Data Volume

TBs-PBs

PBs+

Data Types

Data Types

Structured

All (Structured, Semi-Structured, Unstructured)

Performance

Performance

High

High

Data Quality

Data Quality

High

High

Cost

Cost

$$$

$$ (Separation of Storage & Compute)

Open

Open

No

Yes

Use Cases

Use Cases

Business Intelligence & Reporting; Workloads

Business Intelligence & Reporting; Data Applications; Data Science; Machine Learning

The Next Era of the End-to-End Data Lakehouse

The Benefits of Starburst’s Icehouse Built With Trino & Apache Iceberg

Optimized for Big Data Analytics

Improve the scalability and responsiveness of your architecture with the lakehouse that’s proven at petabyte scale

Get Speed Without Increased Costs

Achieve data warehouse performance with a more scalable architecture without the added costs

Leverage Cutting Edge Innovation

Adopt the technology that revolutionized Netflix, Apple, Shopify, Stripe

Create the Hyperscale architecture you have always dreamed of

Data warehousing solutions simply can’t scale to big data

  • Storage: Separation of storage and compute that supports independent scaling
  • Processing: Trino is multi-parallel processing engine that supports high concurrency
  • Table Format: Iceberg built for cloud storage with decoupled metadata that supports large tables

Achieve industry-leading price performance for SQL workloads

Data warehousing solutions simply can’t scale to big data

  • Performance: Achieve the same performance as your data warehouse while optimizing your spend
  • Costs: Expect 4X cost savings over time
  • Risk: Remove any risk of having your data restricted with 0% of the lock-in of a data warehouse

Use a familiar SQL interface

The same SQL interface you’ve been used to working with

  • Support: Ensure you have the right support for DML statements and your table needs
  • Compliance: Guaranteed ACID-compliance so that all database transactions are completed easily
  • Schema Evolution: Provided schema evolution will allow you to easily modify your database without disruption

The perfect data architecture without the hassle

All on a fully-managed platform with end-to-end data pipeline support from ingestion to data sharing

  • Ingestion: Using unreliable data ingestion can create complications with data accuracy, can increase complexities with data analysis, which can ultimately lead to data unreliability.
  • Governance: Simplify data governance. With the right architecture, you can eliminate the need to integrate a whole separate governance system.
  • Analyze: Execute SQL queries on Iceberg tables fast with advanced performance optimization tools

COMPARING ICEHOUSE DATA TABLES

Apache Iceberg vs. Delta Lake

Apache Iceberg

Databricks Delta Lake

Transaction Support (ACID Compliance)

Transaction Support (ACID Compliance)

Full

Full

File Format

File Format

Parquet, ORC, Avro

Parquet

Schema Evolution

Schema Evolution

Full

Limited

(Only supports adds/reorders of columns)

Partition Evolution

Partition Evolution

Yes

No

Versioning

Versioning

Yes

Yes

Time Travel

Time Travel

Yes

Yes

Materialized Views

Materialized Views

Yes

No

Community & Ecosystem

Community & Ecosystem

Growing

Growing

Integrations

Integrations

Interoperable

Tight integration with Databricks

Use Cases

Use Cases

General purpose data lakehouses

Optimized for Databricks data lakehouses

Apache Iceberg vs. Apache Hive

Apache Iceberg

Apache Hive

Transaction Support (ACID Compliance)

Transaction Support (ACID Compliance)

Full

Only w/Hive ACID

File Format

File Format

Parquet, ORC, Avro

Parquet, ORC, Avro

Schema Evolution

Schema Evolution

Full

Limited

(No guarantees of correctness)

Partition Evolution

Partition Evolution

Yes

No

Versioning

Versioning

Yes

No

Time Travel

Time Travel

Yes

No

Materialized Views

Materialized Views

Yes

Yes

Community & Ecosystem

Community & Ecosystem

Growing

Established

Integrations

Integrations

Interoperable

Interoperable

Use Cases

Use Cases

General purpose data lakehouses

General purpose data lake w/limited DML support

Start Free with
Starburst Galaxy

Up to $500 in usage credits included

  • Query your data lake fast with Starburst's best-in-class MPP SQL query engine
  • Get up and running in less than 5 minutes
  • Easily deploy clusters in AWS, Azure and Google Cloud
For more deployment options:
Download Starburst Enterprise

Please fill in all required fields and ensure you are using a valid email address.