
JPMC's Personal Guide to Starburst


A Brief Introduction to Starburst
A single point of access to query data that lives in any data system
Based on open source Trino, Starburst gives you the flexibility to run federated
interactive and ETL workloads using a single query engine, significantly reducing data
movement delays and costs.
From globally dispersed internal data to 3rd party data, Starburst empowers customers
to make informed risk mitigation, revenue-generating, and process optimization
decisions while adhering to data sovereignty, regulatory, and data security requirements.

Use Cases for Phase 1
AWM
Analytics Reporting & AI/ML Workflows
Executive Sponsors: Michael Urciuoli, Michael Heizer, Rod Thomas, Eddie Hsu, Ram Jois
Modernize Data Galaxy platform and accelerate AWS migration to streamline Advisor and DS team workflows and drive new revenue opportunities. Federate HDFS and on prem/cloud S3 with 10-100x performance increase on Hive / Impala current solution
CIB
IBDH Data Hub
Executive Sponsors: Grant McKenzie, Prashant Reddy, Thilak Maskibal
Investment banking data hub – Modernize analytics platform and accelerate cloud migration for banking applications, reports and Data science consumers
Workforce Technology
Epx Platform Employee 360
Executive Sponsors: Andy Goldberg, Christian Winter, Erik Palfrey, Andrew Feig, Krishna Kannan
Falcon – Federated Data Lake (FDL) – federated governed data mesh / lake ecosystem on JPMC AWS public cloud (S3, LF, Oracle)
CB
Innovation Economy Platform and Enterprise BI Toolkit
Executive Sponsors: Ananth Hedge
Platform capability to enable business users to gather, analyze, and visualize business – empowering key user groups to access information required to make better data-driven decisions.
Critical Starburst Capabilities

Performance and Scalability
- Proven at PB scale, thousands of nodes (Citi, BOFA, FINRA)
- Performance and scalability tested by Wealth Management
- 10x more concurrency than Dremio and Athena
- 10-100x faster than Hive and Impala on Hadoop

Federated Query Engine for Data Mesh Hybrid & Multi Cloud Optionality
- Tested connectivity to JPMC Cloudera and S3
- 50+ Enhanced Performance & Parallel Connectors; Oracle, Teradata, Hive, Snowflake, S3 Data lake
- Parquet accelerator, ORC, Delta Lake, Iceberg
- Starburst Stargate, Data Products
Security & Governance
- Deployed in JPMC environment (SEALID, GKS, Authentication)
- Only vendor with a native Immuta integration
- Tested Glue integration. Lake Formation
- RBAC, ABAC, Ranger Plugin – Column & Row level OOTB

Enterprise Support & Monitoring
- Fully supported, production-tested and enterprise grade distribution of open source Trino. Version control and release management
- Starburst Insights – ability to track cluster metrics and debug queries to optimize query performance and compute resources.
Open Architecture
- Open Core Architecture – Pluggable & fully extensible by JPMC
- Vendor Agnostic – No vendor lock in, or proprietary formats
- 5000+ community members, 2000+ pull requests a year, 400 committers
Key Resources for JPMC
On-Demand Sessions
Strategic Partners
Starburst integrates with a wide range of cloud, technology, and consulting partners,
enabling you to access your data wherever and however you may need.

Customer Case Studies
Join us virtually!
Your Dedicated Starburst Team
Meet the team working to bring JPMC’s analytics anywhere.
Get in touch
Want to try Starburst? Have questions? We're here to help.