Querying SharePoint Data in Starburst

knikhilreddy99 · December 1, 2025, 11:36pm

Hi ,

We’re exploring ways to connect Starburst Enterprise (SEP 476-e.1) to SharePoint or OneDrive to query CSV or Excel files stored in corporate document libraries.

Currently, our production environment uses:

SEP deployed on OpenShift (OCP)
Multiple Hive/Iceberg catalogs backed by HCP-S3 endpoints
Dynamic catalog management enabled

We have several business teams storing data files in SharePoint and would like to query them directly in Starburst (similar to S3 or Azure Blob).

Questions:

Is there an existing or planned SharePoint / OneDrive connector for SEP or Trino?
If not, is there any recommended approach to ingest SharePoint data — e.g. via Microsoft Graph API, WebDAV, or rclone → S3 sync — that integrates cleanly with Starburst catalogs?
Would the community be open to a custom connector contribution (Graph API based) that exposes SharePoint folders as read-only tables (CSV/XLSX)?
Are there any best practices for scheduling or automating this ingestion into HCP-S3 for querying via Hive/Iceberg?

Any guidance, roadmap notes, or references to similar use cases would be appreciated.

lester · December 2, 2025, 2:13am

One approach would be to copy the existing Google Sheets connector — Trino 478 Documentation and then see if it can be modified to use the Graph API; Excel workbooks and charts API overview - Microsoft Graph | Microsoft Learn .