Hi,
We’re exploring ways to connect Starburst Enterprise (SEP 476-e.1) to SharePoint or OneDrive to query CSV or Excel files stored in corporate document libraries.
Currently, our production environment uses:
- SEP deployed on OpenShift (OCP)
- Multiple Hive/Iceberg catalogs backed by HCP-S3 endpoints
- Dynamic catalog management enabled
Several of our business teams store data files in SharePoint, and we would like to query those files directly in Starburst, much as we query objects in S3 or Azure Blob.
Questions:
- Is there an existing or planned SharePoint / OneDrive connector for SEP or Trino?
- If not, is there any recommended approach to ingest SharePoint data (e.g. via Microsoft Graph API, WebDAV, or rclone → S3 sync) that integrates cleanly with Starburst catalogs?
- Would the community be open to a custom connector contribution (Graph API based) that exposes SharePoint folders as read-only tables (CSV/XLSX)?
- Are there any best practices for scheduling or automating this ingestion into HCP-S3 for querying via Hive/Iceberg?
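To make the Graph API option more concrete, here is a rough sketch of the planning step we have in mind. It is not a working connector. It only builds the Graph URL that lists a folder in a document library and decides which files would be copied to which S3 key. The function names, the drive ID, and the `landing/sharepoint` prefix are placeholders we made up for illustration. The HTTP calls, authentication, and the upload to HCP-S3 are deliberately left out.

```python
from urllib.parse import quote

GRAPH_BASE = "https://graph.microsoft.com/v1.0"


def children_url(drive_id: str, folder_path: str) -> str:
    """Build the Graph URL that lists the items in a document library folder."""
    path = quote(folder_path.strip("/"))  # keeps "/" and escapes spaces etc.
    if not path:
        return f"{GRAPH_BASE}/drives/{drive_id}/root/children"
    return f"{GRAPH_BASE}/drives/{drive_id}/root:/{path}:/children"


def plan_sync(items, prefix):
    """Map driveItem dicts to (download_url, s3_key) pairs for CSV/XLSX files.

    `items` is the "value" array of a children listing. `prefix` is a
    hypothetical landing prefix in the target bucket.
    """
    plan = []
    for item in items:
        if "file" not in item:  # folders carry a "folder" facet instead
            continue
        name = item.get("name", "")
        if not name.lower().endswith((".csv", ".xlsx")):
            continue
        url = item.get("@microsoft.graph.downloadUrl")
        if url:
            plan.append((url, f"{prefix.strip('/')}/{name}"))
    return plan
```

A scheduled job would call `children_url`, fetch the listing with a Graph token, pass the returned `value` array to `plan_sync`, and then stream each download URL into the bucket behind a Hive catalog. Our assumption is that XLSX files would still need converting to CSV or Parquet before Hive/Iceberg could read them.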
Any guidance, roadmap notes, or references to similar use cases would be appreciated.