Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.unstructured.io/llms.txt

Use this file to discover all available pages before exploring further.

This page describes how to establish private connectivity between your dedicated instance and Azure-managed services. For self-hosted applications, see Connecting to customer-managed services on Azure.

Azure managed services

This section covers Azure-managed services that have service-specific private connectivity requirements. The tables below use Order to indicate the general sequence of the information exchange — items with the same Order value can usually be provided at the same stage. Values like <UNSTRUCTURED_SP_ID> and <DI_AZURE_SUBSCRIPTION_ID> are placeholders; contact your Unstructured account team or open a ticket in the Unstructured Support Portal to get the actual values for your dedicated instance.
Each Azure service you connect to via Private Link must be in the same Azure region as your dedicated instance — cross-region Private Link is not supported. The example region codes in the tables below are illustrative; use the region your dedicated instance is deployed in. See Known limitations.

Azure Blob Storage

This applies to Azure Blob Storage and Azure Data Lake Storage Gen2 (ADLS Gen2).
OrderInformation RequiredDescriptionExampleOwner
1Storage Account NameName of the Azure Storage AccountcustomerstorageacctCustomer
1Storage Account Resource IDFull Azure Resource ID of the storage account/subscriptions/…/providers/Microsoft.Storage/storageAccounts/customerstorageacctCustomer
1Container Name(s)Containers Unstructured needs to accessdocuments, processed-dataCustomer
1Storage Account RegionRegion where storage account is locatedeastusCustomer
1Sub-resource Typeblob, dfs (for ADLS Gen2), file, table, queueblob, dfsCustomer
2Unstructured Service Principal IDService Principal ID for RBAC role assignment<UNSTRUCTURED_SP_ID>Unstructured
2Unstructured Subscription IDSubscription ID for private endpoint approval<DI_AZURE_SUBSCRIPTION_ID>Unstructured
For ADLS Gen2, use sub-resource type dfs for hierarchical namespace operations. You must approve the private endpoint connection request. To do so, navigate to Storage Account > Networking > Private endpoint connections. Example: Azure role assignment The following Azure CLI command assigns the Storage Blob Data Reader role to Unstructured’s service principal so the platform can read blobs in your storage account.
az role assignment create \
  --assignee "<UNSTRUCTURED_SP_ID>" \
  --role "Storage Blob Data Reader" \
  --scope "/subscriptions/<SUBSCRIPTION_ID>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.Storage/storageAccounts/<STORAGE_ACCOUNT>"
For read/write access, use the Storage Blob Data Contributor role instead of Storage Blob Data Reader. Replace:
  • <UNSTRUCTURED_SP_ID> — Service principal ID for Unstructured (provided during onboarding)
  • <SUBSCRIPTION_ID> — Your Azure subscription ID
  • <RESOURCE_GROUP> — Resource group that contains the storage account
  • <STORAGE_ACCOUNT> — Storage account name

Azure OpenAI

Azure OpenAI supports private endpoint access. Unstructured creates a private endpoint in our VNet to access the your Azure OpenAI resource privately.
OrderInformation RequiredDescriptionExampleOwner
1Azure OpenAI Resource NameName of the Azure OpenAI resourcecustomer-openaiCustomer
1Azure OpenAI Resource IDFull Azure Resource ID/subscriptions/…/providers/Microsoft.CognitiveServices/accounts/customer-openaiCustomer
1Azure OpenAI RegionRegion where the resource is deployedeastusCustomer
1Deployment Name(s)Model deployment names to accessgpt-4-deployment, embedding-deploymentCustomer
1Sub-resource TypeAlways account for Azure OpenAIaccountCustomer
2Unstructured Service Principal IDService Principal ID for RBAC role assignment<UNSTRUCTURED_SP_ID>Unstructured
2Unstructured Subscription IDSubscription ID for private endpoint approval<DI_AZURE_SUBSCRIPTION_ID>Unstructured
Example: Azure role assignment for OpenAI Use Azure CLI to assign the Cognitive Services OpenAI User role to Unstructured’s service principal for your Azure OpenAI resource:
az role assignment create \
  --assignee "<UNSTRUCTURED_SP_ID>" \
  --role "Cognitive Services OpenAI User" \
  --scope "/subscriptions/<SUBSCRIPTION_ID>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<OPENAI_RESOURCE>"
Replace:
  • <UNSTRUCTURED_SP_ID> — Service principal ID for Unstructured (provided during onboarding)
  • <SUBSCRIPTION_ID> — Your Azure subscription ID
  • <RESOURCE_GROUP> — Resource group that contains the storage account
  • <OPENAI_RESOURCE> — Your Azure OpenAI Resource ID
After the role assignment:
  1. Firewall: On the Azure OpenAI resource, configure networking so public access is disabled and traffic is allowed only as required for your private endpoint design.
  2. Approve the connection: In the Azure portal, go to the OpenAI resource > Networking > Private endpoint connections, then approve the pending private endpoint connection from Unstructured.

Azure Databricks

Azure Databricks supports Private Link for both front-end (user-to-workspace) and back-end (compute-to-control plane) connectivity. You must have Private Link enabled on your workspace.
OrderInformation RequiredDescriptionExampleOwner
1Databricks Workspace URLThe workspace URLadb-1234567890123456.12.azuredatabricks.netCustomer
1Databricks Workspace Resource IDFull Azure Resource ID of the workspace/subscriptions/…/providers/Microsoft.Databricks/workspaces/customer-workspaceCustomer
1Databricks Workspace RegionAzure region where workspace is deployedeastusCustomer
1Sub-resource Typedatabricks_ui_api for workspace accessdatabricks_ui_apiCustomer
2Unstructured Subscription IDSubscription ID for private endpoint approval<DI_AZURE_SUBSCRIPTION_ID>Unstructured
3Private endpoint DNSThe private endpoint DNS for the workspaceadb-1234567890123456.12.privatelink.azuredatabricks.netCustomer
Example: Approve the Databricks private endpoint connection After Unstructured creates the private endpoint toward your workspace, approve the pending connection using Azure CLI:
az network private-endpoint-connection approve \
  --resource-group "<RESOURCE_GROUP>" \
  --resource-name "<WORKSPACE_NAME>" \
  --name "<PRIVATE_ENDPOINT_NAME>" \
  --type Microsoft.Databricks/workspaces
You can also approve the same request in the Azure portal: open the Databricks workspace resource > Networking > Private endpoint connections, then approve the pending connection from Unstructured. Replace:
  • <RESOURCE_GROUP> — Resource group that contains the Databricks workspace
  • <WORKSPACE_NAME> — Name of the Databricks workspace resource
  • <PRIVATE_ENDPOINT_NAME> — Name of the private endpoint connection (shown in the portal or provided by Unstructured)

Azure Managed Services with private endpoint access

This section applies to Azure Managed Services that support private endpoint access, including: Azure AI Search, Azure SQL Database, Azure SQL Managed Instance, Azure Cosmos DB, Azure Database for PostgreSQL, Azure Database for MySQL, and Azure Synapse Analytics.
The Order column indicates the general sequence for the information exchange. Items with the same order value can usually be provided at the same stage.
OrderInformation RequiredDescriptionExampleOwner
1Service TypeThe Azure managed service being accessedAzure AI Search, Azure SQLCustomer
1Resource NameName of the Azure resourcemy-search-serviceCustomer
1Resource IDFull Azure Resource ID/subscriptions/…/providers/Microsoft.Search/searchServices/my-searchCustomer
1Service RegionRegion where the service is hostedeastusCustomer
1Sub-Resource TypeThe sub-resource type for Private Link (see table below)searchService, sqlServerCustomer
1Service PortPort the service listens on443 (AI Search), 1433 (SQL)Customer
2Unstructured Service Principal IDService Principal ID for RBAC role assignment<UNSTRUCTURED_SP_ID>Unstructured
2Unstructured Subscription IDSubscription ID to approve for private endpoint<DI_AZURE_SUBSCRIPTION_ID>Unstructured
Sub-resource types by service:
ServiceSub-Resource Type
AI SearchsearchService
Azure SQL DatabasesqlServer
Azure SQL Managed InstancemanagedInstance
Azure Cosmos DB (SQL API)Sql
Azure Cosmos DB (MongoDB API)MongoDB
Azure Database for PostgreSQLpostgresqlServer
Azure Database for MySQLmysqlServer
Azure Synapse AnalyticsSql, SqlOnDemand, Dev