SharePoint
Ingest your files into Unstructured from SharePoint.
You’ll need:
The SharePoint prerequisites, which include the following settings.
-
The SharePoint site URL.
- Site URLs typically have the format
https://<tenant>.sharepoint.com
. - Relative site URLs typically have the format
https://<tenant>.sharepoint.com/sites/<site_name>
. - To process all sites within a tenant, use a site URL of
https://<tenant>-admin.sharepoint.com
. This requires the app to be registered at a tenant level.
- Site URLs typically have the format
-
The path in the SharePoint site from which to start parsing files, for example
"Shared Documents"
. If the connector is to process all sites within the tenant, this filter will be applied to all site document libraries. -
A SharePoint app principal with its application (client) ID, client secret, and access permissions to the SharePoint instance. Get a client ID and client secret, and set access permissions.
You can create SharePoint app principals through
https://<tenant>.sharepoint.com/_layouts/15/appregnew.aspx
.For a SharePoint app principal with site-scoped permissions, use app permission request XML such as the following to grant permissions through
https://<tenant>.sharepoint.com/_layouts/15/appinv.aspx
:For a SharePoint app principal with tenant-scoped permissions, use app permission request XML such as the following to grant permissions through
https://<tenant>-admin.sharepoint.com/_layouts/15/appinv.aspx
instead:Available
Right
settings includeRead
,Write
,Manage
, andFullControl
. To learn more, see Add-in permissions in SharePoint.
To create the source connector:
- On the sidebar, click Connectors.
- Click Sources.
- Click Add new.
- Give the connector some unique Name.
- In the Provider area, click SharePoint.
- Click Continue.
- Follow the on-screen instructions to fill in the fields as described later on this page.
- Click Save and Test.
Fill in the following fields:
- Name (required): A unique name for this connector.
- Client ID (required): The client ID provided by SharePoint for the app registration.
- Client Credential (required): The client secret associated with the client ID.
- Site URL (required): The base URL of the SharePoint site to connect to.
- Path (required): The path from which to start parsing files, for example
Shared Documents
. - Recursive: Check this box to recursively process data from subfolders within the specified path.
- Files Only: Check this box to disregard the folder structure.