Source connectors
Confluence
Connect Confluence to your preprocessing pipeline, and batch process all your documents using unstructured-ingest
to
store structured outputs locally on your filesystem.
Make sure to have the Confluence dependencies installed:
Shell
To connect to Confluence Cloud, provide the url to the Confluence Cloud instance, and your credentials:
- user_email
- api_token to authenticate into Confluence Cloud
For more details on authentication, refer to Confluence documentation.
Optional arguments:
max-num-of-spaces
: The maximum number of spaces to be ingested. E.g. pass--max-num-of-spaces 10
to set this to 10.spaces
: A comma separated list of space ids for the spaces to be ingested. Example:--spaces testteamsp1,testteamsp2
. Cannot be used together withmax-num-of-spaces
.max-num-of-docs-from-each-space
: The maximum number of documents to be ingested from each space. Example:--max-num-of-docs-from-each-space 250
.
Make sure to set the --partition-by-api
flag and pass in your API key with --api-key
:
Additionally, if you’re using Unstructured Serverless API, your locally deployed Unstructured API, or an Unstructured API
deployed on Azure or AWS, you also need to specify the API URL via the --partition-endpoint
argument.