If you’re new to Unstructured, read this note first.Before you can create a source connector, you must first sign in to your Unstructured account:
- If you do not already have an Unstructured account, sign up for free. After you sign up, you are automatically signed in to your new Unstructured Starter account, at https://platform.unstructured.io. To sign up for a Team or Enterprise account instead, contact Unstructured Sales, or learn more.
- If you already have an Unstructured Starter or Team account and are not already signed in, sign in to your account at https://platform.unstructured.io. For an Enterprise account, see your Unstructured account administrator for instructions, or email Unstructured Support at support@unstructured.io.
- For the Unstructured UI or the Unstructured API, only Couchbase Capella clusters are supported.
- For Unstructured Ingest, Couchbase Capella clusters and local Couchbase server deployments are supported.
- A Couchbase Capella account.
- A Couchbase Capella cluster.
- A bucket, scope, and collection on the cluster.
- The cluster’s public connection string.
- The cluster access name (username) and secret (password).
-
Incoming IP address allowance for the cluster.
To get Unstructured’s IP address ranges, go to
https://assets.p6m.u10d.net/publicitems/ip-prefixes.json
and allow all of the
ip_prefix
fields’ values that are listed.These IP address ranges are subject to change. You can always find the latest ones in the preceding file.
- Installation of a local Couchbase server.
- Connection details to the local Couchbase server.
- On the sidebar, click Connectors.
- Click Sources.
- Cick New or Create Connector.
- Give the connector some unique Name.
- In the Provider area, click Couchbase.
- Click Continue.
- Follow the on-screen instructions to fill in the fields as described later on this page.
- Click Save and Test.
- Name (required): A unique name for this connector.
- Bucket (required): The name of the bucket to connect to on the cluster.
- Connection string (required): The connection string for the Couchbase cluster.
- Scope: The name of the scope in the bucket. The default is
_default
if not otherwise specified. - Collection: The name of the collection in the scope. The default is
_default
if not otherwise specified. - Batch Size: The maximum number of documents per transmitted batch. The default is
50
if not otherwise specified. - Collection ID key (source connector only): The name of the collection field that contains the document ID. The default is
id
if not otherwise specified - Username (required): The cluster access name (username) that has access to the cluster.
- Password (required): The password for the corresponding username.