This page was recently updated. What do you think about it? Let us know!.

Connect Dropbox to your preprocessing pipeline, and use the Unstructured Ingest CLI or the Unstructured Ingest Python library to batch process all your documents and store structured outputs locally on your filesystem.

The requirements are as follows.

  1. A Dropbox account. Get an account.

  2. A target source or destination folder in your Dropbox account. Create a folder.

  3. A Dropbox app for your Dropbox account. To learn how to create an app, click the App Console tab on the Getting Started page.

  4. Permission for your Dropbox app to read from, and write to, the target folder in your Dropbox account as needed. To do this:

    • On the Permissions tab of your Dropbox app, check the boxes files.content.read or files.content.write or both. Learn more.

    • On the Settings tab of your Dropbox app, for App folder name, set the name of the target folder in your Dropbox account for your Dropbox app to have access to.

    • Note the remote URL to the target folder, which takes the format dropbox://<path/to/folder/in/account>.

  5. An access token for your Dropbox account. Get a token. Save this token in a secure location. Do not share it with others.

The Dropbox connector dependencies:

CLI, Python
pip install "unstructured-ingest[dropbox]"

You might also need to install additional dependencies, depending on your needs. Learn more.

The following environment variables:

  • DROPBOX_ACCESS_TOKEN - The value of your access token, represented by --token (CLI) or token (Python).
  • DROPBOX_REMOTE_URL - The remote URL to the target folder, represented by --remote-url (CLI) or remote_url (Python).

These environment variables:

  • UNSTRUCTURED_API_KEY - Your Unstructured API key value.
  • UNSTRUCTURED_API_URL - Your Unstructured API URL.

Now call the Unstructured Ingest CLI or the Unstructured Ingest Python library. The destination connector can be any of the ones supported. This example uses the local destination connector: