Connect Discord to your preprocessing pipeline and use unstructured-ingest to batch process all your documents, storing the structured outputs locally on your filesystem.

Make sure to have the Discord dependencies installed:

Shell
pip install "unstructured-ingest[discord]"

To ingest the contents of Discord channels, you need to supply the following information:

  • token: an authentication token used to access the Discord API
  • channels: a list of Discord channel IDs to ingest from

Optionally, you can use the period argument to set how many days of channel history to go back.
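
As a rough sketch, a run that combines these arguments might look like the following. The discord subcommand, the --token, --channels, and --period flag spellings (assumed to mirror the argument names above), the --output-dir flag, and the placeholder channel ID and directory name are all assumptions rather than confirmed syntax. Run unstructured-ingest discord --help to check what your installed version accepts.

```shell
# Hypothetical invocation: flag names are assumed to mirror the arguments above.
# DISCORD_TOKEN is an environment variable you export beforehand.
# 12345678 is a placeholder channel ID; 7 is an example number of days.
unstructured-ingest \
  discord \
  --token "$DISCORD_TOKEN" \
  --channels 12345678 \
  --period 7 \
  --output-dir discord-output
```

Reading the token from an environment variable keeps it out of your shell history and out of any scripts you commit.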

Make sure to set the --partition-by-api flag and pass in your API key with --api-key.

Additionally, if you’re using the Unstructured Serverless API, a locally deployed Unstructured API, or an Unstructured API deployed on Azure or AWS, you also need to specify the API URL via the --partition-endpoint argument.
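
A minimal sketch of an API-backed run, assuming the same hypothetical source flags as before. Only --partition-by-api, --api-key, and --partition-endpoint come from the text above. The discord subcommand, --token, --channels, --output-dir, the placeholder values, and the environment variable names are assumptions for illustration.

```shell
# Hypothetical invocation: source flags and --output-dir are assumptions.
# UNSTRUCTURED_API_KEY and UNSTRUCTURED_API_URL are environment variables
# you export beforehand; the URL depends on which API deployment you use.
unstructured-ingest \
  discord \
  --token "$DISCORD_TOKEN" \
  --channels 12345678 \
  --output-dir discord-output \
  --partition-by-api \
  --api-key "$UNSTRUCTURED_API_KEY" \
  --partition-endpoint "$UNSTRUCTURED_API_URL"
```

The --partition-endpoint line applies only to the deployments listed above and can be dropped otherwise.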