Installation
Full Installation
Basic Usage
For a complete set of extras catering to every document type, use:
To install unstructured
, you’ll also need to install the following system dependencies:
libmagic,
poppler,
libreoffice,
pandoc,
and tesseract.
Instruction details for these dependencies will vary by operating system. We recommend
running unstructured
from the officially supported Docker image, which has these dependencies
installed already.
Installation for Specific Document Types
If you’re processing document types beyond the basics, you can install the necessary extras:
Available document types:
Installation for Specific Data Connectors
To use any of the data connectors, you must install the specific dependency:
Available data connectors:
Was this page helpful?