Unstructured Serverless API Services include these offerings:

  Read the launch announcement.

Supported File Types

The Unstructured API supports the same file types as the Unstructured open source library. However, the Unstructured Serverless API Services include more powerful file transformation capabilities, such as a more accurate table extraction model.

CategoryFile Types
Plaintext.eml, .html, .md, .msg, .rst, .rtf, .txt, .xml
Images.png, .jpg, .jpeg, .tiff, .bmp, .heic
Documents.csv, .doc, .docx, .epub, .odt, .pdf, .ppt, .pptx, .tsv, .xlsx

Data Ingestion

Unstructured Serverless API Services support ingesting data from various sources. Learn how.

Billing

We calculate a page as follows:

  • For these file types, a page is a page, slide, or image: .pdf, .pptx, and .tiff.
  • For .docx files that have page metadata, we calculate the number of pages based on that metadata.
  • For all other file types, we calculate the number of pages as the file’s size divided by 100 KB.

Get Support

Should you require any assistance or have any questions regarding the Unstructured API, please contact our support team at support@unstructured.io.