> ## Documentation Index
> Fetch the complete documentation index at: https://docs.unstructured.io/llms.txt
> Use this file to discover all available pages before exploring further.

# Partitioning

<iframe width="560" height="315" src="https://www.youtube.com/embed/0HAWt9Xog-Y" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen />

*Partitioning* extracts content from raw unstructured files and outputs that content as structured [document elements](/ui/document-elements).

For specific file types, such as image files and PDF files, Unstructured offers special strategies to partition them. Each of these
strategies has trade-offs for output speed, cost to output, and quality of output.

PDF files, for example, vary in quality and complexity. In simple cases, traditional natural language processing (NLP) extraction techniques might
be enough to extract all the text out of a document. In other cases, advanced image-to-text models are required
to process a PDF file. Some of these strategies implement rule-based workflows, which can be faster and cheaper, because they always
extract in the same way, but you might sometimes get lower-quality resolution. Other strategies implement
model-based workflows, which can be slower and costlier because they require a model that performs inference, but you can get higher-quality resolution.
When you choose a partitioning strategy for your files, you should be mindful of these speed, cost, and quality trade-offs.
For example, the **Fast** strategy can be about 100 times faster than leading image-to-text models.

To choose one of these strategies, select one of the following four **Partition Strategy** options in the **Partitioner** node of a workflow.

<Note>You can change a workflow's preconfigured strategy only through [Custom](/ui/workflows#create-a-custom-workflow) workflow settings.</Note>

Unstructured recommends that you choose the **Auto** partitioning strategy in most cases. With **Auto**, Unstructured does all
the heavy lifting, optimizing at runtime for the highest quality at the lowest cost page-by-page.

You should consider the following additional strategies only if you are absolutely sure that your documents are of the same
type. Each of the following strategies are best suited for specific situations. Choosing one of these
strategies other than **Auto** for sets of documents of different types could produce undesirable results,
including reduction in transformation quality.

* **VLM**: For the highest-quality transformation of these file types: `.bmp`, `.gif`, `.heic`, `.jpeg`, `.jpg`, `.pdf`, `.png`, `.tiff`, and `.webp`.
* **High Res**: For all other [supported file types](/ui/supported-file-types), and for the generation of bounding box coordinates.
* **Fast**: For text-only documents.

The **Auto** partitioning strategy routes each file as a complete unit to the appropriate partitioning strategy (**VLM**, **High Res**, or **Fast**)
based on the preceding file types. Additionally, for `.pdf` files, the **Auto** partitioning strategy routes these files' pages
on a page-by-page basis, as follows:

* A page is routed to **Fast** when it contains only embedded text and no images or tables are detected.
* All other kinds of pages are routed to **VLM** or **High Res**, depending on the complexity of a page's
  content. Unstructured constantly optimizes its proprietary algorithm for routing to **VLM** or **High Res** in these cases.

## Images and tables in PDF files

The differences between the various partitioning strategies can be more clearly demonstrated by the ways each of these strategies handle images and tables within PDF files.

For example, the **Fast** partitioning strategy skips processing images altogether in PDF files:

<img src="https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Image-Example.png?fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=f3049c20120d97df8734bcb9f746ec10" alt="The Fast strategy skips processing images in PDF files" data-og-width="2988" width="2988" data-og-height="1356" height="1356" data-path="img/partitioning/Fast-Image-Example.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Image-Example.png?w=280&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=ed0aa95110f5233dcc244da47a11165a 280w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Image-Example.png?w=560&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=1b05434f8caf7e8580a5ffcff7af6f67 560w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Image-Example.png?w=840&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=664d78c6a93b6c95a272c135511d2b2d 840w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Image-Example.png?w=1100&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=d9707fb30e470bc695ace0691077d82f 1100w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Image-Example.png?w=1650&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=abb29a9e650856dbccae09a133c3336a 1650w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Image-Example.png?w=2500&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=9555a6009a8e84250a0c13971650852d 2500w" />

For tables, the **Fast** strategy interprets table cells in PDF files as a mixture of title, list, and uncategorized text elements:

<img src="https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Table-Example.png?fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=6c423798d881fd1a5806e98544e505e2" alt="The Fast strategy interprets table cells in PDF files as text" data-og-width="2606" width="2606" data-og-height="1398" height="1398" data-path="img/partitioning/Fast-Table-Example.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Table-Example.png?w=280&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=c82a94b1b751b12fdb52fc837576c1a0 280w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Table-Example.png?w=560&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=152ac9cf515829710cc5b5ae0138b67c 560w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Table-Example.png?w=840&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=cf1b73f2d3ba70e7d70512626770a0ff 840w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Table-Example.png?w=1100&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=5e96d3cbdd9530cb4b6086bf21453ef5 1100w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Table-Example.png?w=1650&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=40e21e56188103fb41c5a402c11d38fb 1650w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Fast-Table-Example.png?w=2500&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=354f5a3ccf91c031a905a247d84e2b69 2500w" />

The **High Res** strategy, by itself, processes images in PDF files sometimes with limited output:

<img src="https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Example.png?fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=5a0d964c1845d79a8dad3eadb9fd67ca" alt="The High Res strategy processes images in PDF files with limited results" data-og-width="2994" width="2994" data-og-height="576" height="576" data-path="img/partitioning/Hi-Res-Image-Example.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Example.png?w=280&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=8b3fcb592d932d8277ad3c0b8fffc34a 280w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Example.png?w=560&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=ec9e935f23a0499e55a5c7e4aeb72d9f 560w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Example.png?w=840&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=27f3bd3c5e93863170c0d661f618e681 840w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Example.png?w=1100&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=90bf41b88db456709179913774e8bef0 1100w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Example.png?w=1650&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=d4a9daa4d1e4ce7ff21316fb81b63fec 1650w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Example.png?w=2500&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=8a0041ada102d021e2379afbbd8b7998 2500w" />

However, when combined with the [image description](/ui/enriching/image-descriptions) enrichment, the **High Res** strategy can process images in PDF files with better result output:

<img src="https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Enriched-Example.png?fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=b0781cfa0ae624d94850a492dc787b1a" alt="The High Res strategy with image description produces better results" data-og-width="2986" width="2986" data-og-height="718" height="718" data-path="img/partitioning/Hi-Res-Image-Enriched-Example.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Enriched-Example.png?w=280&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=3e68fcd035046e635f216e053610863f 280w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Enriched-Example.png?w=560&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=11613f006d1163fe2e9a1948a6ca5a20 560w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Enriched-Example.png?w=840&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=216ec9229f9367a75d828ff6c13f02aa 840w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Enriched-Example.png?w=1100&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=d6c1632257ed649a65fb78b7e4bcb24b 1100w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Enriched-Example.png?w=1650&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=e895a703f422b2f52f30831b35b54463 1650w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Hi-Res-Image-Enriched-Example.png?w=2500&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=416f5bd7629359a22740fb086ef96b63 2500w" />

For tables, the **High Res** strategy processes tables in PDF files with the table's text and an HTML representation of the table as output:

<img src="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Example.png?fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=5e5475c1b3d148116bdcc14383fbd779" alt="The High Res strategy processes tables in PDF files with the table's text and HTML as output" data-og-width="3020" width="3020" data-og-height="796" height="796" data-path="img/partitioning/Hi-Res-Table-Example.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Example.png?w=280&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=8d668af78414371dda7ffa2623cd88b9 280w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Example.png?w=560&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=ac4f7ba7a4d086e36f8f2b47021f9a8d 560w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Example.png?w=840&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=1f73b1ec8949ffe529ab03c37226dd8b 840w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Example.png?w=1100&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=1ff76a7c147f9c170ca935d0e373c9f0 1100w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Example.png?w=1650&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=e9deafd0c6eb590ea1c96e61c56d25c8 1650w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Example.png?w=2500&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=41eb8b97496fe378e85535e9be8ad94d 2500w" />

When combined with the [table description](/ui/enriching/table-descriptions) and [tables to HTML](/ui/enriching/table-to-html) enrichments, the **High Res** strategy can process tables in PDF files with even richer result output:

<img src="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Enriched-Example.png?fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=60bb1ccb84c015b17301910cc1d1e49d" alt="The High Res strategy with table summarization and table-to-HTML enrichments produces better results" data-og-width="3004" width="3004" data-og-height="1096" height="1096" data-path="img/partitioning/Hi-Res-Table-Enriched-Example.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Enriched-Example.png?w=280&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=d511cac472d7b880cb63fc610f8b7419 280w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Enriched-Example.png?w=560&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=d02810e1afff57467595c046185227b3 560w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Enriched-Example.png?w=840&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=a57a7a760f4ac06fe1b22dc8f8f0cc09 840w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Enriched-Example.png?w=1100&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=42d32a0f7629d8faa5cba0b872b44730 1100w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Enriched-Example.png?w=1650&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=1a72133e60dde6029f6f09c7bee3e452 1650w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hi-Res-Table-Enriched-Example.png?w=2500&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=7b4ab4844f6a244e813405b7bfdf12d0 2500w" />

The **VLM** strategy processes images in PDF files with image summaries and text as HTML elements as output. The following example shows GPT-4o by OpenAI being used. If
the **Auto** strategy is selected in this example, Unstructured will route to the **VLM** strategy for processing:

<img src="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Image-GPT-4o-Example.png?fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=fbd0c6efff975ee4e3dfdea9afe5c424" alt="The VLM strategy processes images in PDF files with image summaries and text as HTML" data-og-width="2938" width="2938" data-og-height="682" height="682" data-path="img/partitioning/VLM-Auto-Image-GPT-4o-Example.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Image-GPT-4o-Example.png?w=280&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=19aa964eda330c68bd0b58c9e04203c4 280w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Image-GPT-4o-Example.png?w=560&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=bc5b6f3df717776f25961c81c19d314f 560w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Image-GPT-4o-Example.png?w=840&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=e0996ff9669925249bfba3282508bddf 840w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Image-GPT-4o-Example.png?w=1100&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=786df9386e9ae79e6e859d594d11a266 1100w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Image-GPT-4o-Example.png?w=1650&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=c8decc0381953def5eba3c1c374bacf7 1650w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Image-GPT-4o-Example.png?w=2500&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=c12a91c943dc26ed807bae779b0b2159 2500w" />

For tables, the **VLM** strategy processes tables in PDF files with the table's text and an HTML representation of the table as output, similar to the **High Res** strategy.
The following example shows GPT-4o by OpenAI being used. If the **Auto** strategy is selected in this example, Unstructured will route to the **VLM** strategy for processing:

<img src="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Table-GPT-4o-Example.png?fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=474521c4ae1ec6fb90056604fbbed218" alt="The VLM strategy processes tables in PDF files with table summaries and text as HTML" data-og-width="3016" width="3016" data-og-height="912" height="912" data-path="img/partitioning/VLM-Auto-Table-GPT-4o-Example.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Table-GPT-4o-Example.png?w=280&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=a032d917b610fb749771f19738ce0409 280w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Table-GPT-4o-Example.png?w=560&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=8fec7b3e4bd8e8691a10fa324de9f995 560w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Table-GPT-4o-Example.png?w=840&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=71bae3da3c6ff6ef7654ea539a2d027d 840w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Table-GPT-4o-Example.png?w=1100&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=e17905685b606e3396586094b8ece834 1100w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Table-GPT-4o-Example.png?w=1650&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=e4f5faaed5fb1490bb67d4a136418e76 1650w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/VLM-Auto-Table-GPT-4o-Example.png?w=2500&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=209ef200d438088039b28d517ed8be75 2500w" />

## Handwriting and multilanguage characters in PDF files

The differences between the various partitioning strategies can be more clearly demonstrated by the ways each of these strategies handle handwriting and multilanguage characters within PDF files.

For example, the **Fast** partitioning strategy skips processing handwriting altogether in PDF files.

The **Fast** strategy processes multilanguage characters in PDF files with limited output, depending on the language. In the following
example, Japanese hiragana characters are processed as text, but the output can be very difficult to work with:

<img src="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Fast.png?fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=6f73218b7522ce0a521c7f769e30a8fc" alt="The Fast strategy produces cryptic CID codes for hiragana characters" data-og-width="1384" width="1384" data-og-height="726" height="726" data-path="img/partitioning/Hiragana-Fast.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Fast.png?w=280&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=46bbac315a9aaf89584cf4d0f9944a9a 280w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Fast.png?w=560&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=d8ca72994e5b5a42d730f4ccf9406a1d 560w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Fast.png?w=840&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=06b5772fc9a5e9de04e0171665d62c69 840w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Fast.png?w=1100&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=93b22b0aad828cc48691d9618f26eba7 1100w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Fast.png?w=1650&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=c488012a721bba9deee70c03c0d0f7c1 1650w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Fast.png?w=2500&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=9e6e9dfbb76545f791d323bb9b279298 2500w" />

For handwriting, the **High Res** strategy typically produces unusable results, for example:

<img src="https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-Hi-Res.png?fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=c5a66505562b34f6f8baeecf310d0cb6" alt="The High Res strategy typically produces unusable results for handwriting" data-og-width="1825" width="1825" data-og-height="835" height="835" data-path="img/partitioning/Handwriting-Hi-Res.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-Hi-Res.png?w=280&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=7703559869224dd410abe5c008c84f32 280w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-Hi-Res.png?w=560&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=11da5b9d7631a8a6094337d38909318c 560w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-Hi-Res.png?w=840&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=6dad17f79e8871b9951a5f82b4c9babe 840w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-Hi-Res.png?w=1100&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=94c57b8ff9df828e3e392637c4d1d14f 1100w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-Hi-Res.png?w=1650&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=3da693f5e863088bc4572c8deee4fa73 1650w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-Hi-Res.png?w=2500&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=3ab3ffe7879574019ceb069d360e2cbd 2500w" />

For multilanguage characters, the **High Res** strategy also typically produces unusable results, for example failing to recognize Japanese hiragana characters:

<img src="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Hi-Res.png?fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=2c4a6e8ca9df0e4125c93a1d3c00a909" alt="The High Res strategy typically produces unusable results for multilanguage characters" data-og-width="1757" width="1757" data-og-height="595" height="595" data-path="img/partitioning/Hiragana-Hi-Res.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Hi-Res.png?w=280&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=60e2264ab02098e96f8118d47095a384 280w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Hi-Res.png?w=560&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=8d016cf6bbf94ed6ee784693a843e33f 560w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Hi-Res.png?w=840&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=b19b25d83404a4aceafe7beb2952e59f 840w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Hi-Res.png?w=1100&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=3b5ba9d55a2b5dd77f80d8bbb18111d4 1100w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Hi-Res.png?w=1650&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=6f73369d7e5ea82ec1c72d93354f59a2 1650w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-Hi-Res.png?w=2500&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=1e88de70f1e11a58af5bb9d38b84aec4 2500w" />

The **VLM** strategy can produce great results for handwriting, such as this example that uses GPT-4o by OpenAI:

<img src="https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-VLM-GPT-4o.png?fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=33e3f3f255fc1a1f710acc83c7b236f1" alt="The VLM strategy can process handwriting well" data-og-width="1822" width="1822" data-og-height="822" height="822" data-path="img/partitioning/Handwriting-VLM-GPT-4o.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-VLM-GPT-4o.png?w=280&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=fd7df9d589b08020db8826e4e8cd0496 280w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-VLM-GPT-4o.png?w=560&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=01bb5ed187f3ee4297a43cfe451b1d82 560w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-VLM-GPT-4o.png?w=840&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=3b9ef4af7c204274a0f8caf9e048f4bd 840w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-VLM-GPT-4o.png?w=1100&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=0bd35da33591150ed143a876d5875f64 1100w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-VLM-GPT-4o.png?w=1650&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=300b14d46aebb4414d8c044906886b61 1650w, https://mintcdn.com/unstructured-53/ognmPfo7rw6i-YTz/img/partitioning/Handwriting-VLM-GPT-4o.png?w=2500&fit=max&auto=format&n=ognmPfo7rw6i-YTz&q=85&s=ea779038b35dc382f5e7576af6b4d49b 2500w" />

The **VLM** strategy also has great support for recognizing multilanguage characters, such as this example that uses GPT-4o by OpenAI to recognize Japanese hiragana characters:

<img src="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-VLM.png?fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=40075cefc175dd36a8024fbb6a6605cc" alt="The VLM strategy can process Japanese hiragana well" data-og-width="1770" width="1770" data-og-height="718" height="718" data-path="img/partitioning/Hiragana-VLM.png" data-optimize="true" data-opv="3" srcset="https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-VLM.png?w=280&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=098c48a739fea0282b14e9f82ea040f6 280w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-VLM.png?w=560&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=d8b22a4ec38cbb9f55d64394ef4c36b4 560w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-VLM.png?w=840&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=fa7a3a76f3a21440a81da0380f1a4dca 840w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-VLM.png?w=1100&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=10c4cfd545512220c75be028eab55f71 1100w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-VLM.png?w=1650&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=1c8ebf851e3358d93a20dda0b7c70697 1650w, https://mintcdn.com/unstructured-53/lzJ4hi3NwruEhqBQ/img/partitioning/Hiragana-VLM.png?w=2500&fit=max&auto=format&n=lzJ4hi3NwruEhqBQ&q=85&s=5d07fdab65855691532655daad6d0667 2500w" />

## Supported languages

**Fast** partitioning accepts any text inputs, though automatic language detection of those inputs is restricted to [langdetect](https://pypi.org/project/langdetect/).

**High Res** partitioning leverages Tesseract OCR. For the list of languages that Tesseract supports, see:
[Languages/Scripts supported in different versions of Tesseract](https://tesseract-ocr.github.io/tessdoc/Data-Files-in-different-versions.html).

Language support for **VLM** depends on the model used. The list of supported languages for a particular model is maintained by
that model's provider. For the list of languages that each model supports, see the following, where provided:

* Anthropic:

  * [Claude](https://docs.anthropic.com/en/docs/build-with-claude/multilingual-support)

* OpenAI

  * [GPT](https://help.openai.com/en/articles/8357869-how-to-change-your-language-setting-in-chatgpt#h_513834920e)

* Amazon Bedrock

  * [Claude](https://aws.amazon.com/bedrock/claude/)
  * [Nova](https://aws.amazon.com/ai/generative-ai/nova/)
  * [Llama](https://aws.amazon.com/bedrock/llama/)

* Vertex AI

  * [Gemini](https://cloud.google.com/vertex-ai/generative-ai/docs/models#expandable-1)

## Learn more

* <Icon icon="blog" />  [The Case for HTML as the Canonical Representation in Document AI](https://unstructured.io/blog/the-case-for-html-as-the-canonical-representation-in-document-ai)
* <Icon icon="video" />  [How to Extract Data from Complex Tables](https://unstructured.io/events/how-to-extract-data-from-complex-tables)
