Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.unstructured.io/llms.txt

Use this file to discover all available pages before exploring further.

Learn about the available chunking strategies.
NodeDescription
Chunk by CharacterCombines elements within hard and soft character limits, without using section or page boundaries.
Chunk by TitlePreserves section boundaries by starting a new chunk at each Title element.
Chunk by PagePreserves page boundaries, ensuring no chunk spans more than one page.
Chunk by SimilarityGroups topically similar consecutive elements into chunks using an embedding model.