chunk
Subtype: chunk_by_table_merging
Usage guidance
Use this strategy when documents contain long tables that span multiple pages. It only acts onTable elements with a metadata.text_as_html field on adjacent pages. It pairs well with a table2html enrichment earlier in the workflow.
Settings
The LLM provider to use for merge decisions. Allowed values:
anthropic, openai, bedrock, vertexai. Default: none.The model name for the selected provider. Defaults to the provider’s default. For a full list of the models available in Unstructured, see Available models.
Minimum LLM confidence required to merge a table pair. Allowed values:
low, medium, high. Default: medium.Maximum number of merge requests to run at once. Minimum:
1. Default: 4.
