Tables to HTML
After partitioning and chunking, you can have Unstructured generate representations of each detected table in HTML markup format.
This table-to-HTML output is done by using GPT-4o, provided through OpenAI.
Here is an example of the HTML markup output of a detected table using GPT-4o. Note specifically the text_as_html
field that is added.
Line breaks have been inserted here for readability. The output will not contain these line breaks.
Generate table-to-HTML output
To generate table-to-HTML output, in the Task drop-down list of an Enrichment node in a workflow, select Table to HTML.
You can change a workflow’s table description settings only through Custom workflow settings.
Table-to-HTML output is generated only when the Partitioner node in a workflow is set to use the High Res partitioning strategy. Learn more.
Was this page helpful?