Delta Table
Connect Delta Table to your preprocessing pipeline, and batch process all your documents using unstructured-ingest
to
store structured outputs locally on your filesystem.
Make sure to have the Delta Table dependencies installed:
AWS credentials need to be available for use with the storage options.
Specify the to the DeltaTable using the table-uri
argument, and pass a dictionary of the options to use for the storage backend
via storage_options
.
Make sure to set the --partition-by-api
flag and pass in your API key with --api-key
:
Additionally, if you’re using Unstructured Serverless API, your locally deployed Unstructured API, or an Unstructured API
deployed on Azure or AWS, you also need to specify the API URL via the --partition-endpoint
argument.
Was this page helpful?