Skip to main content
After partitioning, you can have a vision language model (VLM) optimize the fidelity of text blocks that Unstructured initially processed during its partitioning phase. Here are a few examples of Unstructured’s output of text blocks that were initially processed, and the more accurate version of these text blocks that were optimized by using Claude Sonnet 4. Irrelevant lines of output have been omitted here for brevity. Before (vertical watermarked text, represented incorrectly):
{
    "...", "...",
    "text": "3 2 0 2 t c O 9 2 ] V C . s c [ 2 v 9 0 8 6 1 . 0 1 3 2 : v i X r",
    "...", "..."
}
After (vertical watermarked text, now represented correctly from the original content):
{
    "...", "...",
    "text": "arXiv:2310.16809v2 [cs.CV] 29 Oct 2023",
    "...", "..."
}
Before (hyperlink, represented incorrectly):
{
    "...", "...",
    "text": "con/Yuliang-Liu/MultinodalOCR|",
    "...", "..."
}
After (hyperlink, now represented correctly from the original content):
{
    "...", "...",
    "text": "https://github.com/Yuliang-Liu/MultimodalOCR",
    "...", "..."
}
Before (Chinese characters, represented incorrectly):
{
    "...", "...",
    "text": "GT SHE GPT4-V: EHES",
    "...", "..."
}
After (Chinese characters, now represented correctly from the original content, expressed as Unicode):
{
    "...", "...",
    "text": "GT : \u91d1\u724c\u70e7\u814a GPT4-V: \u6587\u9759\u5019\u9e1f",
    "...", "..."
}

Optimize text fidelity

To optimize text fidelity, in an Enrichment node in a workflow, select OCR, and then choose one of the available provider (and model) combinations that are shown.
You can change a workflow’s image description settings only through Custom workflow settings.For workflows that use chunking, the Chunker node should be placed after all Enrichment nodes. Placing the Chunker node before an image descriptions Enrichment node could cause incomplete or no image descriptions to be generated.
Unstructured can optimize text fidelity for workflows that are configured as follows:
  • With a Partitioner node set to use the Auto or High Res partitioning strategy, and a text fidelity optimization node is added.
  • With a Partitioner node set to use the VLM partitioning strategy. No text fidelity optimization node is needed (or allowed).
Unstructured never generates text fidelity optimizations for workflows with a Partitioner node set to use the Fast partitioning strategy.