Upstage Document Parse:

Let LLMs read your documents with speed and accuracy

 
 

Have you ever wondered how to get large language models (LLMs) to handle complex documents? While LLMs are undoubtedly useful for document QA and RAG, they cannot natively read and process document files. To bridge this gap, we need to convert these files into LLM-readable formats. Document parsing does exactly this: it transforms complex document files into formats such as HTML and Markdown. Simple as it may sound, this is a hard problem, largely because of the diversity of document layouts, table formats, and element types involved. Done right, however, it significantly improves accuracy when processing documents with LLMs.

This is why Upstage has developed Document Parse, a document parser designed to enhance the performance of both RAG systems and LLMs. Enterprise environments place critical demands on parsing: speed, accuracy, and usability. Document Parse meets these demands, delivering high-quality results that enable precise, reliable outputs from LLMs, even with complex documents.

Blazing fast with multi-page documents

Upstage Document Parse processes 100 pages in under a minute. Our tests show it is, on average, 10 times faster than competitors such as Unstructured and AWS Textract, and 4 times faster than LlamaParse, when processing documents between 50 and 100 pages with similar configuration settings.

 
 
 

Unmatched accuracy for the most difficult use cases

Upstage Document Parse outperforms both open-source and commercial alternatives, as demonstrated by our newly introduced benchmark, DP-Bench. According to this benchmark, Document Parse leads in accuracy metrics like TEDS (93.48) and TEDS-S (94.16), surpassing major competitors such as Google and Microsoft in document layout and table structure recognition by more than 5%.
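
For readers unfamiliar with the metric, TEDS (Tree-Edit-Distance-based Similarity) compares a predicted table with its ground truth by treating both as HTML trees. The formula below is the definition commonly used in the table recognition literature; this post does not say whether DP-Bench applies any variation, so treat it as background rather than the exact benchmark protocol.

```latex
\[
\mathrm{TEDS}(T_a, T_b) \;=\; 1 - \frac{\mathrm{EditDist}(T_a, T_b)}{\max\bigl(|T_a|,\, |T_b|\bigr)}
\]
% T_a, T_b : predicted and ground-truth table trees
% EditDist : tree edit distance between the two trees
% |T|      : number of nodes in tree T
```

A score of 1 means the two trees are identical. TEDS-S is the structure-only variant, which ignores cell text and scores only the table layout. The figures quoted above appear to be reported on a 0 to 100 scale.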

 
 

Getting started

Getting started with Document Parse is simple.

1. Test with your own documents at our Playground

You can try out Document Parse for free with the Playground UI on the Upstage Console. It returns HTML output for the first page of the input document, with each layout element highlighted.

2. Explore the API from Upstage Console

You can use the Upstage Console to access Document Parse and send requests via curl, your preferred REST API client, or an integration such as LangChain.

curl --location --request POST 'https://api.upstage.ai/v1/document-ai/document-parse' \
--header 'Authorization: Bearer YOUR_API_KEY' \
--form 'document=@"/path/to/file"'
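
If you would rather call the API from code, the same request can be sketched in Python. This is a minimal illustration that uses only the standard library and mirrors the curl command: the endpoint, the Bearer header, and the `document` form field come from the example above, while the helper names `build_parse_request` and `parse_document` are hypothetical and the response is returned as raw text because its schema is not described here.

```python
import mimetypes
import os
import urllib.request
import uuid

# Endpoint taken from the curl example above.
API_URL = "https://api.upstage.ai/v1/document-ai/document-parse"


def build_parse_request(path: str, api_key: str) -> urllib.request.Request:
    """Build a multipart/form-data POST equivalent to the curl command.

    Hypothetical helper for illustration; not part of any Upstage SDK.
    """
    boundary = uuid.uuid4().hex
    filename = os.path.basename(path)
    content_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    with open(path, "rb") as f:
        payload = f.read()

    # A single form part named "document", matching --form 'document=@...'.
    body = b"".join([
        f"--{boundary}\r\n".encode(),
        f'Content-Disposition: form-data; name="document"; filename="{filename}"\r\n'.encode(),
        f"Content-Type: {content_type}\r\n\r\n".encode(),
        payload,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


def parse_document(path: str, api_key: str) -> str:
    """Send the request and return the raw response body as text.

    The response format is not specified in this post, so it is not parsed here.
    """
    with urllib.request.urlopen(build_parse_request(path, api_key)) as resp:
        return resp.read().decode("utf-8")
```

Calling `parse_document("/path/to/file", "YOUR_API_KEY")` sends the document and returns whatever the service responds with. Building the request separately from sending it keeps the request construction easy to inspect without a network call.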

3. Deploy on your own infrastructure from AWS Marketplace and Amazon SageMaker JumpStart

Document Parse is also available on the AWS Marketplace and Amazon SageMaker JumpStart. Visit the AWS Marketplace page for more details and a tutorial notebook to help you get started quickly and easily.

4. Explore more and partner with Upstage

For customers who want to install and run the model on their bare-metal servers, please contact our business partners for the on-premise solution.

Document Parse makes it easy for customers to use LLMs for document search and processing, and we're always working on making it better. Our upcoming updates will focus on inference speed, chart recognition, and security compliance. Please follow us on LinkedIn or X to stay informed about the latest updates.

 
