Let LLMs read your documents with speed and accuracy
Have you ever wondered how to get large language models (LLMs) to handle complex documents? While LLMs are undoubtedly useful for document QA or RAG, they cannot natively read and process document files. To bridge this gap, we need to convert these files into LLM-readable formats. Document parsing is the process that exactly does this—transforming complex document files into formats such as HTML and Markdown. As simple as it may sound, this is not an easy problem, particularly due to the diversity of document layouts, table formats, and the variety of elements involved. However, if this is done right, we gain a significant boost in accuracy when processing documents with LLMs.
This is why Upstage has developed Document Parse, a document parser designed to enhance the performance of both RAG systems and LLMs. In enterprise environments, where parsing must meet critical demands—speed, accuracy, and usability—Document Parse excels by delivering high-quality results that ensure precise, reliable outputs from LLMs, even with complex documents.
Blazing fast with multi-page documents
Upstage Document Parse processes 100 pages in under a minute. Our tests show it is, on average, 10 times faster than competitors such as Unstructured and AWS Textract, and 4 times faster than LlamaParse, when processing documents between 50 and 100 pages with similar configuration settings.
Unmatched accuracy for the most difficult use cases
Upstage Document Parse outperforms both open-source and commercial alternatives, as demonstrated by our newly introduced benchmark, DP-Bench. According to this benchmark, Document Parse leads in accuracy metrics like TEDS (93.48) and TEDS-S (94.16), surpassing major competitors such as Google and Microsoft in document layout and table structure recognition by more than 5%.
Getting started
Getting started with Document Parse is simple.
1. Test with your own documents at our Playground
You can try out Document Parse with Playground UI on the Upstage Console for free. It provides HTML output for the first page of the input document, with each layout element highlighted.
2. Explore the API from Upstage Console
You can use Upstage Console to access Document Parse and send requests via curl or your preferred REST API client such as LangChain.
curl --location --request POST 'https://api.upstage.ai/v1/document-ai/document-parse' \\
--header 'Authorization: Bearer YOUR_API_KEY' \\
--form 'document=@"/path/to/file"'
|
cs |
3. Deploy on your own infrastructure from AWS Marketplace and Jumpstart
Document Parse is also available on the AWS Marketplace and Amazon SageMaker JumpStart. Visit the AWS Marketplace page for more details and a tutorial notebook to help you get started quickly and easily.
4. Explore more and partner with Upstage
For customers who want to install and run the model on their bare-metal servers, please contact our business partners for the on-premise solution.
Document Parse makes it easy for customers to use LLMs for document search and processing, and we're always working on making it better. Our upcoming updates will focus on inference speed, chart recognition, and security compliance. Please follow us on LinkedIn or X to stay informed about the latest updates.
Learn more
Also try out
Solar Pro Preview: The state-of-the-art model designed to deliver exceptional performance on a single GPU.
Solar DocVision Preview: A vision LLM specialized on documents.
Building Tomorrow’s Solutions Today
Talk to AI expert to find the best solution for your business.