Do you have long document images that you want to process with generative AI but can't find the right solution? You're not alone. Many industries rely on vertically long images, particularly in retail and e-commerce across Korea, Japan, and China, where product descriptions often span thousands of pixels in length.

However, when our customers benchmarked various document processing solutions, they encountered two major issues:

  1. Many products simply do not support long images.
  2. Those that do often suffer from a drastic drop in quality compared to standard-sized images.

You asked, and we delivered.

Long image parsing with Upstage Document Parse

With document-parse-250404, Upstage Document Parse now supports extremely long image processing—achieving a 38.596% improvement in accuracy over the previous version.

How our customers are using this feature

With the upgraded Document Parse, our customers go beyond simple document parsing by leveraging their choice of LLM (ideally Solar 🙂). Since the resulting HTML is highly accurate, they can extract high-quality key-values with ease. Here’s an example of a common workflow:

At Upstage, we know that the document universe is vast, with countless edge cases that existing solutions struggle to handle. Do you have a pain point when processing your documents? We’re here to listen.

Contact us to share your challenges, or try the latest Document Parse yourself in our playground.

Struggling to process loooooooong document images with Generative AI?

Minjee Kang
Minjee Kang
Minjee Kang
Minjee Kang
Products
April 15, 2025
Struggling to process loooooooong document images with Generative AI?

Share

We build intelligence for the future of work—now it’s your turn.

Start building with our API or talk to our team.

Share

Do you have long document images that you want to process with generative AI but can't find the right solution? You're not alone. Many industries rely on vertically long images, particularly in retail and e-commerce across Korea, Japan, and China, where product descriptions often span thousands of pixels in length.

However, when our customers benchmarked various document processing solutions, they encountered two major issues:

  1. Many products simply do not support long images.
  2. Those that do often suffer from a drastic drop in quality compared to standard-sized images.

You asked, and we delivered.

Long image parsing with Upstage Document Parse

With document-parse-250404, Upstage Document Parse now supports extremely long image processing—achieving a 38.596% improvement in accuracy over the previous version.

How our customers are using this feature

With the upgraded Document Parse, our customers go beyond simple document parsing by leveraging their choice of LLM (ideally Solar 🙂). Since the resulting HTML is highly accurate, they can extract high-quality key-values with ease. Here’s an example of a common workflow:

At Upstage, we know that the document universe is vast, with countless edge cases that existing solutions struggle to handle. Do you have a pain point when processing your documents? We’re here to listen.

Contact us to share your challenges, or try the latest Document Parse yourself in our playground.

Do you have long document images that you want to process with generative AI but can't find the right solution? You're not alone. Many industries rely on vertically long images, particularly in retail and e-commerce across Korea, Japan, and China, where product descriptions often span thousands of pixels in length.

However, when our customers benchmarked various document processing solutions, they encountered two major issues:

  1. Many products simply do not support long images.
  2. Those that do often suffer from a drastic drop in quality compared to standard-sized images.

You asked, and we delivered.

Long image parsing with Upstage Document Parse

With document-parse-250404, Upstage Document Parse now supports extremely long image processing—achieving a 38.596% improvement in accuracy over the previous version.

How our customers are using this feature

With the upgraded Document Parse, our customers go beyond simple document parsing by leveraging their choice of LLM (ideally Solar 🙂). Since the resulting HTML is highly accurate, they can extract high-quality key-values with ease. Here’s an example of a common workflow:

At Upstage, we know that the document universe is vast, with countless edge cases that existing solutions struggle to handle. Do you have a pain point when processing your documents? We’re here to listen.

Contact us to share your challenges, or try the latest Document Parse yourself in our playground.

The 90-Day path to
Underwriting Reinvention

See how Fortune 500 companies eliminate the bottleneck where 70% of submissions arrive incomplete.
1,000+
Submissions Analyzed
90
Days to Transform

Download the White Paper

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Make your first API call in 3 minutes.

Open the console and run the Quickstart for chat, extract, and embed

Building tomorrow’s solutions today

Talk to AI expert to find the best solution for your business.