[FEATURE] Concurrent page extraction #361

gunnsth · 2020-05-23T19:03:23Z

Is your feature request related to a problem? Please describe.
Currently, extraction only supports processing pages one by one. It might be more efficient to use multiple goroutines so that pages are processed concurrently.

Describe the solution you'd like
Explore the easiest way to support concurrency in the extractor package.

Describe alternatives you've considered
The alternative, and currently the best way to get concurrency, is on a per-document basis, i.e. one goroutine handling a single document.
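For illustration, the per-document approach could look roughly like the sketch below. It is a minimal worker pool, not code from the library: `extractFunc`, `result`, and `processDocuments` are hypothetical names, and the extractor passed in `main` is a placeholder standing in for whatever per-document extraction call is actually used.

```go
package main

import (
	"fmt"
	"sync"
)

// extractFunc is a hypothetical stand-in for a per-document extraction
// routine. It is an assumption for this sketch, not a library API.
type extractFunc func(path string) (string, error)

// result holds the outcome of extracting one document.
type result struct {
	Text string
	Err  error
}

// processDocuments runs extract over all paths using at most `workers`
// goroutines. Each goroutine handles one whole document at a time.
func processDocuments(paths []string, workers int, extract extractFunc) map[string]result {
	if workers < 1 {
		workers = 1
	}
	jobs := make(chan string)
	results := make(map[string]result, len(paths))
	var mu sync.Mutex // guards results
	var wg sync.WaitGroup

	for i := 0; i < workers; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for path := range jobs {
				text, err := extract(path)
				mu.Lock()
				results[path] = result{Text: text, Err: err}
				mu.Unlock()
			}
		}()
	}

	// Feed the documents to the workers, then wait for them to finish.
	for _, p := range paths {
		jobs <- p
	}
	close(jobs)
	wg.Wait()
	return results
}

func main() {
	// Placeholder extractor; a real one would open the PDF and
	// extract its pages serially.
	placeholder := func(path string) (string, error) {
		return "text of " + path, nil
	}
	res := processDocuments([]string{"a.pdf", "b.pdf", "c.pdf"}, 2, placeholder)
	fmt.Println(len(res), "documents processed")
}
```

This keeps each document's pages on a single goroutine, so it only helps when there are many documents; a single 900+ page document would still be processed serially.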

Additional context
Client's comment

We often deal with documents that are 900+ pages, and serially processing these with Unidoc was taking a long time and thus costing a lot of money in AWS expenses.

gunnsth added the extract, performance, and feature labels on Jun 2, 2020
