PDF causing issues

We are using GroupDocs.Parser (24.11.0) to extract text from documents, and this PDF throws an UnsupportedDocumentFormatException when loaded through a stream, which is how we load every other document.

ACA-MarketersPulse-201310-EN.pdf (638.9 KB)

@CMorton

Could you please share the sample code as well? We couldn’t reproduce this issue on our end using the code from the "Extract text from documents" example. Take a look at this image (19.7 KB).

See below a version that works and one that doesn’t. We use streams everywhere in the app for performance reasons, and can’t realistically switch to loading from a file path as in the working version below.

// The below is not working

await using var stream = File.OpenRead("/Users/jp.lavoie/Downloads/ACA-MarketersPulse-201310-EN.pdf");

using var parser = new Parser(stream);

// The below is working

using var parser = new Parser("/Users/jp.lavoie/Downloads/ACA-MarketersPulse-201310-EN.pdf");


@CMorton
We have reproduced this issue on our end. We have therefore opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): PARSERNET-2619

You can obtain Paid Support Services if you need support on a priority basis, along with direct access to our Paid Support management team.

Hi Atir, has there been any movement on this issue?

Thanks,
Craig

@CMorton

We are still working on this ticket.

@CMorton

This issue has been resolved in API version 25.2, which is scheduled for release this month. We will notify you as soon as it becomes available for download.