PDF causing issues

CMorton · January 29, 2025, 8:48pm

We are using GroupDocs.Parser (24.11.0) to extract text from documents, and this one throws a UnsupportedDocumentFormatException when loading through a stream, like we would for any other document.

ACA-MarketersPulse-201310-EN.pdf (638.9 KB)
ACA-MarketersPulse-201310-EN.pdf (639 KB)

atir.tahir · January 29, 2025, 9:07pm

@CMorton

Could you please share the sample code as well? We couldn’t reproduce this issue at our end using this code - Extract text from documents. Take a look at this image (19.7 KB).

CMorton · January 30, 2025, 2:06pm

See below a version that works, and one that doesn’t. We are using Streams everywhere in the app for performance concerns, and cant realistically shift to flat loading like below.

// The below is not working

await using var stream = File.OpenRead(“/Users/jp.lavoie/Downloads/ACA-MarketersPulse-201310-EN.pdf”);

using var parser = new Parser(stream);

// The below is working

using var parser = new Parser(“/Users/jp.lavoie/Downloads/ACA-MarketersPulse-201310-EN.pdf”);

atir.tahir · January 30, 2025, 8:25pm

@CMorton
This issue is reproduced at our end. Therefore, we have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): PARSERNET-2619

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.

CMorton · February 19, 2025, 2:08pm

Hi Atir, has there been any movement on this issue?

Thanks,
Craig

atir.tahir · February 19, 2025, 7:12pm

@CMorton

We are still working on this ticket.

atir.tahir · February 21, 2025, 10:20am

@CMorton

This issue has been resolved in API version 25.2, which is scheduled for release this month. We will notify you as soon as it becomes available for download.

aspose.notifier · February 26, 2025, 8:42pm

The issues you have found earlier (filed as PARSERNET-2619) have been fixed in this update. This message was posted using Bugs notification tool by atir.tahir