We are using GroupDocs.Parser (24.11.0) to extract text from documents, and this one throws an UnsupportedDocumentFormatException when loaded through a stream, as we would load any other document.
Could you please share the sample code as well? We couldn’t reproduce this issue on our end using this code: Extract text from documents. Take a look at this image.
Below are a version that works and one that doesn’t. We use streams everywhere in the app for performance reasons, and can’t realistically switch to loading from a file path as shown below.
// The below is not working
await using var stream = File.OpenRead("/Users/jp.lavoie/Downloads/ACA-MarketersPulse-201310-EN.pdf");
using var parser = new Parser(stream);
// The below is working
using var parser = new Parser("/Users/jp.lavoie/Downloads/ACA-MarketersPulse-201310-EN.pdf");
@CMorton
We have reproduced this issue on our end. We have therefore opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.
Issue ID(s): PARSERNET-2619
You can obtain Paid Support Services if you need support on a priority basis, along with direct access to our Paid Support management team.
This issue has been resolved in API version 25.2, which is scheduled for release this month. We will notify you as soon as it becomes available for download.
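Until the fixed version is available, one possible interim workaround is to copy the stream to a temporary file and open it by path. The thread only confirms that the path-based constructor works for this document, so this is a minimal sketch under that assumption. TempFileWorkaround and CopyToTempFileAsync are hypothetical names, not part of GroupDocs.Parser, and the extra copy costs some of the performance benefit of using streams.

```csharp
using System;
using System.IO;
using System.Threading.Tasks;

public static class TempFileWorkaround
{
    // Hypothetical helper (not part of GroupDocs.Parser): copies the remaining
    // contents of a stream to a uniquely named temporary file and returns its path.
    // The caller is responsible for deleting the file afterwards.
    public static async Task<string> CopyToTempFileAsync(Stream source, string extension = ".pdf")
    {
        string path = Path.Combine(Path.GetTempPath(), Guid.NewGuid().ToString("N") + extension);
        await using (var target = File.Create(path))
        {
            // Copies from the stream's current position to its end.
            await source.CopyToAsync(target);
        }
        return path;
    }
}

// Usage sketch (requires the GroupDocs.Parser package):
//   string tempPath = await TempFileWorkaround.CopyToTempFileAsync(stream);
//   try { using var parser = new Parser(tempPath); /* extract text here */ }
//   finally { File.Delete(tempPath); }
```

The temporary file is deleted in a finally block in the usage sketch so that it does not linger if parsing throws.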