Strange unicode characters in highlight text .NET

Hi,

I have some small problems with GroupDocs.Parser.
When I search the attached file for “link”, the highlight text contains some strange Unicode characters.
See screenshot from Visual Studio:
image.png (47.2 KB)

There seem to be at least “\u0013”, “\u0014” and “\u0015”, although they are not present in the original document.
sample.zip (451 Bytes)

Code snippet used for searching:

    using (Parser parser = new Parser(documentPath))
    {
        // Up to 40 characters of context on each side of a match.
        HighlightOptions options = new HighlightOptions(40);
        IEnumerable<SearchResult> sr = parser.Search("link", new SearchOptions(false, false, false, true, options, options));
    }

Additionally, the search returns too many matches for “link”, because it also matches the “HYPERLINK” keyword, which seems to be added by GroupDocs.Parser.
Is there any way to control that behavior, or to filter out those results?

Best regards,
Clemens

@Clemens_Pestuka
We have opened the following new ticket(s) in our internal issue tracking system and will deliver their fixes according to the terms mentioned in Free Support Policies.

Issue ID(s): PARSERNET-2523

You can obtain Paid Support Services if you need support on a priority basis, along with the direct access to our Paid Support management team.
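
Until the ticket is resolved, one possible workaround is to clean the returned strings on the caller's side. The characters U+0013, U+0014 and U+0015 match the field begin, field separator and field end marks used in Word-format documents, and HYPERLINK is a field instruction, so the sketch below assumes the extracted text has the shape "begin, instruction, separator, visible text, end". That shape is an assumption about the sample file, not confirmed behavior of GroupDocs.Parser. `HighlightCleaner` and its methods are hypothetical helper names; they work on plain strings only and call no GroupDocs API.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical helper: removes Word-style field instructions from extracted text.
public static class HighlightCleaner
{
    // A field instruction: field-begin (U+0013) up to the next
    // field-separator (U+0014) or field-end (U+0015).
    private static readonly Regex FieldInstruction =
        new Regex("\u0013[^\u0013\u0014\u0015]*[\u0014\u0015]");

    // Any field mark left over after the instructions are removed.
    private static readonly Regex FieldMarks =
        new Regex("[\u0013\u0014\u0015]");

    // Returns the text with field instructions and field marks removed.
    public static string StripFieldCodes(string text)
    {
        if (string.IsNullOrEmpty(text))
        {
            return string.Empty;
        }

        string withoutInstructions = FieldInstruction.Replace(text, string.Empty);
        return FieldMarks.Replace(withoutInstructions, string.Empty);
    }

    // True if the keyword still occurs once field instructions are stripped,
    // i.e. the match is not only inside something like HYPERLINK "...".
    public static bool ContainsOutsideFieldCodes(string text, string keyword)
    {
        return StripFieldCodes(text)
            .IndexOf(keyword, StringComparison.OrdinalIgnoreCase) >= 0;
    }
}
```

A caller could pass each highlight string through `StripFieldCodes` before displaying it, and use `ContainsOutsideFieldCodes` on the surrounding text to discard matches that come only from a field instruction. The sketch does not handle nested fields, and it cannot detect an instruction whose begin mark was cut off by the highlight length limit.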