Hi,
I have some small problems with GroupDocs.Parser.
When I search the attached file for “link”, the highlight text in the results contains some strange Unicode characters.
See screenshot from Visual Studio:
image.png (47.2 KB)
The highlights contain at least “\u0013”, “\u0014” and “\u0015”, although these characters are not present in the original document.
sample.zip (451 Bytes)
Code snippet used for searching:
using (Parser parser = new Parser(documentPath))
{
    // Limit each highlight (text before and after a match) to 40 characters.
    HighlightOptions options = new HighlightOptions(40);
    IEnumerable<SearchResult> sr = parser.Search("link", new SearchOptions(false, false, false, true, options, options));
}
Additionally, the search returns more matches for “link” than expected, because it also matches the “HYPERLINK” keyword, which is not visible in the document and seems to be added by GroupDocs.Parser.
Is there any way to control that behavior, or to filter out those results?
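For context, a possible stopgap on the calling side is sketched below. It rests on an assumption that is not confirmed for this file: U+0013, U+0014 and U+0015 are the marks Word uses for field begin, field separator and field end, so the extra characters and the “HYPERLINK” keyword may be a field code leaking into the extracted text. The class `FieldCodeFilter` and the method `StripFieldCodes` are hypothetical names made up for this illustration and are not part of GroupDocs.Parser. The helper only works on plain strings: it drops everything between a field-begin mark and the following separator, and keeps the visible field result.

```csharp
using System;
using System.Collections.Generic;
using System.Text;

// Hypothetical helper, not part of GroupDocs.Parser.
public static class FieldCodeFilter
{
    private const char FieldBegin = '\u0013';
    private const char FieldSeparator = '\u0014';
    private const char FieldEnd = '\u0015';

    // Removes field codes (the text between FieldBegin and FieldSeparator)
    // and the three marker characters, keeping the visible field result.
    public static string StripFieldCodes(string text)
    {
        var result = new StringBuilder(text.Length);
        // One entry per open field: true while we are still in its code part.
        var inCode = new Stack<bool>();
        // Number of open fields that are currently in their code part.
        int codeDepth = 0;

        foreach (char c in text)
        {
            switch (c)
            {
                case FieldBegin:
                    inCode.Push(true);
                    codeDepth++;
                    break;
                case FieldSeparator:
                    // Switch the innermost field from "code" to "result".
                    if (inCode.Count > 0 && inCode.Peek())
                    {
                        inCode.Pop();
                        inCode.Push(false);
                        codeDepth--;
                    }
                    break;
                case FieldEnd:
                    // Close the innermost field; it may have had no separator.
                    if (inCode.Count > 0 && inCode.Pop())
                    {
                        codeDepth--;
                    }
                    break;
                default:
                    // Keep the character only if no enclosing field is in its code part.
                    if (codeDepth == 0)
                    {
                        result.Append(c);
                    }
                    break;
            }
        }

        return result.ToString();
    }
}
```

For example, `FieldCodeFilter.StripFieldCodes("See \u0013 HYPERLINK \"http://example.com\" \u0014this link\u0015 here")` returns `"See this link here"`. A 40-character highlight can start or end in the middle of a field, in which case the marks are unbalanced and this sketch cannot tell code from visible text, so it would be more reliable on the full extracted text than on the highlight fragments.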
Best regards,
Clemens