We were able to reproduce this issue at our end. We’ve logged it in our issue tracking system with ID SEARCHNET-2350 for further investigation and resolution. You’ll be notified as soon as there is an update.
UseRawTextExtraction is enabled by default. Please set it to false to resolve the issue.
IndexSettings settings = new IndexSettings();
settings.UseRawTextExtraction = false;
Index index = new Index(indexFolder, settings);
We’d also recommend always setting UseRawTextExtraction to false. This improves the formatting of the extracted text for some documents (for example, the document you shared).
I think version 20.6 has UseRawTextExtraction.
If I update the Search DLL to 20.6, I get a license issue with some other DLL. dll update issue.jpg (181.3 KB)
Please follow these steps to resolve the issue mentioned earlier:
Update all GroupDocs APIs (that you are using in the project) to the latest versions
In every project file (.csproj) that references the APIs, add the setting <AutoGenerateBindingRedirects>false</AutoGenerateBindingRedirects> (or change it to false if the setting is already present)
Delete the runtime section in the app.config file
Delete obj/ and bin/ folders in the project directories
Rebuild all projects
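For reference, the .csproj setting from the steps above might look like the fragment below. This is only a sketch: it assumes the element goes inside a PropertyGroup, which is where MSBuild properties are normally declared. Your project file may already have a suitable PropertyGroup, in which case you can add the line there instead of creating a new one.

```xml
<!-- Sketch: place inside each .csproj that references the GroupDocs APIs. -->
<!-- Assumption: a new PropertyGroup is used here; an existing one works too. -->
<PropertyGroup>
  <!-- Stops MSBuild from generating assembly binding redirects automatically -->
  <AutoGenerateBindingRedirects>false</AutoGenerateBindingRedirects>
</PropertyGroup>
```

After saving the change, continue with the remaining steps (remove the runtime section from app.config, delete obj/ and bin/, then rebuild) so that no previously generated redirects are left over.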
We prepared a sample application for you - SetLicenses.zip (5.4 KB). Please download and build it. Let us know if it resolves the issue.
We couldn’t reproduce this exception. Have a look at this screenshot.PNG (47.0 KB). Please download this project, add the missing DLL references, and let us know whether you can reproduce the same exception (please share a screencast).