Update and save document content programmatically in C#

I was able to get links from word document from above code. Is it possible to get from PDF? code you shared does not work for pdf.
Also if i need to get text from any smartart, is that possible?

@kanuja01,

We are investigating this. You’ll be notified about the outcomes.

Could you please let me know, if i can be able to get links from PDF document?

1 Like

@kanuja01,

Text extraction from SmartArt and hyperlink extraction from PDF are planned to be implemented in some or the future releases (probably in 20.2). As release gets on-board or there is any further information, you’ll be notified.

Could you please let me know estimated completion date?

@kanuja01,

Release will be available by the end of February. We’ll keep you updated.

Do you know, how can i search through html files?

1 Like

@kanuja01,

Please clarify, either you want to extract HTML formatted text or perform search operations on HTML files?

Hi, Thanks for asking.

  1. How do i perform search operations on HTML file ?
  2. How do i extract HTML formatted text from file?
1 Like

@kanuja01,

If you have multiple HTML files and want to perform full-text search over them, please use GroupDocs.Search for .NET. We’d recommend you to explore this documentation and this example project. Have look at this article in order to create your first search solution.

However, using GroupDocs.Parser for .NET you can extract HTML formatted text. Please go through this article and this code snippet/project.