Document with more than one pages render to one HTML-File

i.kunz · July 28, 2016, 2:35pm

Basically GroupDocs.Viewer created for each Word or PDF-Document that contains more than one page, one html file.

How can I create one html file with serveral pages (1:1 to Dokument) ?

For example, I have a doc-document that contains 10 pages. That document schould be render to ONE html file that contains 10 pages.

usman.aziz · July 28, 2016, 6:32pm

Hi there,

Thanks for using GroupDocs.Viewer API.

You can do this manually by appending html content of all pages in a single html file. Below is a sample code snippet to achieve this functionality.

string outputHtmlContent = “”;

foreach (PageHtml page in pages)

{

outputHtmlContent += page.HtmlContent;

}

File.AppendAllText(“D:\output.html”, outputHtmlContent);

In case of any confusion, please feel free to ask.

Warm Regards

i.kunz · July 29, 2016, 1:45pm

thx for the answer, its work.

My next question is, how i can limit the number of pages for rendering.

For example, I have document with 20 or more pages and I want to render only first 10 pages of that document.

mike.kingscott · November 2, 2016, 10:32am

Hi there,

With the example code I've used, this approach creates an output which contains many HTML and HEAD tags - the document still renders in a browser, but is there a way to stop the multiple tags?

I'm just getting pages out of local filestream via htmlHandler.GetPages(fs), and this also creates several @font-face declarations in each page - do those need to be accounted for? I'd rather not extract them to supporting files like the ASP.Net MVC Front End showcase project does.

Thank you,

Mike Kingscott

usman.aziz · July 29, 2016, 5:33pm

Hi there,

Yes, you can achieve this functionality by providing range of the document pages to convert in HtmlOptions or ImageOptions. You can do this using following code

HtmlOptions options = new HtmlOptions();

options.PageNumbersToConvert = Enumerable.Range(1, 10).ToList();

List pages = htmlHandler.GetPages(guid, options);

Try above mentioned code and share your feedback.

Have a nice weekend ahead.

usman.aziz · November 2, 2016, 3:15pm

Hi Mike,

Thanks for using GroupDocs.Viewer for .NET.

GroupDocs.Viewer for .NET does not provide any builtin method to escape from multiple tags in case of merging HTML pages into single page. However, you can use your own logic to extract the child elements of all head tags, trim/remove head tags from HTML content of all pages and then add a single head tag in final html page (containing body of all HTML pages) that will contain the child elements extracted at first step. You can play with HTML content of each page using following code:

foreach (PageHtml page in pages)

{

string outputHtmlContent = page.HtmlContent;

// you can use outputHtmlContent to perform your required operations on HTML content

}

Secondly, when you use embedded resources then all the resources (including font faces) get embedded in HTML pages. However, when you do not use embedded resources then the API creates a separate directory named “resources” containing all the resources in it and adds reference of these resources in HTML page. Following code snippet shows how to set embedded resources.

//Instantiate the HtmlOptions object

HtmlOptions options = new HtmlOptions();

//to get html representations of pages with embedded resources

options.IsResourcesEmbedded = false;

List pages = htmlHandler.GetPages(guid, options);

If anything is not yet clear, please feel free to tell us.

Warm Regards

i.kunz · August 1, 2016, 7:46am

hi,

thx for answer.

If I use the options with Range(1,10) this is works only for file which contains more than 10 pages. If the file has less than 10 pages, i get an exception.

I resolve this problem in wich I use the function:

HtmlOptions options = new HtmlOptions()

options.CountPagesToConvert = 10;

mike.kingscott · November 2, 2016, 3:42pm

Hello Usman,

Thank you for the swift reply. I figured I’d have to play about with the HTML as your example shows, and thank you for the clarification on the resources.

I do have another question, but I need to write some sample code first.

Thanks again,

Mike K.

usman.aziz · August 1, 2016, 9:28am

Hi there,

Its good to know that you have resolved your issue.

You can also get complete information about a document, containing number of pages, before rendering that document. This will help you to escape from any exception. For more details, please see Working with Document Information.

Warm Regards

usman.aziz · November 2, 2016, 3:51pm

Hi Mike,

Sure, you are always welcome to post your questions here. We shall be looking forward to hear from you.

Warm Regards