Free Support Forum - groupdocs.com

PDF to HTML conversion issue in .NET

Hi I used Groupdocs.conversion for converting pdf,.docx files to .html format
string inDirectory = Path.Combine(Properties.Settings.Default.DocumentStorageFolderPath, document.Location);

using (Converter converter = new Converter(inDirectory))
{
MarkupConvertOptions options = new MarkupConvertOptions
{
FixedLayout = true
};

          converter.Convert(Path.Combine(Properties.Settings.Default.DocumentStagingFolderPath, "search", "Html", convertedFileName + ".html"), options);
            }

by using this conversion its not converting properly…
i will attach the original file,html file for your reference.Venkatesh.pdf (251.3 KB)

Converted html file
image.png (251.2 KB)

html file in code view…look at the selected span - Primary Contact .
upload.jpg (188.0 KB)

the word Contact is seperated by conta and ct…

@bharathiGK,

Thank you for taking interest in GroupDocs.Conversion for .NET.

Is this the only issue you are facing? When we inspect element “Primary Contact”, the word Contact is separated. But it is rending as a collective word.

Yes.i noticed for this word alone. actually i’m applying tag (mark) for that Word Contact ,in that case due to seperation of word “conta” and “ct” couldn’t able to apply

1 Like

@bharathiGK,

This issue is reproduced at our end. Hence, we are now investigating it. Your investigation ticket ID is CONVERSIONNET-3851. As there’s any update, you’ll be notified.

Okay.Thank you

@bharathiGK,

You are welcome.