We wrote a simple python script to convert pdf to docx using the python via .NET package.
When it runs on a mac, it works ok with the exception of all the text being within text boxes (a bug which was reported in a different topic in this forum)
but the appearance looks similar to how it looks in the pdf.
When we run it on linux there are many unwanted spaces added between letters in words.
So we get words like:
“A dditional T erms and Defi ned T erms”
instead of:
“Additional Terms and Defined Terms”
I’m attaching two screenshots, output of convert running on mac:
Screenshot 2026-03-29 at 12.28.44.png (47.8 KB)
and on linux:
Screenshot 2026-03-29 at 12.29.08.png (77.0 KB)