Merged PDF documents missing Accessibility standards

Hi @AlekseiSemenchenko , i tried your code as well with merger 25.7. It seems still same accessibility failures for me.

Hi,

To ensure we’re analyzing the same scenario and understand why you’re seeing different results with the same product version,
let’s align our source code and environments using the shared GitHub repository.

Please download and run the test project from the provided repository on your setup.

Before executing, make sure to copy the following input files with exact file names into the /SampleFiles/pdf folder:

  • Condizioni_Generali_Fornitura_Estra.pdf
  • Informativa_Privacy_Qualita_Commerciale.pdf
  • Lettera_Accompagnamento_Plico_Contr.pdf
  • Modulo_Dati_Catastali_Estra.pdf
  • Modulo_Reclamo_Estra.pdf
  • Modulo_Ripensamento_Estra.pdf
  • Proposta_Fornitura_Estra_Luce.pdf

We did not include them into public repository due to possible sensitive information.

Please note: the first source file passed to the constructor (e.g., Condizioni_Generali_Fornitura_Estra.pdf)
heavily influences the accessibility structure of the resulting document:

Merger merger = new Merger(firstPdfInputStream);

If this initial file contains accessibility issues, they may be propagated into the output file.

The two errors you mentioned are actually present in the original input files and were therefore carried over into the final merged result.
In your earlier attempt, we suspect the mismatch occurred due to either:

  • an outdated or mismatched library version
  • differences in input files (e.g., the previously generated output had 19 pages instead of 18), suggesting different source content or structure.

Looking ahead, the upcoming version 25.8 will include an optional automatic correction mechanism designed to detect and fix such structural issues during processing.
This feature should improve stability when dealing with input files that contain known accessibility or structure problems.

Let me know once you’ve run the test — we’ll be able to compare results more accurately then.

Thanks!

UPD Please share as much details about your environment as possible

  • what is environment OS,
  • Java version

hey @AlekseiSemenchenko
I used same code as provided by you with same sequence of files. still the result is not as you mentioned. It is retruning me same as above response (2).pdf
Regarding env details:

  • what is environment OS : Mac and linux
  • Java version: 17

Is it possible to have a joint call and let’s get it checked together once?

hey @AlekseiSemenchenko : I am able to merge file with same output as you. But somehow we feel relying on first file for meeting the max accessibility is not the right strategy. It should consider max of all files.

Hi @pankajgupta!

You’re correct — the current fix in version 25.7 is not the ideal long-term approach. Our priority was to provide you with a quick solution as soon as possible.

For the upcoming 25.8 release, we’re working on a more general solution that will preserve accessibility artifacts from all source pages, regardless of which files are merged, to achieve the highest possible accessibility compliance. As part of this effort, we’ll also verify whether the PDF standards allow transferring these accessibility artifacts from the source documents to the corresponding pages in the destination document.

We’ll keep you updated as we make progress.

Thank you for your feedback!

Hi @yuriy.mazurchuk , @AlekseiSemenchenko ,
We are encounttering :
error: error reading bazel-out/k8-fastbuild/bin/external/com/com/groupdocs/groupdocs-merger/25.7/header_groupdocs-merger-25.7.jar; Unsupported size: 16667311 for JarEntry META-INF/MANIFEST.MF. Allowed max size: 16000000 bytes. You can use the jdk.jar.maxSignatureFileSize system property to increase the default value.

similar to: Groupdocs comparison jar SignatureFileSize issue
Can you help us on it, we aren’t able to consume this version due to it’s size.

Hi @pankajgupta,

Thanks for flagging this. We’ve published 25.7.1, which resolves the

Unsupported size … META-INF/MANIFEST.MF (max 16000000)

error. Please update to 25.7.1 and let us know if everything works on your side.

Thank you @AlekseiSemenchenko for the quick response. I have verified in local for accessibility in 25.7.1 and it seems no side effect. We will pull it to dev env and share the feedback. thanks

1 Like

hey @AlekseiSemenchenko : When can we expect 25.8 ?

Hello, within two weeks

hey @AlekseiSemenchenko : Can you please confirm on date, it’s yet not released.
cc: @amitdash

Tentatively September 14–20. We are putting in maximum effort to release as quickly as possible.