Different exceptions when extracting from PDF files

Hi,

These are some exceptions we observed when extracting from PDF files:


System.IndexOutOfRangeException: Invalid index: index should be in the range [1…n] where n equals to the pages count


at Aspose.Pdf.Facades.PdfExtractor.GetAttachNames()

at \u000E\u0010\u0003.\u0002()

at \u0008\u0017\u0003.\u000F\u0010\u001A\u0003\u0016\u0002()

at GroupDocs.Parser.Parser.GetContainer()

at \u0002\u0019\u0002.\u0002(\u0005\u000F\u0002 \u0002, Parser \u0008)

at \u000F\u0005\u0002.\u000E\u0018\u001B\u0003\u0016\u0002()

at \u0008\u001B\u001B.\u0002(Document \u0002, \u0003\u0018\u001B \u0008, String \u0005)


System.ArgumentException: MaxValue can’t be less than StartValue


at \u0002\u0018\u0005..ctor(PdfExtractor \u0002, Encoding \u0003, Nullable`1 \u0005)

at \u0006\u0017\u0005.\u0008\u0017\u0003\u0017\u0005\u0017\u0015\u0002(Nullable`1 \u0002, TextOptions \u0003)

at GroupDocs.Parser.Parser.GetText(TextOptions options)

at \u0003\u0016\u000F.\u0002(Document \u0002, \u0006\u001A\u000F \u0003, String \u0005)


System.Exception: The root object missing or invalid


at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSCBWDYol5lAco7vso7s2DLhD5jZ4(Object #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zp0pNQamsvK4gnRp2$maJGLwEPBce(Object #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=ziaAJ6pNcXGHmqeGrZyBBDfxBtmU6VdovgA==(dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd #=zVfrUUmg=, #=qqbNWb9$La$ZAtTc1CKbv1Wka6vNNR2IvouqMYyHzN2Y= #=zLTZCRtc=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSiZaVB3teJjokkhVmijde22GTMsZ(Boolean #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSCBWDYol5lAco7vso7s2DLhD5jZ4(Object #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zp0pNQamsvK4gnRp2$maJGLwEPBce(Object #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zjdMpSOkGgWrmMBllNy_PpCQhllOFdVoTwc9j35Y=(Object #=zVfrUUmg=, UInt32 #=zLTZCRtc=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSiZaVB3teJjokkhVmijde22GTMsZ(Boolean #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSiZaVB3teJjokkhVmijde22GTMsZ(Boolean #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zsufQgOfLuwSYJ6F5h4qHtu4AL2rCk1X_4vtR_pAV4Fj7(Object[] #=zVfrUUmg=, Type[] #=zLTZCRtc=, Type[] #=zmnlhzPg=, Object[] #=zxLHi_io=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zcIW4Ugh41XT7ev1a4ckgs5gc$Kz7(Int32 #=zVfrUUmg=, Type[] #=zLTZCRtc=, Type[] #=zmnlhzPg=, Boolean #=zxLHi_io=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zyyuKPgNpeNf$LKLTIA==(dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd #=zVfrUUmg=, #=qqbNWb9$La$ZAtTc1CKbv1Wka6vNNR2IvouqMYyHzN2Y= #=zLTZCRtc=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSiZaVB3teJjokkhVmijde22GTMsZ(Boolean #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSCBWDYol5lAco7vso7s2DLhD5jZ4(Object #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zp0pNQamsvK4gnRp2$maJGLwEPBce(Object #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zjdMpSOkGgWrmMBllNy_PpCQhllOFdVoTwc9j35Y=(Object #=zVfrUUmg=, UInt32 #=zLTZCRtc=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSiZaVB3teJjokkhVmijde22GTMsZ(Boolean #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zsufQgOfLuwSYJ6F5h4qHtu4AL2rCk1X_4vtR_pAV4Fj7(Object[] #=zVfrUUmg=, Type[] #=zLTZCRtc=, Type[] #=zmnlhzPg=, Object[] #=zxLHi_io=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=ztWoUUTdbF6sSa07tvrAHFmg=(Stream #=zVfrUUmg=, String #=zLTZCRtc=, Object[] #=zmnlhzPg=)

at #=z6HQ73iZOcVn4IPseLrDIRTGKeDZ8Gl5Lew==..ctor(Stream #=zx7lIinA=, String #=zK7jLnng=, Boolean #=zfRAsW1Zp$VKV)

at \u0006\u0018\u0005.\u0002(Stream \u0002, LoadOptions \u0003)

at \u0006\u0017\u0005.\u0008\u0017\u0003\u0017\u0005\u0017\u0015\u0002()

at GroupDocs.Parser.Parser.GetMetadata()

at \u0003\u0016\u000F.\u0002(Document \u0002, \u0006\u001A\u000F \u0003, String \u0005)


System.Exception: Feature is not supported in .NET Core


at #=zc$r1XF6$TVjQPlmFMZtu3TiGG7spvljfPjqj$R$QcirrZ8aQpA==..ctor(#=zZPSDAujtrPZFucqWbP965s0CDSoMm_hBAytxt84= #=zKI_uO0Y=)

at #=zhrDTZYMf5rhW9GnaOvMvqNaKU9IRsUaU5FHg5kg=.#=zV8uSJUc=(#=zZPSDAujtrPZFucqWbP965s0CDSoMm_hBAytxt84= #=zKI_uO0Y=)

at #=zjwUOhxc_rLvALaWyTDLVTgoqh5I5CcrnNc8Jlh4=..ctor(#=zyg83mp0O1$UJCjKbStK_BR4O1dXqiL0GNBHHUlPkR_FT #=zyZOft4wl5jIhoBmEyw==, Byte[] #=zYJaV_eY=, Int32 #=zoUo3Mao=, String #=zK7jLnng=)

at #=zjwUOhxc_rLvALaWyTDLVTgoqh5I5CcrnNc8Jlh4=.#=zlPzFxu8=(#=zyg83mp0O1$UJCjKbStK_BR4O1dXqiL0GNBHHUlPkR_FT #=zyZOft4wl5jIhoBmEyw==, Byte[] #=zYJaV_eY=, String #=zK7jLnng=)

at #=zKDSFyDTmeLgVBk92BfoWR4$hPNA9.#=zEVG4d9c=()

at #=zCMf$hYfCwKXrKe18JbuiEhPplDxo9coXxHbLsz0=.#=z4BXI52o=(#=zyg83mp0O1$UJCjKbStK_BR4O1dXqiL0GNBHHUlPkR_FT #=zyZOft4wl5jIhoBmEyw==, #=zXYzFcIHn9L2w21uZp_Phlgdzp9orGpCUZA== #=zYkegzMNuC8pI, String #=zHGR3WzCI82EH)

at #=zY8VmwEed395Kos88jhibFldJcmrmLcF7oLmQoTk=.#=zPgnJs5xMD9tdrVUZKPWaUWf_8CPa9P80CAMX1bQNS294VLauEA==()

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=z4JcZxA7AykbfPL2bXwLR$LL6UHYpAM3JXiLD_5QW3QNE(MethodBase #=zVfrUUmg=, Object #=zLTZCRtc=, Object[] #=zmnlhzPg=, Boolean #=zxLHi_io=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=z22j0XGMTQooyTQX8exS9OMg=(MethodBase #=zVfrUUmg=, Boolean #=zLTZCRtc=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zIaUDGnoIbJIcUbFbzBQieeHYqigx(dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd #=zVfrUUmg=, #=qqbNWb9$La$ZAtTc1CKbv1Wka6vNNR2IvouqMYyHzN2Y= #=zLTZCRtc=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSiZaVB3teJjokkhVmijde22GTMsZ(Boolean #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSCBWDYol5lAco7vso7s2DLhD5jZ4(Object #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zp0pNQamsvK4gnRp2$maJGLwEPBce(Object #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zXqXgIW0DFCVmGb1huaZK_Vt6uRkuFoOWlg==(dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd #=zVfrUUmg=, #=qqbNWb9$La$ZAtTc1CKbv1Wka6vNNR2IvouqMYyHzN2Y= #=zLTZCRtc=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSiZaVB3teJjokkhVmijde22GTMsZ(Boolean #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSCBWDYol5lAco7vso7s2DLhD5jZ4(Object #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zp0pNQamsvK4gnRp2$maJGLwEPBce(Object #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zjdMpSOkGgWrmMBllNy_PpCQhllOFdVoTwc9j35Y=(Object #=zVfrUUmg=, UInt32 #=zLTZCRtc=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSiZaVB3teJjokkhVmijde22GTMsZ(Boolean #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSiZaVB3teJjokkhVmijde22GTMsZ(Boolean #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zsufQgOfLuwSYJ6F5h4qHtu4AL2rCk1X_4vtR_pAV4Fj7(Object[] #=zVfrUUmg=, Type[] #=zLTZCRtc=, Type[] #=zmnlhzPg=, Object[] #=zxLHi_io=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zcIW4Ugh41XT7ev1a4ckgs5gc$Kz7(Int32 #=zVfrUUmg=, Type[] #=zLTZCRtc=, Type[] #=zmnlhzPg=, Boolean #=zxLHi_io=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zyyuKPgNpeNf$LKLTIA==(dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd #=zVfrUUmg=, #=qqbNWb9$La$ZAtTc1CKbv1Wka6vNNR2IvouqMYyHzN2Y= #=zLTZCRtc=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSiZaVB3teJjokkhVmijde22GTMsZ(Boolean #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSCBWDYol5lAco7vso7s2DLhD5jZ4(Object #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zp0pNQamsvK4gnRp2$maJGLwEPBce(Object #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zjdMpSOkGgWrmMBllNy_PpCQhllOFdVoTwc9j35Y=(Object #=zVfrUUmg=, UInt32 #=zLTZCRtc=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zSiZaVB3teJjokkhVmijde22GTMsZ(Boolean #=zVfrUUmg=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=zsufQgOfLuwSYJ6F5h4qHtu4AL2rCk1X_4vtR_pAV4Fj7(Object[] #=zVfrUUmg=, Type[] #=zLTZCRtc=, Type[] #=zmnlhzPg=, Object[] #=zxLHi_io=)

at dje_qU5MS3LWVP3EEL39ULJ4KKC4K8Z2WFGMXPJMXH3HCU9VT476TEDBA_ejd.#=ztWoUUTdbF6sSa07tvrAHFmg=(Stream #=zVfrUUmg=, String #=zLTZCRtc=, Object[] #=zmnlhzPg=)

at #=z6HQ73iZOcVn4IPseLrDIRTGKeDZ8Gl5Lew==..ctor(Stream #=zx7lIinA=, String #=zK7jLnng=, Boolean #=zfRAsW1Zp$VKV)

at \u0006\u0018\u0005.\u0002(Stream \u0002, LoadOptions \u0003)

at \u0006\u0017\u0005.\u0008\u0017\u0003\u0017\u0005\u0017\u0015\u0002()

at GroupDocs.Parser.Parser.GetMetadata()

at \u0003\u0016\u000F.\u0002(Document \u0002, \u0006\u001A\u000F \u0003, String \u0005)

Hi @jamsharp!

I noticed that you posted several requests describing similar symptoms. As with the other topics, we would need the specific PDF files that reproduce the issue in order to investigate it properly on our side.

If you can share the affected files, it will greatly help us reproduce the problem and identify the root cause.

Thank you for being so cooperative!

Hi, do you really need the files, or could the call stacks already be sufficient?

I understand that it would be easier if we provided the files, but we received these exceptions from our customers, and it might be hard to convince them to send us the files (which could contain sensitive information) and to allow us to forward them to you.

As mentioned in the comment here:

Most of these issues look like very specific data content issues or corrupted files, because in almost all of the call stacks the sequence of calls includes a constructor (ctor) or a method that loads from a stream or file.
That makes me suspect some specific cases with the data content.
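
If the files themselves cannot be shared, one possible middle ground is a report of content-free structural facts about each affected PDF. The sketch below is only an illustration, not part of GroupDocs.Parser or Aspose.Pdf: the function name `pdf_sanity_report` and the checks it performs are assumptions, and the checks are rough heuristics on the raw bytes (header version, `%%EOF` marker, `startxref` offset, presence of a `/Root` key). They cannot prove a file is valid, but they may hint at truncation or a damaged trailer without exposing any document text.

```python
import hashlib
import re


def pdf_sanity_report(data: bytes) -> dict:
    """Collect content-free structural facts about a PDF given as raw bytes.

    Hypothetical helper: every check is a heuristic on the raw bytes,
    not a real PDF parse.
    """
    head, tail = data[:1024], data[-1024:]

    # The "%PDF-x.y" header is expected near the start of the file.
    header = re.search(rb"%PDF-(\d\.\d)", head)

    # The last "startxref" entry near the end points at the cross-reference data.
    offsets = re.findall(rb"startxref\s+(\d+)", tail)
    offset = int(offsets[-1]) if offsets else None

    return {
        "size_bytes": len(data),
        # The hash identifies the file without revealing its content.
        "sha256": hashlib.sha256(data).hexdigest(),
        "header_version": header.group(1).decode("ascii") if header else None,
        # A missing end-of-file marker often indicates a truncated file.
        "has_eof_marker": b"%%EOF" in tail,
        # An offset beyond the end of the file suggests a damaged trailer.
        "startxref_in_range": offset is not None and 0 < offset < len(data),
        # "/Root" names the document catalog in the trailer dictionary.
        "has_root_key": b"/Root" in data,
    }
```

A customer could run this on their side, for example with `pdf_sanity_report(open(path, "rb").read())`, and send back only the resulting dictionary together with the matching exception.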