2023 Mar 29 1:58 PM
Hi,
is it possible to read the content of archived documents (SD invoices). I have a PDF where i have the invoice number, (000000421) see below:
I can retrieve the PDF content in binary format and in char-format with function module ARCHIVOBJECT_GET_BYTES. The result is like that and not "readable":
binary content:
char-content:
So how can i bring this data into "readable" content ? Any ideas ?
br Martin
2023 Mar 29 2:25 PM
it depends of your PDF, some PDF contains a picture, so you need an OCR
if you generate the PDF from SAP, you could create flag inside the pdf, and read the content of these flags (it is Interactive Adobe Form)
2023 Mar 29 2:35 PM
frdric.girod Sorry, i cant change the PDF, the pdf's are already archived as i described. So i have to read the content from the archived document.
2023 Mar 29 2:58 PM
Maybe you could try to use external command on Linux/Unix like pdf2txt or pdf2word ... (you will have to install it).
But if your PDF is not always with the same layout, it will become quickly a nightmare, even with an OCR tool
2023 Mar 30 1:34 PM
You need to use external software to read PDF (unless it's a SAP Adobe Form generated via ADS) or develop your own ABAP "software" to decode it.