How to create a full-text index from ZIP or RAR files in S/4HANA to perform a full-text search

Stanislav_Zizka
Discoverer
0 Kudos
332

Dear All,

In our S/4HANA system, we create a full-text index after uploading documents (PDF, Word, and ZIP files) and use this index to perform full-text searches. According to the "SAP HANA Search Developer Guide (2017)", ZIP and RAR files are not supported when creating a full-text index (see excerpt below). Our tests confirm this: a full-text search in ZIP files does not work.

Does anybody know of a solution that has become available in the meantime (since 2017)?

Excerpt from the Developer Guide:
"In a SAP HANA database, when you create a table that contains large text or binary documents, you can define the columns with the TEXT or BINTEXT data type. This data type allows you to query textual data and present content excerpts in search hit lists. You can also reconstruct the document and display its original textual content.

Note: Embedded document filters can automatically extract text and metadata from most popular document formats, including Adobe PDF, Microsoft Office, OpenDocument, email, HTML, XML, and plain text.
Archive file formats, which may contain nested sub-files, are not supported. This includes ZIP and RAR files, as well as personal email folders like PST files."

Thank you in advance for your help / tips.

Best regards, Stan

Accepted Solutions (0)

Answers (1)

mamartins
Active Contributor
0 Kudos

The latest SAP HANA version, 2.0 SPS 07, maintains the same policy (see https://help.sap.com/docs/SAP_HANA_PLATFORM/691cb949c1034198800afde3e5be6570/cc999564bb57101480cb97c...).

Why not create a mechanism like a script to extract the contents before upload?
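
A minimal sketch of what such a pre-upload step could look like, using only Python's standard `zipfile` module. The function name `extract_indexable`, the `INDEXABLE` extension list, and the `!/` path separator for nested archives are all hypothetical choices for illustration. The upload into S/4HANA itself is not shown, and RAR files would need a separate third-party tool, since the standard library does not read them.

```python
import io
import zipfile
from pathlib import PurePosixPath

# Assumption: file types the HANA document filters can process; adjust as needed.
INDEXABLE = {".pdf", ".doc", ".docx", ".txt", ".html", ".xml"}


def extract_indexable(data: bytes, prefix: str = "", max_depth: int = 3):
    """Yield (path, content) for each indexable file inside a ZIP archive.

    Nested ZIP files are unpacked recursively up to max_depth levels;
    files with other extensions are skipped.
    """
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            path = f"{prefix}{info.filename}"
            suffix = PurePosixPath(info.filename).suffix.lower()
            if suffix == ".zip" and max_depth > 0:
                # Unpack the nested archive and prefix its members with its path.
                nested = archive.read(info)
                yield from extract_indexable(nested, f"{path}!/", max_depth - 1)
            elif suffix in INDEXABLE:
                yield path, archive.read(info)
```

Each yielded pair could then be uploaded as an individual document, so that the existing full-text index picks it up like any other PDF or Word file.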

 

Stanislav_Zizka
Discoverer
0 Kudos
Thank you for your answer. Regarding the script: our IT developers estimate that writing custom code to unzip the files before indexing would require a large (to huge) effort and would be yet another enhancement to the system. If possible, we would like to stay as close to the SAP standard as possible 😉 We have heard that there is existing (SAP standard) software that can potentially unzip a file. Do you know of any? Thank you.