Technology Blog Posts by SAP
Mukta_Joshi
Associate

Methodology

SAP BTP offers OCR (Optical Character Recognition) capabilities that convert scanned documents or images containing text into editable, searchable data. With OCR, businesses can extract text from documents such as invoices, receipts, and contracts, which automates data entry and reduces manual errors.

To start, go to SAP BTP Cockpit Trial -> Service Marketplace -> Document Information Extraction Trial -> Create Instance.

Step 1 — Create Instance

Service — Document Information Extraction Trial

Plan — blocks_of_100

Runtime Environment — Cloud Foundry

Space — dev/trial

Instance Name — Food Receipts (or any relevant name)

Mukta_Joshi_6-1724683092359.png

 

Step 2 — Create Schema

  • Enter a name that fits the problem statement (in this case, Food_Receipt_Invoice).
  • Set Document Type to Custom, and set the OCR type to Document or Scene Text, depending on the scenario.

Mukta_Joshi_5-1724683079638.png

 

Step 3 — Add fields to the Schema

Name each field after the information you need to extract, and set the data type that matches it. The setup type can be Manual or Auto; in this case we use Auto.
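It can help to write the field list down as plain data before entering it in the UI. The following is a minimal sketch in Python, not part of the service itself: the field names (`merchant_name`, `receipt_date`, `total_amount`) and the set of data types are assumptions for illustration, and only the Manual/Auto setup types come from this step.

```python
from dataclasses import dataclass

# Setup types named in this step.
SETUP_TYPES = {"manual", "auto"}
# Assumed data types for illustration; check the UI for the actual list.
DATA_TYPES = {"string", "number", "date"}


@dataclass(frozen=True)
class SchemaField:
    name: str
    data_type: str
    setup_type: str = "auto"  # this walkthrough uses Auto

    def __post_init__(self):
        # Reject values that are not in the allowed sets above.
        if self.data_type not in DATA_TYPES:
            raise ValueError(f"unknown data type: {self.data_type}")
        if self.setup_type not in SETUP_TYPES:
            raise ValueError(f"unknown setup type: {self.setup_type}")


# Hypothetical fields for a food receipt.
FOOD_RECEIPT_FIELDS = [
    SchemaField("merchant_name", "string"),
    SchemaField("receipt_date", "date"),
    SchemaField("total_amount", "number"),
]

for field in FOOD_RECEIPT_FIELDS:
    print(f"{field.name}: {field.data_type} ({field.setup_type})")
```

Keeping the list in one place like this makes it easier to review the fields with colleagues before creating them one by one in the schema.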

Mukta_Joshi_4-1724683068869.png

 

Step 4 — Activate Schema

Mukta_Joshi_3-1724683058326.png

 

Step 5 — Upload the necessary Document

  • Select Custom as the document type, which lets you upload any kind of document.

Mukta_Joshi_2-1724683039380.png

For the schema, select the food receipt schema you created in Step 2.

Click Confirm, then go to Documents.
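The same upload can in principle be scripted against the service's REST API instead of the UI. The sketch below only builds the request options and sends nothing. The option keys (`clientId`, `documentType`, `schemaName`), the `default` client, and the helper name `build_upload_options` are assumptions that should be verified against the API reference of your own instance.

```python
import json


def build_upload_options(schema_name, client_id="default"):
    """Return the JSON options string for a custom-document upload.

    The key names are assumptions, not confirmed by this walkthrough.
    """
    if not schema_name:
        raise ValueError("schema_name must not be empty")
    options = {
        "clientId": client_id,
        "documentType": "custom",  # Step 5 selects Custom
        "schemaName": schema_name,  # schema created in Step 2
    }
    return json.dumps(options, sort_keys=True)


print(build_upload_options("Food_Receipt_Invoice"))
```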

Step 6 — Review the extraction results

Mukta_Joshi_1-1724683001531.png

 

You can also review the extraction results in CSV format, which provides more detailed information about the document.
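The CSV results can also be processed with a few lines of Python. This is a minimal sketch under stated assumptions: the column names (`name`, `value`, `confidence`) and the sample rows are made up for illustration, and the real file may use different columns.

```python
import csv
import io

# Made-up sample standing in for a results file.
SAMPLE_CSV = """name,value,confidence
merchant_name,Cafe Example,0.93
total_amount,18.50,0.71
"""


def low_confidence_fields(csv_text, threshold=0.8):
    """Return names of fields whose confidence is below the threshold."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row["name"] for row in reader if float(row["confidence"]) < threshold]


print(low_confidence_fields(SAMPLE_CSV))
```

A filter like this is one way to decide which extracted values deserve a manual check before they are used downstream.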

Using SAP Business Technology Platform for document information extraction helps organizations unlock the value hidden in their unstructured data. By combining OCR, NLP, ML models, and integration with intelligent technologies, businesses can automate document processing tasks, extract actionable insights, and make better-informed decisions. Whether the goal is greater operational efficiency, better compliance, or an improved customer experience, SAP BTP provides a comprehensive platform for document information extraction.

Mukta_Joshi_0-1724682970937.png

Step-by-step guide to the process:

https://medium.com/@mjoshi2669/document-information-extraction-using-sap-btp-cockpit-1d9886b39096

 