
May Developer Challenge - SAP AI Services

noravonthenen
Developer Advocate
8,223

!!! THIS CHALLENGE IS CLOSED !!!

CHECK OUT WEEK 2 OF THIS CHALLENGE

CHECK OUT WEEK 3 OF THIS CHALLENGE

CHECK OUT WEEK 4 OF THIS CHALLENGE

CHECK OUT WEEK 5 OF THIS CHALLENGE

Welcome to week 1 of the May Developer Challenge on AI at SAP! This month's challenge covers two SAP AI Services: Document Information Extraction and Data Attribute Recommendation. To participate, post a screenshot of your solution as a reply in the discussion for the corresponding week.

SAP AI Services help you implement custom use cases by providing powerful algorithms specifically tailored to business problems.

Document Information Extraction:

  • The Document Information Extraction service is available in two editions: the original Base Edition and the new genAI-based Premium Edition. The Premium Edition uses a large language model via the generative AI hub on SAP AI Core to extract information from all kinds of documents.
  • With Document Information Extraction you can extract information from PDF files and from single-page JPEG, PNG and TIFF files.
  • Supported document types are: invoice, paymentAdvice, purchaseOrder, businessCard, deliveryNote, resume and birthCertificate. You can also create your own schema to process other document types.
  • You can also retrieve the OCR results directly to process the raw text from your document files, and use the classification capabilities to sort your documents into three classes: invoice, purchase order and payment advice.
  • You can also enrich your extracted data with your metadata.
  • You can access the Document Information Extraction service via the UI, via swagger/client calls and the Python SDK.
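
For the API route, preparing a request could be sketched in Python roughly as follows. This only checks a file against the file types listed above and assembles an options payload; it does not call the service. The function names and the key names in the payload are assumptions for illustration, so check the service's Swagger definition for the actual request format.

```python
import json
from pathlib import Path

# File types named in the list above. ".jpg" and ".tif" are added as common
# alternative suffixes for JPEG and TIFF (an assumption, not from the post).
SUPPORTED_SUFFIXES = {".pdf", ".jpeg", ".jpg", ".png", ".tiff", ".tif"}

# Predefined document types named in the list above.
PREDEFINED_DOCUMENT_TYPES = {
    "invoice", "paymentAdvice", "purchaseOrder", "businessCard",
    "deliveryNote", "resume", "birthCertificate",
}


def is_supported_file(filename: str) -> bool:
    """Return True if the file suffix is one of the supported file types."""
    return Path(filename).suffix.lower() in SUPPORTED_SUFFIXES


def build_options(document_type: str, header_fields: list, line_item_fields: list) -> str:
    """Assemble a JSON options string for an extraction request.

    The key names used here are hypothetical and only illustrate the idea.
    """
    if document_type not in PREDEFINED_DOCUMENT_TYPES:
        raise ValueError(f"Unknown predefined document type: {document_type}")
    options = {
        "documentType": document_type,
        "extraction": {
            "headerFields": header_fields,
            "lineItemFields": line_item_fields,
        },
    }
    return json.dumps(options)
```

A check like `is_supported_file("recipe.pdf")` before uploading avoids spending a request on a file type the service does not accept.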

Data Attribute Recommendation:

  • With Data Attribute Recommendation you can train your own model to classify data records. You can also tackle more complex classification problems, such as hierarchical classification of products, and predict missing values in data records.
  • Data Attribute Recommendation can be used via Swagger/client calls as well as the AI API Python SDK and SAP AI Launchpad.
  • If you want to access Data Attribute Recommendation via Postman, you can download this Postman Collection.

Weekly Challenges

Week 1 Challenge – DOX UI

This week you will use the UI of the Document Information Extraction service to extract information from your favorite recipe. The UI is a good way to try out your use case and get a feeling for the capabilities of the service. For productive use cases you would call the APIs or implement a workflow using the Python SDK. For example, you could implement a workflow that processes documents straight out of your mailbox, saves the extracted information in the system and structure you need, and triggers other necessary workflows.

For this week’s challenge, use the UI to extract the header fields “recipe name” and “portions” and the line items “quantity” and “ingredient” from your chosen recipe. To do this, you need to create a custom schema. Make sure the recipe is in one of the supported languages.

When creating a custom schema, choose the Setup Type auto to use the LLM/genAI-based Premium Edition. In the description field, provide information that helps the large language model understand what you are referring to, e.g. “the name of the recipe”.

noravonthenen_0-1714546116599.png
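
To plan the schema before clicking through the UI, it can help to write it down as a small data structure. The following Python sketch is only a planning aid and not the service's schema format: the class names, the field identifiers such as `recipeName`, the data types and all descriptions except “the name of the recipe” are assumptions for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class SchemaField:
    name: str          # hypothetical field identifier
    description: str   # natural-language hint for the large language model
    data_type: str = "string"


@dataclass
class RecipeSchema:
    header_fields: list = field(default_factory=list)
    line_item_fields: list = field(default_factory=list)


def build_recipe_schema() -> RecipeSchema:
    """Return the two header fields and two line items asked for this week."""
    return RecipeSchema(
        header_fields=[
            SchemaField("recipeName", "the name of the recipe"),
            SchemaField("portions", "the number of portions the recipe yields", "number"),
        ],
        line_item_fields=[
            SchemaField("quantity", "the amount of the ingredient, including its unit"),
            SchemaField("ingredient", "the name of the ingredient"),
        ],
    )


if __name__ == "__main__":
    schema = build_recipe_schema()
    for f in schema.header_fields + schema.line_item_fields:
        print(f"{f.name} ({f.data_type}): {f.description}")
```

Writing the descriptions out first makes it easier to compare different wordings when the extraction results are not what you expected.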

  1. Get a free trial account and run DOX booster: https://developers.sap.com/tutorials/cp-aibus-dox-booster-key.html
  2. Get the Document Information Extraction UI: https://developers.sap.com/tutorials/cp-aibus-dox-ui-sub.html
  3. Create a custom schema: https://developers.sap.com/tutorials/cp-aibus-dox-ui-gen-ai.html
  4. OPTIONAL: Create a template and add your document to the template (improves performance for future recipes)
  5. Upload your favorite recipe to extract the name, portions, quantity and ingredients. Make sure your recipe PDF is only 1 or 2 pages long, otherwise you will quickly reach the limit (50 pages) of the trial plan. Try not to use the entire 50-page quota, because we will need it next week as well!
  6. Submission: share a screenshot of the extraction results and the document and write a comment to share your experience using the UI in the discussion below.
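
The page limit from step 5 is easy to keep track of with a few lines of Python. This is a minimal sketch that assumes every uploaded page counts once against the 50-page trial limit; the function names are made up for this example.

```python
TRIAL_PAGE_LIMIT = 50  # trial plan limit mentioned in step 5


def remaining_pages(uploaded_page_counts, limit=TRIAL_PAGE_LIMIT):
    """Pages left in the quota, given the page counts of earlier uploads."""
    used = sum(uploaded_page_counts)
    return max(limit - used, 0)


def can_upload(page_count, uploaded_page_counts, limit=TRIAL_PAGE_LIMIT):
    """True if a document with page_count pages still fits into the quota."""
    return page_count <= remaining_pages(uploaded_page_counts, limit)


if __name__ == "__main__":
    uploads = [2, 1, 2]              # three recipes uploaded so far
    print(remaining_pages(uploads))  # 50 - 5 = 45
    print(can_upload(2, uploads))    # True
```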

Example Screenshot:

noravonthenen_1-1714546116619.png

Additional information:

Processing a ©Pokémon Card in 90 seconds with Document Information Extraction powered by generative AI: https://community.sap.com/t5/technology-blogs-by-sap/processing-a-pok%C3%A9mon-card-in-90-seconds-wi...

Be aware of limits that apply in free tier and trial accounts: https://help.sap.com/docs/document-information-extraction/document-information-extraction/free-tier-...

How to improve your results: https://help.sap.com/docs/document-information-extraction/document-information-extraction/best-pract...

In this “2-min of” video I describe the technical aspects of the Base Edition of the service (without the use of an LLM) behind the scenes.

48 REPLIES

JerMarijnissen
Participant
904

I took a picture from a weekly Belgian magazine.

JerMarijnissen_0-1716227485243.png

Extraction of the ingredients is going well. I tried to split quantity and unit of measurement, but I am still looking for the best prompt.
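
One alternative to prompt tuning would be to extract the quantity as a single string and split it afterwards. The following Python sketch shows that idea under simple assumptions: the amount comes first, is a whole number, a decimal or a simple fraction, and everything after it is treated as the unit. The function name and the pattern are made up for this example and are not part of the service.

```python
import re

# Amount: digits, optionally a decimal part ("1,5" or "1.5") or a fraction ("1/2").
# Unit: whatever follows, as long as it does not start with a digit.
_QUANTITY_RE = re.compile(
    r"^\s*(?P<amount>\d+(?:[.,]\d+)?(?:\s*/\s*\d+)?)\s*(?P<unit>[^\d\s].*)?$"
)


def split_quantity(raw: str):
    """Split an extracted quantity string into (amount, unit).

    Returns (None, text) if the string does not start with a number,
    so that values like "a pinch" are passed through unchanged.
    """
    match = _QUANTITY_RE.match(raw)
    if match is None:
        return None, raw.strip()
    amount = match.group("amount").replace(" ", "")
    unit = (match.group("unit") or "").strip()
    return amount, unit


if __name__ == "__main__":
    for value in ["200 g", "1/2 tsp", "1,5 l", "2", "a pinch"]:
        print(value, "->", split_quantity(value))
```

The amount is kept as a string on purpose, so that fractions and decimal commas are not silently converted.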

@noravonthenen 

Is there a way to extract multiple tables?
For example, one table with the ingredients and one table with the preparation steps?

Vitaliy-R
Developer Advocate
572

Screenshot 2024-05-22 at 17.35.36.png

SPAGHETTI ALLA PUTTANESCA has been my primary dish recently when I need to cook something quickly for the family or guests.

Mikkelj
Explorer
480

Mikkelj_0-1716453272796.png

That was super fun and inspiring! 🙂

I am missing a way to have two sets of line items, so that both an ingredient list and a "Steps" list can be created.

 

krishnam_prasanth
Discoverer
0 Kudos
457

I have some questions about this service. If the Premium Edition extracts inaccurate information, can a user manually map the fields in the document? And is it possible to train a Premium Edition model to extract the required fields? When we try to map fields in the document, we are not able to map the value that should be extracted.

I hope you can help me with this query.

0 Kudos
281

Yes, you can create templates for your documents to make the service fit your layout better 🙂 Adding around 5 documents to that template should be enough!

0 Kudos
273

Hi Noravonthenen,

After adding 5 documents, mapping the fields we require correctly and associating those documents with the template, we are still facing issues: the extracted fields do not match the mapping we provided. I hope you can help with the issue I am facing.

harsh_itaverma
Participant
444

Thanks @noravonthenen for curating this amazing developer challenge on AI.

I'm a bit late, but below is my submission for Week 1.

harsh_itaverma_1-1716485208436.png

Thanks,
Harshita

sabarna17
Contributor
363

This is my favorite dish.

sabarna17_0-1716834509982.png