4 weeks ago
Hello,
I am currently working on extracting fields from over 100 different templates of Indian invoices using the SAP Document Information Extraction service. However, I am facing challenges in obtaining accurate results across multiple scenarios.
Here are the two approaches I have tried:
Scenario 1:
Issues Encountered:
Scenario 2:
A few examples of Issues Encountered:
When I create a template for each invoice format under the custom schema and train it, the results are even worse than they were before training.
Could anyone help with tackling these issues or suggest a better approach to achieve more accurate results in field extraction?
Thank you!
Best regards,
Koushik
You need to create an individual document template for each distinct invoice format, and do not use the custom schema you created. Try using OCR with AI capability.
Alternatively, for the documents that fail with the pre-trained model, try extracting the data with the built-in OCR (PDF) and compare the results.
I think you will get better results that way.