Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Batch OCR Invoice PDFs and extract text to input into web based fields - can but hard #304

Closed
Domsdorm opened this issue Sep 14, 2021 · 7 comments
Labels

Comments

@Domsdorm
Copy link

Hi! I am currently trying to develop a RPA system where invoices that were scanned manually(into PDF) are OCR-ed and specific text are then extracted for filling up fields in a web-based form. I was wondering if TagUI is able to do this?

A little background, I'm currently a uni student interning and was given this task to perform. I came across your tool while studying and was really impressed and am trying to use it for the task listed above.

Thank you for your help!!

@kensoh
Copy link
Member

kensoh commented Sep 15, 2021

Hi @Domsdorm see below link for a full solution and demo of what you mentioned. It is an automation script to solve Automation Anywhere Week 4 RPA challenge. The tough part is from converting from the unstructured image data into structured data. This is very tough to get right, and involve a lot of trial and error and work. You can see below link to know more about the considerations and options to do this.

aisingapore/TagUI#1093 (comment)

@kensoh kensoh added the query label Sep 15, 2021
@kensoh kensoh changed the title Is it possible to batch OCR Invoice PDFs and extract specific texts from them to input into web based fields? Batch OCR Invoice PDFs and extract text to input into web based fields - can but hard Sep 15, 2021
@Domsdorm
Copy link
Author

Thanks @kensoh for the reply! Currently am trying to bypass the firewall that my company has by using the steps you have told me about in the telegram group. I Will update if I run into any troubles when doing the code.

Really appreciate you taking your own free time to help. Cheers!

@Domsdorm
Copy link
Author

Currently am trying to run the code for the Week 4 RPA challenge but OpenJDK is needed, is there anyway to bypass this?

Also is there a way to check in the script if the invoice data(for example Invoice number) is correctly being pulled?

@kensoh
Copy link
Member

kensoh commented Sep 16, 2021

For 1st question, need OpenJDK / Java 64-bit to do the part on opening file explorer to choose file. But there is workaround, you can use r.upload() to choose the file without opening the file browser (criteria needed by organiser for the challenge). See this solution from another user - https://github.com/DanielCCF/BotGamesAA/blob/master/Week4/Solution-Python.py#L125

For 2nd question, this RPA package requires user to know Python. I'm assuming that you are new to Python that's why you ask this. Because the answer is already written in the Python script itself, the OCR of the image files and extracting the individual data like invoice number. This automation is hard to understand and do without Python knowledge. Most of it is Python programming knowledge, only some are RPA concepts related to this tool.

@Domsdorm
Copy link
Author

Yupp I'm not from a CS background but am interested in learning more. Thank you for your patience tho 😅

@Domsdorm
Copy link
Author

Domsdorm commented Sep 22, 2021

Currently I am trying to select a drop down option (incoming WO). I am able to select the main header but am unable to select the Incoming WO option.

Untitled
')

r.click('//[@aria-haspopup="hauptMenu:submenu:12"]')
r.wait(1)
r.click('//
[@tabindex=""-1">Incoming WO<"]

Is there a way to select that option? Or by using r.click(x, y). If using r.click(x, y) is possible, how do I know which x,y values to input.

@kensoh
Copy link
Member

kensoh commented Sep 22, 2021

You can try if using the r.select() works - see more on usage and examples in API section.

@kensoh kensoh closed this as completed Jan 11, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Development

No branches or pull requests

2 participants