Hi all. Has anyone had experience extracting data from PDFs, or from PDF forms?

Yes, this is standard functionality within Adobe Acrobat Pro DC. There are also tools like ABBYY, Kofax, etc. if this is routine or part of a process. Out of interest, what is your use case or process?

This came up in relation to the directive to phase out faxes. An easy solution is to scan or save docs to PDF, but that doesn’t yield useful data. I’d be keen to hear about any experience you or your colleagues have had. Happy to chat by phone/Zoom or email.

Depending on how you want to use it, the extraction tool can be connected to a system like Blue Prism ( https://www.blueprism.com/ ) that takes the extracted text and adds it to a database.
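
To make the "takes the text and adds it to a database" step concrete, here is a minimal Python sketch using only the standard library. It is not how Blue Prism works internally; it only illustrates the idea of writing already-extracted fields into a table. The table name `extracted_fields`, the function `store_extracted_fields`, and the sample document and values are all hypothetical.

```python
import sqlite3


def store_extracted_fields(conn, document_name, fields):
    """Write one row per extracted field and return the number of rows written.

    `fields` is assumed to be a dict of field name -> text value that some
    upstream tool (OCR or a PDF form reader) has already produced.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS extracted_fields ("
        "document TEXT NOT NULL, field TEXT NOT NULL, value TEXT)"
    )
    rows = [(document_name, name, value) for name, value in fields.items()]
    # Parameterised inserts, so field values are never interpolated into SQL.
    conn.executemany(
        "INSERT INTO extracted_fields (document, field, value) VALUES (?, ?, ?)",
        rows,
    )
    conn.commit()
    return len(rows)


# Hypothetical example data; an in-memory database keeps the sketch self-contained.
conn = sqlite3.connect(":memory:")
written = store_extracted_fields(
    conn,
    "referral_001.pdf",
    {"patient_name": "Jane Example", "referral_date": "2020-05-01"},
)
```

A one-row-per-field layout is used here only because it copes with forms whose fields differ from document to document; a fixed-column table would be equally reasonable if every form has the same fields.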

Exactly as @jon_herries mentioned, you can use Blue Prism to automate the process of scanning/retrieving the PDF and passing it on to an OCR tool (e.g. ABBYY) that will extract the data needed. In the context of automation, we are exploring a variety of use cases, ranging from reading PDFs to analysing lab images. Happy for you to reach out; my email is paragb@adhb.govt.nz.
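
As a rough illustration of the "extract the data needed" step once an OCR tool has returned plain text, a minimal Python sketch follows. It assumes the form prints labelled lines such as "Patient name: ..." and uses regular expressions to pick them out; the labels, the `FIELD_PATTERNS` table, the `extract_fields` function, and the sample text are all hypothetical. Commercial tools like ABBYY have their own template mechanisms, which this does not represent.

```python
import re

# Hypothetical field labels; a real form would need its own patterns.
FIELD_PATTERNS = {
    "patient_name": re.compile(
        r"^Patient name:\s*(.+)$", re.IGNORECASE | re.MULTILINE
    ),
    "referral_date": re.compile(
        r"^Referral date:\s*(\d{4}-\d{2}-\d{2})\s*$", re.IGNORECASE | re.MULTILINE
    ),
}


def extract_fields(ocr_text):
    """Return a dict of field name -> value, with None for fields not found."""
    result = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(ocr_text)
        result[name] = match.group(1).strip() if match else None
    return result


# Hypothetical OCR output for one scanned form.
sample_text = (
    "Referral form\n"
    "Patient name: Jane Example\n"
    "Referral date: 2020-05-01\n"
)
fields = extract_fields(sample_text)
```

Returning `None` for a missing field, rather than raising, lets a downstream step decide whether to send the document to a person for manual review.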