Discover the key differences between OCR and Agentic AI and how they impact automation, accuracy, and efficiency in finance.
I had the great pleasure recently to speak with Brian Shannon, enterprise account executive for North America, who works closely with Luca van Skyhawk, the Chief Revenue Officer for the US and Canada. He shared some interesting insights, from his perspective, on the market sentiment towards Agentic AI and how these compares to OCR.
Brian: I’ve been working in Finance for a long time. I productized an Accounts Payable system and then an Order Management system over 14 years ago. Since 2010 I’ve seen a lot of evolution in the approach to finance workflows and the optimization of business processes. In this time, the capture component has always been lagging. Optical Character Recognition (OCR) solutions were the first attempt at solving the capture issue. In my opinion, the OCR vendors did a horrible job of managing customer expectations about what it was capable of. AI has built on the concept of OCR and provides the capability for capture plus recognition and interpretation.
Brian: For me, the primary difference is context. Agentic AI can apply context and interpret information. I’ll give you an example.
We were doing a demonstration for a prospect, and they provided us with purchase order and invoice data, from accounts payable. However, the invoice they received from their vendor didn’t match the textual description on the purchase order.
The purchase order was from a steel manufacturer, so the content was complex – a series of part numbers, codes, measurements, and descriptions. I could not even decipher it. Agentic AI recognized that we should post the invoice to a general ledger account for alloys. The steel manufacturer was really surprised given that the word ‘alloy’ was not on the purchase order or the invoice.
In this situation, Agentic AI was able to review both documents, triangulate the information and come up with a logical response. The prospect confirmed it was correct. It was impressive. OCR captures information, it does not understand the context and cannot triangulate.
Brian: Yes. Another major difference between OCR and Agentic AI is the ability to understand context based on rules that are provided with natural human language. This means that we don’t have to set up our rules (or templates) in advance to cater for every eventuality. The latter is tough to do. There will always be exceptions. We simply need a knowledgeable and capable individual to be responsible for acting as a super user with oversight capability and the ability to correct when Agentic AI needs it.
With OCR we focus on having a template. With Agentic AI, our focus is on making sure we can validate and trust the data being processed. Various data inputs could be used to triangulate and validate the data. For example, policy documents, codes, rules and 3rd party, external data that has been certified as accurate. All these inputs, including the catalog example I used earlier, could be fed into the system as valid data inputs for Agentic AI to reference. This is how the system was able to come up with the correct answer in the earlier alloy example.
Brian: The ability for AI to apply intelligence and perform the actions of triangulation and validation is huge. Imagine, for example, that you are configuring a multi-part item with many interdependencies such as a car. If the customer wants a sunroof, then the configuration rule might say this can’t be included in the basic package. The customer might need to purchase a different package. All the packages are defined up-front, and each possible permutation would have a master code. What happens if an error is made in the packaging definitions? Perhaps something is not compliant with a government mandate? With a manual approach, we would have to go back to step 1 and re-define the packages. With Agentic AI, we can resolve the error at the point of finding it, by validating the package against available data. We can validate the changes, apply revised rules and then continue processing. The result is significantly faster throughput.
Brian: Yes, absolutely. The agent is autonomous and can start the order process once it has the relevant information. With OCR, you still have a manual step where someone needs to start another job or task/action.
Brian: There's obviously a mathematical increase of 20+% but this doesn’t provide the full picture, and you can’t calculate the value like this. The difference is the capability of setting additional rules very easily and being able to bring in additional validated reference material to provide an advanced level of context and intelligence (as I mentioned earlier). It comes down to “How much does it cost you in time and effort to do the upfront training required for OCR?” compared to the flexibility and intelligence of Agentic AI being able to adapt unaided and modify at the point of impact when processing orders – and move on.
With both OCR and Agentic AI, you need to have your processes well defined in order to automate, otherwise you will automate garbage.
Curious about what Brian has to say about Agentic AI in order management and finance processes? Check out "Why Agentic AI in Order Management Isn't Just Another Trend."
Discover how much you can save here: https://www.hypatos.ai
Further stories from our blog