Machine vision key to unleashing automation’s full potential

By Tony McCandless, Chief Technology Officer, SS&C Blue Prism
Share
Automation adoption has accelerated in recent years, becoming essential for businesses, but machine vision is key to unlocking its full potential

Businesses across the globe are dealing with daily financial and political uncertainty, set against a backdrop of pandemic recovery. Optimising processes and systems with advanced automation has the potential to improve efficiencies and support companies as they look to not just survive but thrive in a volatile environment. Machine learning (ML), artificial intelligence (AI) and robotic process automation (RPA) are all terms we frequently hear when discussing the value of advanced automation. Machine vision, however, is a key part of unleashing the full potential of these technologies – yet it is rarely included in the automation conversation. Operational managers need to be aware of the role it plays in automation to get the best out of automating their front and back-office processes.

Machine vision is a class of technologies that process information from visual inputs such as images, documents, computer screens, videos and more. Its value in automation lies in its ability to capture and process large quantities of documents, images and video quickly and efficiently in quantities and speeds far in excess of human capability. Machine vision usually works with other advanced technologies, including natural language processing, RPA, AI and machine learning, to deliver automation’s impact on business operations. Machine vision is the eyes of automation, AI and machine learning are the brains and RPA is the backbone you hang these technologies onto to leverage them in automation. 

Capitalising on business opportunities

Automation adoption has accelerated in recent years, becoming essential for businesses to remain competitive across industries. While organisations are prioritising these investments, they are also facing increasing cost pressures, with the aftershocks of the pandemic, supply chain disruptions and geopolitical events all spiking the prices for essential materials, products and services. Documents, images and computer screen-based information are obliquitous elements of the work organisations need to do. Because of this, the use of computer vision has exploded because a significant parentage of front and back-office processes involve dealing with visual information whether that be documents, video or objects like text boxes, scroll bars or buttons on screens. In many businesses, if you want to automate at scale, you will probably have to process image data of one form or another at scale as well.

One of the most common uses of machine vision in automation is in document processing. Machine vision combined with machine learning are the active ingredients of what is referred to as intelligent document processing (IDP): automatically processing and classifying documents, extracting printed or handwritten data and then deciphering the content for further automated processing.

IDP is particularly useful when automating document qualities at scale. For example, technology is transforming traditionally paper-heavy and process-driven sectors, like the financial services industry – by minimising the need for people to be involved in certain processes that might typically have required extracting data from large numbers of documents. Even during the peak of the pandemic in 2020, when most were relying on screens and working from home, an estimated 2.8 trillion pages of paper were printed. Companies are also collectively spending billions on annual wages for data entry.

Machine vision automation is not just about scale, though – it’s about accuracy and improving the work humans do as well. The tedious repetitiveness of these tasks contributes to significant error rates and leads to low satisfaction and high levels of turnover, particularly when dealing with handwritten documents which are processable with IDP.

Insurance administrators no longer devote their days to manually digitising paper applications; banking clerks don’t have to manually enter customer information or spreadsheet data into databases; brokers can avoid the additional work that arises from the errors that can occur when processing high volumes of transactions under strict daily deadlines. By filtering machine vision extracted data inputs through machine learning and AI-based technologies, the speed, accuracy and organisation of processing needed to embrace automation technologies can be realised.

The sophistication of how computer vision is applied in automation is not limited to document processing. Video-based facial recognition in security processes, checkout-less supermarkets and remote equipment identification via drones for inventory management are examples of how computer vision is being leveraged in automation.

Machine vision-based technologies are even becoming central to the creation of automations themselves. For example, instead of relying on human workers to describe processes that are being automated when designing automations, recordings of the process to be automated are created and then machine vision software, combined with other technologies, is used to capture the process end-to-end and then provide the input to automating a lot of the work needed to program the digital workers (bots).

Ensuring accuracy and leaving collaboration with humans in the loop

Standards of accuracy and bias are a concern cited by organisations when it comes to relying on artificial solutions to undertake certain processes. This is why it’s important to have the right processes in place for each application to ensure the best outcome. For automated document processing measures that loop in human workers when uncertainties arise are common. Just like some oversight is needed for humans undertaking processes, that diligence should be applied to digital workers as well.

Conversely, machine vision and AI are also used to QA human-based processes. In healthcare, automated second opinions of radiology-based diagnoses are increasing in use. This is partly because it reduces the time and cost it takes to process second opinions but also because in a growing number of areas, machine vision/AI based processing of radiology images is more accurate than humans.

Youtube Placeholder

Human in the loop (or automation in the loop) avoids the issue of relying solely on technology or humans in areas with critical consequences, whilst allowing humans to utilise the statistically more efficient and accurate capacities of automation technologies. Human healthcare workers are then able to effectively attend more resources to more patients by reaping the benefits of human-digital collaboration. This is the real driver of automation in healthcare – the realisation that every cost saved in administration and clinical processes is a cost that can be allocated to improving patient care. It goes without saying that healthcare is one of the most enthusiastic adopters of automation today.

The future of work is agile and machine vision facilitates this, adding more intelligence to intelligent automation. This technology allows digital workers to interact with screens, documents and video like humans do, which is a big breakthrough. Ultimately, a more fulfilled and satisfied workforce, along with a more competitive and profitable business, are achieved.

Endless possibilities and opportunities

Machine vision is integral to maximising the impact of advanced automation technologies on business operations and paving the way for increased capabilities in the automation space. Self-driving cars are not too far off and demonstrate how machine vision is being pushed to its furthest capabilities.

We speak a lot about empowering employees to do more fulfilling work; as we move forward, it’s going to be about giving people the opportunity to live more fulfilling lives in and outside work. Not only can machine vision unlock more opportunities for people to thrive, but it can also enable businesses to successfully navigate an evolving landscape, reduce costs and increase efficiencies – no matter what challenges and uncertainties lie ahead.

Share

Featured Articles

From $500m to $9bn: Charting Perplexity’s Soaring Valuation

Gen AI challenger is being tipped to receive US$500m in funding, pushing its valuation from US$500m to around US$9bn in just under a year

Workforce Bellwether? 1/4 of Google’s Code Generated by AI

CEO Sundar Pichai revealed a striking statistic at a recent earnings report: a quarter of Google's code is generated by AI before engineers validate it

Nokia Bell Labs & Cambridge Future Tech Bring AI to Health

Nokia Bell Labs partners with Cambridge Future Tech are partnering to bring OmniBuds, an AI-powered earbud system for monitoring vital health to market

Meta AI: The Social Media AI Tool That Reached 500m Users

AI Strategy

How Mondra's Azure-Powered AI Chatbot Helps Reduce Emissions

AI Applications

Why Toyota See NTT Solutions as Key to Making Cars Safer

AI Applications