AI still lacks imagination required to crack computer vision

Share
Human vision still beats computer vision powered by artificial intelligence – for now. But global business will benefit from human-inspired AI eyesight

Artificial intelligence models cannot yet beat humans for image recognition, researchers have found. But AI-powered computer vision will be an essential part of global commercial growth, and engineers hope inspiration from the human eye might help even the score. 

Researchers from HSE University and Moscow Polytechnic University found AI models are worse at recognising images due to a lack of understanding about physiology. 

The scientists uploaded images of classical visual illusions to IBM’s Watson Visual Recognition’s service. These images consisted of geometric silhouettes partially hidden by geometric shapes. The system attempted to ascertain the nature of the image and give an indication of the certainty of its response.

The study – the results of which were published in the Proceedings of the Seventh International Congress on Information and Communication Technology – discovered artificial intelligence is unable to recognise imaginary figures.

“Objects similar to those that we used during the experiment can be found in real life,” says study author Vladimir Vinnikov, an analyst at the Laboratory of Methods for Big Data Analysis of HSE Faculty of Computer Science.  “For example, autopilot of a car or aeroplane perceives a trailer or a radio tower, which at night are indicated only by marker lights, the same way as we perceive imaginary geometric shapes.”

A human viewer will complete the picture due to constant eye movement, a physiological feature of our vision, explains Vinnikov. AI-powered vision does not operate the same way as its light-sensitive matrix is not as free in movement as the human eye and cannot complete imaginary lines. Machine vision sees only what is actually depicted, says Vinnikov, whereas humans can complete the image using their imagination.

Neural networks will be essential for global commercial growth

Neural network image recognition systems will be essential in the commercial sector, but accuracy of recognition is going to remain an important question until it is properly answered. For example, an accident may occur if the autopilot of a car or aeroplane does not recognise an object with low contrast relative to the background, say researchers.

Inaccuracy of machine image recognition can be corrected, however, by simulating some of the physiological features of human eye movements. The addition of vector description of the images would also help machine-powered vision make more sense of potential confusing images.

“Imaginary objects should definitely be used as tests in systems that depend on the recognition of photo and video streams, for example, in autopilots of cars or drones. This will help to avoid the risks associated with the use of machine intelligence systems in industry and transport systems,” says Vinnikov.

Share

Featured Articles

JLR Harnesses AI Power of Tata Communications's MOVE

With the Tata Communications MOVE™ platform JLR is ensuring electric fleet connectivity, driving the future of software-defined automotive manufacturing

Why Persona and Audience Segmentation is a Marketer's Edge

Marketers must use data-driven insights, precise segmentation, and automation to craft personalised campaigns that engage at every step of the journey

WEF Report: The Impact of AI Driving 170m New Jobs by 2030

The WEF predicts AI & tech will create 170 million jobs while displacing 92 million, urging upskilling to prepare workforces for the AI-driven future

The UK’s £14bn Pledge to Become a World Leader in AI

AI Strategy

Why IEA & Microsoft Have Launched AI Tool for Sustainability

AI Strategy

Nvidia’s New AI Releases at CES 2025: Explored

AI Applications