June 8, 2015

Team develops vision system that improves object recognition

by Disney Research

A research group at Disney Research Pittsburgh has developed a computer vision system that, much like humans, can continuously improve its ability to recognize objects by picking up hints while watching videos.

Like most other object recognition systems, the Disney system builds a conceptual model of an object, be it an airplane or a soap dispenser, by using a learning algorithm to analyze a number of example images of the object.

What's different about the Disney system is that it then uses that model to identify objects, when it can, in videos. As it does, it is sometimes able to glean new information about those objects, which lets it make its own model of the object more complex. That, in turn, enables the system to recognize such objects more readily in a wider variety of conditions.

"This process continues, potentially indefinitely, over the lifetime of the recognition system," said Leonid Sigal, a senior research scientist at Disney Research Pittsburgh. "This is a learning system that is continuously evolving through unsupervised experience to build a more complete and complex model of the world."

Sigal and his co-investigators - Alina Kuznetsova and Bodo Rosenhahn of Leibniz University Hannover, and former Disney post-doctoral researcher Sung Ju Hwang, now of Ulsan National Institute of Science and Technology in South Korea - will present their findings at the IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, June 7-12, in Boston.

Recognizing objects in images, though often easy for humans, remains a challenge for automated systems. Systems that learn to recognize objects from one set of images may have difficulty recognizing those same objects in the real world or under different sets of conditions, known as domains.

Rather than trying to make a system's original model of an object work more accurately in new domains, the Disney group took a different approach: expanding the object domain incrementally. That means the system's model for each object is continuously fine-tuned as the system encounters new information.
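The idea of expanding a domain incrementally can be illustrated with a toy self-training loop. This is only a minimal sketch and not the researchers' method: it assumes a nearest-centroid model over made-up two-dimensional feature vectors, and the names `self_train`, `centroid` and the cutoff `max_dist` are hypothetical. Confident detections from the "video" are added to the model, which can bring samples within reach that the original model would have rejected.

```python
import math

def centroid(points):
    """Mean of a list of equal-length feature vectors."""
    n = len(points)
    return [sum(p[i] for p in points) / n for i in range(len(points[0]))]

def self_train(labeled, frames, max_dist=1.5):
    """Grow per-class example sets from unlabeled frames (illustrative only).

    labeled:  dict mapping class name -> list of feature vectors
    frames:   unlabeled feature vectors, standing in for video detections
    max_dist: hypothetical confidence cutoff; farther samples are ignored
    """
    examples = {label: list(pts) for label, pts in labeled.items()}
    for x in frames:
        # Score the frame against the current model of every class.
        dists = {label: math.dist(x, centroid(pts))
                 for label, pts in examples.items()}
        best = min(dists, key=dists.get)
        # Only confident detections are fed back into the model.
        if dists[best] <= max_dist:
            examples[best].append(x)
    return examples

# Made-up data: two seed images per class, then four video frames.
seed = {"mug": [[0.0, 0.0], [0.2, 0.1]],
        "stove": [[5.0, 5.0], [5.2, 4.9]]}
video = [[0.8, 0.6], [1.4, 1.2], [9.0, 9.0], [4.6, 4.4]]
model = self_train(seed, video)
```

In this toy run the frame `[1.4, 1.2]` lies farther than `max_dist` from the original mug centroid, but it is accepted because the earlier frame `[0.8, 0.6]` has already shifted the mug model toward it. The frame `[9.0, 9.0]` matches neither class confidently and is ignored.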

One potential problem is that the system, which does this fine-tuning without human supervision, may start ascribing attributes to an object that aren't pertinent, leading to errors in detection. Thus far, however, the Disney researchers have not detected this "domain drift."
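The article does not say how the researchers checked for drift. One simple way to watch for it, shown here purely as an assumption, is to re-score every updated model on the original labeled images and reject an update that does worse there. The helper names `accuracy` and `accept_update`, the `tolerance` parameter and the one-dimensional toy classifiers are all hypothetical.

```python
def accuracy(predict, labeled_images):
    """Fraction of (features, label) pairs that `predict` gets right."""
    hits = sum(1 for x, y in labeled_images if predict(x) == y)
    return hits / len(labeled_images)

def accept_update(old_predict, new_predict, labeled_images, tolerance=0.0):
    """True if the updated model is no worse on the original images.

    tolerance is a hypothetical allowance for small accuracy dips.
    """
    old_score = accuracy(old_predict, labeled_images)
    new_score = accuracy(new_predict, labeled_images)
    return new_score >= old_score - tolerance

# Made-up one-dimensional "images" with their true labels.
originals = [(0.1, "mug"), (0.3, "mug"), (0.9, "stove"), (1.1, "stove")]

def before(x):
    # Model as first trained: decision boundary at 0.6.
    return "mug" if x < 0.6 else "stove"

def drifted(x):
    # Model after unsupervised updates pushed the boundary to 1.0.
    return "mug" if x < 1.0 else "stove"

keep_update = accept_update(before, drifted, originals)
```

Here the drifted model mislabels the stove at 0.9, so its accuracy on the original images falls from 1.0 to 0.75 and the update is rejected.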

They tested their incremental learning method against several other leading object recognition methods, using two standard video datasets that included a variety of objects found in the home. In most instances, it outperformed the other methods in detecting items such as microwave ovens, mugs and stoves. It also got better with experience, not only at detecting these objects in the videos but also at detecting objects in its original training images.

More information: "Expanding Object Detector's HORIZON-Paper" www.disneyresearch.com/wp-cont … ding-Object-Detector%E2%80%99s-HORIZON-Incremental-Learning-Framework-for-Object-Detection-in-Videos-Paper.pdf

Provided by Disney Research

Citation: Team develops vision system that improves object recognition (2015, June 8) retrieved 26 April 2024 from https://phys.org/news/2015-06-team-vision-recognition.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
