December 7, 2015

What makes Tom Hanks look like Tom Hanks?

by Jennifer Langston, University of Washington

Tom Hanks has appeared in many acting roles over the years, playing young and old, smart and simple. Yet we always recognize him as Tom Hanks. Why? Is it his appearance? His mannerisms? The way he moves?

University of Washington researchers have demonstrated that it's possible for machine learning algorithms to capture the 'persona' and create a digital model of a well-photographed person like Tom Hanks from the vast number of images of them available on the Internet.

With enough visual data to mine, the algorithms can also animate the digital model of Tom Hanks to deliver speeches that the real actor never performed.

"One answer to what makes Tom Hanks look like Tom Hanks can be demonstrated with a computer system that imitates what Tom Hanks will do," said lead author Supasorn Suwajanakorn, a UW graduate student in computer science and engineering.

The technology relies on advances in 3-D face reconstruction, tracking, alignment, multi-texture modeling and puppeteering that have been developed over the last five years by a research group led by UW assistant professor of computer science and engineering Ira Kemelmacher-Shlizerman. The new results will be presented in a paper at the International Conference on Computer Vision in Chile on Dec. 16.

The team's latest advances include the ability to transfer expressions and the way a particular person speaks onto the face of someone else—for instance, mapping former president George W. Bush's mannerisms onto the faces of other politicians and celebrities.

An online video of George W. Bush animates digital models of other celebrities and politicians synthesized from Internet photo collections. Credit: University of Washington

It's one step toward a grand goal shared by the UW computer vision researchers: creating fully interactive, three-dimensional digital personas from family photo albums and videos, historic collections or other existing visuals.

As virtual and augmented reality technologies develop, they envision using family photographs and videos to create an interactive model of a relative living overseas or a far-away grandparent, rather than simply Skyping in two dimensions.

"You might one day be able to put on a pair of augmented reality glasses and there is a 3-D model of your mother on the couch," said senior author Kemelmacher-Shlizerman. "Such technology doesn't exist yet—the display technology is moving forward really fast—but how do you actually re-create your mother in three dimensions?"

One day the reconstruction technology could be taken a step further, researchers say.

"Imagine being able to have a conversation with anyone you can't actually get to meet in person—LeBron James, Barack Obama, Charlie Chaplin—and interact with them," said co-author Steve Seitz, UW professor of computer science and engineering. "We're trying to get there through a series of research steps. One of the true tests is can you have them say things that they didn't say but it still feels like them? This paper is demonstrating that ability."

Existing technologies to create detailed three-dimensional holograms or digital movie characters like Benjamin Button often rely on bringing a person into an elaborate studio. They painstakingly capture every angle of the person and the way they move—something that can't be done in a living room.

Credit: University of Washington

Other approaches still require a person to be scanned by a camera to create basic avatars for video games or other virtual environments. But the UW computer vision experts wanted to digitally reconstruct a person based solely on a random collection of existing images.

To reconstruct celebrities like Tom Hanks, Barack Obama and Daniel Craig, the machine learning algorithms mined a minimum of 200 Internet images taken over time in various scenarios and poses—a process known as learning 'in the wild.'

"We asked, 'Can you take Internet photos or your personal photo collection and animate a model without having that person interact with a camera?'" said Kemelmacher-Shlizerman. "Over the years we created algorithms that work with this kind of unconstrained data, which is a big deal."

Suwajanakorn more recently developed techniques to capture expression-dependent textures—small differences that occur when a person smiles or looks puzzled or moves his or her mouth, for example.

By manipulating the lighting conditions across different photographs, he developed a new approach to densely map the differences from one person's features and expressions onto another person's face. That breakthrough enables the team to 'control' the digital model with a video of another person, and could potentially enable a host of new animation and virtual reality applications.

"How do you map one person's performance onto someone else's face without losing their identity?" said Seitz. "That's one of the more interesting aspects of this work. We've shown you can have George Bush's expressions and mouth and movements, but it still looks like George Clooney."

Provided by University of Washington

Citation: What makes Tom Hanks look like Tom Hanks? (2015, December 7) retrieved 19 April 2024 from https://phys.org/news/2015-12-tom-hanks.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Digital photos can animate a face so it ages and moves before your eyes

1388 shares

Feedback to editors

Comprehensive model unravels quantum-mechanical effects behind photoluminescence in thin gold films

31 minutes ago

Cosmic rays streamed through Earth's atmosphere 41,000 years ago: New findings on the Laschamps excursion

36 minutes ago

Study suggests Io's volcanoes have been active for 4.5 billion years

41 minutes ago

Ghost particle on the scales: Research offers more precise determination of neutrino mass

4 hours ago

Light show in living cells: New method allows simultaneous fluorescent labeling of many proteins

4 hours ago

Warming of Antarctic deep-sea waters contribute to sea level rise in North Atlantic, study finds

4 hours ago

Unraveling water mysteries beyond Earth: Ground-penetrating radar will seek bodies of water on Jupiter

4 hours ago

Baby white sharks prefer being closer to shore, scientists find

8 hours ago

Key protein regulates immune response to viruses in mammal cells

12 hours ago

Unraveling the mysteries of consecutive atmospheric river events

16 hours ago

Load comments (1)

What makes Tom Hanks look like Tom Hanks?

Comprehensive model unravels quantum-mechanical effects behind photoluminescence in thin gold films

Cosmic rays streamed through Earth's atmosphere 41,000 years ago: New findings on the Laschamps excursion

Study suggests Io's volcanoes have been active for 4.5 billion years

Ghost particle on the scales: Research offers more precise determination of neutrino mass

Light show in living cells: New method allows simultaneous fluorescent labeling of many proteins

Warming of Antarctic deep-sea waters contribute to sea level rise in North Atlantic, study finds

Unraveling water mysteries beyond Earth: Ground-penetrating radar will seek bodies of water on Jupiter

Baby white sharks prefer being closer to shore, scientists find

Key protein regulates immune response to viruses in mammal cells

Unraveling the mysteries of consecutive atmospheric river events

Relevant PhysicsForums posts

Number of Multiplications in the FFT Algorithm

Error logging in: onLoginSuccess is not a function

My Website For Creating Interactive Visuals Linked To Equations

Latest Notable AI accomplishments

Building a homemade Long Short Term Memory with FSMs

Most efficient way to randomly choose a word from a file with a list of words

Digital photos can animate a face so it ages and moves before your eyes

See what a child will look like using automated age-progression software (w/ video)

New method captures facial details at high fidelity and real time

Wrinkles and all: Hi-res eyelid reconstruction makes digital doubles look more realistic

Roboticists learn to teach robots from babies

Creating an avatar from a 3-D selfie

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Medical Xpress

Tech Xplore

Science X

What makes Tom Hanks look like Tom Hanks?

Comprehensive model unravels quantum-mechanical effects behind photoluminescence in thin gold films

Cosmic rays streamed through Earth's atmosphere 41,000 years ago: New findings on the Laschamps excursion

Study suggests Io's volcanoes have been active for 4.5 billion years

Ghost particle on the scales: Research offers more precise determination of neutrino mass

Light show in living cells: New method allows simultaneous fluorescent labeling of many proteins

Warming of Antarctic deep-sea waters contribute to sea level rise in North Atlantic, study finds

Unraveling water mysteries beyond Earth: Ground-penetrating radar will seek bodies of water on Jupiter

Baby white sharks prefer being closer to shore, scientists find

Key protein regulates immune response to viruses in mammal cells

Unraveling the mysteries of consecutive atmospheric river events

Relevant PhysicsForums posts

Related Stories

Digital photos can animate a face so it ages and moves before your eyes

See what a child will look like using automated age-progression software (w/ video)

New method captures facial details at high fidelity and real time

Wrinkles and all: Hi-res eyelid reconstruction makes digital doubles look more realistic

Roboticists learn to teach robots from babies

Creating an avatar from a 3-D selfie

Recommended for you

Hyphens in paper titles harm citation counts and journal impact factors

A big step toward the practical application of 3-D holography with high-performance computers

Combining multiple CCTV images could help catch suspects

Applying deep learning to motion capture with DeepLabCut

Training artificial intelligence with artificial X-rays

New model for large-scale 3-D facial recognition

Newsletter sign up

Donate and enjoy an ad-free experience