On the role of lip joints in the perception of visual speech

Wireless

* = equal contribution

Creating realistic lip movement from audio to simulate speech production is critical to driving natural character animation. Previous research has shown that the traditional measures used to improve and evaluate models for generating lip movement from speech are not a good predictor of subjective opinion of animation quality. Developing subjective metrics requires first understanding what influences a person’s perception of quality. In this work, we focus on the degree of articulation and conduct a series of experiments to study how the strength of articulation affects human perception of lip movement accompanying speech. Specifically, we study how increased movement of the non-articulated (blunted) and articulated (exaggerated) lips affects human perception of quality. We examine the effect of the power of expression on human perception when considering only lip movement, where viewers are presented with speaking faces represented by landmarks, and in the context of anthropomorphic characters, where viewers are presented with realistic videos. Our results show that viewers consistently prefer hyperarticulated lip movement over non-articulated lip movement and that this preference generalizes across different speakers and embodiments.

Source link

Post a Comment

Cookie Consent
We serve cookies on this site to analyze traffic, remember your preferences, and optimize your experience.
Oops!
It seems there is something wrong with your internet connection. Please connect to the internet and start browsing again.
AdBlock Detected!
We have detected that you are using adblocking plugin in your browser.
The revenue we earn by the advertisements is used to manage this website, we request you to whitelist our website in your adblocking plugin.
Site is Blocked
Sorry! This site is not available in your country.