Real-time language independent lip synchronization method using a genetic algorithm

Zorić, Goranka; Pandžić, Igor

izvor podataka: crosbi ✓

Real-time language independent lip synchronization method using a genetic algorithm (CROSBI ID 120123)

Prilog u časopisu | izvorni znanstveni rad | međunarodna recenzija

Zorić, Goranka ; Pandžić, Igor Real-time language independent lip synchronization method using a genetic algorithm // Signal processing, 86 (2006), 12; 3644-3656-x

Podaci o odgovornosti

Autori

Zorić, Goranka ; Pandžić, Igor

Osnovni podaci na izvornom jeziku
Osnovni podaci na ostalim jezicima

Jezik

engleski

Naslov

Real-time language independent lip synchronization method using a genetic algorithm

Sažetak

Lip synchronization is a method for the determination of the mouth and tongue motion during a speech. It is widely used in multimedia productions, and real time implementation is opening application possibilities in multimodal interfaces. We present an implementation of real time, language independent lip synchronization based on the classification of the speech signal, represented by MFCC vectors, into visemes using neural networks. Our implementation improves real time lip synchronization by using a genetic algorithm for obtaining a near optimal neural network topology. The automatic neural network configuration with genetic algorithms eliminates the need for tedious manual neural network design by trial and error and considerably improves the viseme classification results. Moreover, by the direct usage of visemes as the basic unit of the classification, computation overhead is reduced, since only visemes are used for the animation of the face. The results are obtained in comprehensive validation of the system using three different evaluation methods, two objective and one subjective. The obtained results indicate very good lip synchronization quality in real time conditions and for different languages, making the method suitable for a wide range of applications.

Ključne riječi

lip synchronization; lip sync; facial animation; MPEG-4 FBA; human-computer interaction; virtual characters; speech processing; neural networks; genetic algorithms

Napomena

Special issue of Signal Processing on Multimodal Human-Computer Interfaces

Jezik

nije evidentirano

Naslov

nije evidentirano

Sažetak

nije evidentirano

Ključne riječi

nije evidentirano

Napomena

nije evidentirano

Podaci o izdanju

Časopis

Signal processing

Volumen (broj)

86 (12)

Godina

2006.

Stranice rada

3644-3656-x

Status objave rada

objavljeno

ISSN

0165-1684

Povezanost rada

Povezane osobe

Igor Sunday Pandžić (CroRIS ID: 1710; MBZ: 252724) (autor/i)

Goranka Zorić (CroRIS ID: 17075; MBZ: 259432) (autor/i)

Povezane ustanove

Fakultet elektrotehnike i računarstva (036) (autorova ustanova)

Povezani projekti

Utjelovljeni razgovorni agenti za usluge u umreženim i pokretljivim sustavima (rezultat rada na projektu)

Područje

Elektrotehnika, Računarstvo, Informacijske i komunikacijske znanosti

Indeksiranost

Scopus

Current Contents Connect (CCC)

Web of Science Core Collection, Science Citation Index Expanded (WoSCC-SCI-Exp)

Web of Science Core Collection, SCI-Exp, SSCI & A&HCI (WoSCC-SCI-Exp, SSCI, A&HCI)