getCITED   
  Home     Search     Add Content     Reports     Help  
Edit Publication | Edit Contributors | Delete Publication | Edit References | Edit Citations
Add to Bookstack | Show Bookstack | Change Bookstack

Audio-Visual Based Multi-Sample Fusion to Enhance Correlation Filters Speaker Verification System

Post a Comment
CONTRIBUTORS:
  Author Dzati Athiar Ramli
  Author Salina Abdul Samad
  Author Aini Hussain
JOURNAL:
  International Journal on Computer Science and Engineering (IJCSE), 2(4), 1286 - 1294.
YEAR: 2010
PUB TYPE: Journal Article
SUBJECT(S): multi-sample fusion, correlation filter,spectrographic image, lipreading, speaker verification.
DISCIPLINE: Computer Science
HTTP: http://www.enggjournals.com/ijcse/doc/IJCSE10-02-04-21.pdf
LANGUAGE: English
PUB ID: 103-488-562 (Last edited on 2011/06/11 01:06:37 GMT-6)
SPONSOR(S):
 
ABSTRACT:
In this study, we propose a novel approach for speaker verification system that uses a spectrogram image as features and Unconstrained Minimum Average Correlation Energy (UMACE) filters as classifiers. Since speech signal is a behavioral signal, the speech data has a tendency not to consistently reproduce due to the change of speaking rates, health, emotional conditions, temperature and humidity. In order to overcome this problem, a modification of UMACE filters architecture is proposed by executing a multi-sample fusion using speech and lipreading data. So as to evaluate the outstanding fusion scheme, five multi-sample fusion strategies, i.e. maximum, minimum, median, average and majority vote are first experimented using the speech signal data. Afterward, the performance of the audio-visual system using the enhanced UMACE filters is then tested. Here, lipreading data is combined to the audio samples pool and the outstanding fusion scheme that found in prior experiment is used as multi-sample fusion scheme. The Digit Database had been used for performance evaluation and the performance up to 99.64% is achieved by using the enhanced UMACE filters for the speech only system which is 6.89% improvement compared with the base line approach. Subsequently, the implementation of the audio-visual system is observed to be significant in order to broaden the PSR score interval between the authentic and imposter data as well as to further improve the performance of audio only system that offer toward a robust verification system.
STATISTICS
Click on # to view
 Citations  
 References  
 Comments  
 Quality      0/0.00 
 Interest      0/0.00 
 View(er)s   1/82 
Quality
  N/A
High
  7
  6
  5
  4
  3
  2
  1
Low
Interest
  N/A
High
  7
  6
  5
  4
  3
  2
  1
Low
Prev | Next

    ABOUT getCITED   |    CONTACT US   |    USER INFO   |    PREFERENCES   |    PRIVACY   |    LOG IN   
Comments? Suggestions? Send them to feedback@getCITED.org.

Copyright © 2000-2013 getCITED Inc. All Rights Reserved.