Investigation of MFCC feature representation for classification of spoken letters using Multi-Layer Perceptrons (MLP)

Bibliographic Details
Published in: ICCAIE 2011 - 2011 IEEE Conference on Computer Applications and Industrial Electronics
Main Author: Daud M.S.; Yassin I.M.; Zabidi A.; Johari M.A.; Salleh M.K.M.
Format: Conference paper
Language: English
Published: 2011
Online Access: https://www.scopus.com/inward/record.uri?eid=2-s2.0-84858756069&doi=10.1109%2fICCAIE.2011.6162096&partnerID=40&md5=b968932fde18023db818c50bd2e5437d
Description
Summary: This paper demonstrates that Mel-Frequency Cepstral Coefficients (MFCC) are an effective feature representation for spoken-letter recognition. A Multi-Layer Perceptron (MLP) was used as a classifier to discriminate between two spoken letters, 'A' and 'S'. The dataset consists of 72 samples (35 of the spoken letter 'A' and 37 of 'S'), each represented by its MFCC features. Several experiments were conducted to determine the network parameters that yield the best classification results. The results indicate that the optimal network structure has 2 hidden units, which yielded a classification accuracy of 100% (training) and 93% (testing). © 2011 IEEE.
ISSN:
DOI: 10.1109/ICCAIE.2011.6162096
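
Illustrative sketch: The classifier described in the summary (an MLP with 2 hidden units separating two classes of MFCC feature vectors) can be sketched in plain Python as below. This is not the authors' implementation. The 13-coefficient feature length, the tanh and sigmoid activations, the learning rate, the epoch count, and the synthetic Gaussian data standing in for real MFCC vectors are all assumptions made for illustration; only the class sizes (35 and 37) and the hidden-unit count (2) come from the record.

```python
import math
import random

N_COEFFS = 13   # assumed MFCC vector length (not stated in the summary)
N_HIDDEN = 2    # hidden-unit count reported as optimal in the summary


def sigmoid(z):
    # Clamp the argument to avoid overflow in math.exp.
    z = max(-60.0, min(60.0, z))
    return 1.0 / (1.0 + math.exp(-z))


class TinyMLP:
    """One tanh hidden layer and a sigmoid output for binary classification."""

    def __init__(self, n_inputs, n_hidden, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_inputs)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.b2 = 0.0

    def forward(self, x):
        h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
             for row, b in zip(self.w1, self.b1)]
        y = sigmoid(sum(w * hi for w, hi in zip(self.w2, h)) + self.b2)
        return h, y

    def train_step(self, x, target, lr=0.05):
        h, y = self.forward(x)
        dz2 = y - target  # gradient of cross-entropy w.r.t. output pre-activation
        for j in range(len(self.w2)):
            # Backpropagate through tanh using the not-yet-updated output weight.
            dh = dz2 * self.w2[j] * (1.0 - h[j] ** 2)
            self.w2[j] -= lr * dz2 * h[j]
            for i in range(len(x)):
                self.w1[j][i] -= lr * dh * x[i]
            self.b1[j] -= lr * dh
        self.b2 -= lr * dz2

    def n_params(self):
        return (sum(len(row) for row in self.w1) + len(self.b1)
                + len(self.w2) + 1)


# Synthetic stand-in data: two Gaussian clusters, NOT real MFCC features.
rng = random.Random(1)


def make_samples(n, mean):
    return [[rng.gauss(mean, 1.0) for _ in range(N_COEFFS)] for _ in range(n)]


data = ([(x, 0) for x in make_samples(35, -1.0)]     # 35 samples, class 'A'
        + [(x, 1) for x in make_samples(37, 1.0)])   # 37 samples, class 'S'

model = TinyMLP(N_COEFFS, N_HIDDEN)
for _ in range(50):               # assumed number of training epochs
    rng.shuffle(data)
    for x, t in data:
        model.train_step(x, t)

correct = sum(1 for x, t in data if (model.forward(x)[1] >= 0.5) == (t == 1))
accuracy = correct / len(data)
print(f"parameters: {model.n_params()}, training accuracy: {accuracy:.2f}")
```

With real recordings, the synthetic vectors would be replaced by MFCC features extracted from each utterance, and a held-out test split would be needed to reproduce a training/testing comparison like the one reported.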