Investigation of MFCC feature representation for classification of spoken letters using Multi-Layer Perceptrons (MLP)
Published in: | ICCAIE 2011 - 2011 IEEE Conference on Computer Applications and Industrial Electronics |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: | 2011 |
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-84858756069&doi=10.1109%2fICCAIE.2011.6162096&partnerID=40&md5=b968932fde18023db818c50bd2e5437d |
Summary: | In this paper, Mel-Frequency Cepstral Coefficients (MFCC) are demonstrated as an effective feature representation for spoken-letter recognition. A Multi-Layer Perceptron (MLP) was used as a classifier to discriminate between two spoken letters, 'A' and 'S'. The dataset consists of 72 samples (35 of the spoken letter 'A' and 37 of 'S'), each represented using MFCC features. Several experiments were conducted to determine the network parameters that yield the best classification results. The results indicate that the optimal network structure had 2 hidden units, which yielded a classification accuracy of 100% (training) and 93% (testing). © 2011 IEEE. |
ISSN: | |
DOI: | 10.1109/ICCAIE.2011.6162096 |