Speech to text for Indonesian homophone phrase with Mel Frequency Cepstral Coefficient

Bustamin A.

doi:10.1109/CyberneticsCom.2016.7892562

Share

Export Citation

APA

MLA

Chicago

Harvard

Vancouver

BIBTEX

RIS

Universitas Hasanuddin

Research output:Contribution to journal›Article›peer-review

Speech to text for Indonesian homophone phrase with Mel Frequency Cepstral Coefficient

Bustamin A.

Proceedings Cyberneticscom 2016 International Conference on Computational Intelligence and Cybernetics

Published: 2017Citations: 4

Abstract

In this study, speech to text system for homophone phrases in Indonesian was designed using an extraction method which featured Mel Frequency Cepstral Coefficient (MFCC). Feature extraction results were classified by comparing the two classifiers of Backpropagation Neural Network (BPNN) and K-Nearest Neigbour (KNN). The input data used were the recordings of each of 3 male and female respondents. The recording process was conducted for 5 seconds at a sampling frequency of 16 kHz and at channel mono. Classification results with test data to BPNN showed accuracy rates of 96.67% and 90% respectively for male and female respondents. Moreover, the level of accuracy obtained on KNN amounted to 83.33% for males and 73.33% for females.

Access to Document

10.1109/CyberneticsCom.2016.7892562

Fingerprint

Mel-frequency cepstrumSciences

Speech recognitionSciences

Feature extractionSciences

Computer scienceSciences

PhraseSciences

HomophoneSciences

Artificial intelligenceSciences

Pattern recognition (psychology)Sciences

CepstrumSciences

Artificial neural networkSciences

PhilosophySciences

LinguisticsSciences

Share

Export Citation

Speech to text for Indonesian homophone phrase with Mel Frequency Cepstral Coefficient

Abstract

Access to Document

Other files and links

Related Papers

Fingerprint