Share

Export Citation

APA
MLA
Chicago
Harvard
Vancouver
BIBTEX
RIS
Universitas Hasanuddin
Research output:Contribution to journalArticlepeer-review

Speech to text for Indonesian homophone phrase with Mel Frequency Cepstral Coefficient

Bustamin A.

Proceedings Cyberneticscom 2016 International Conference on Computational Intelligence and Cybernetics

Published: 2017Citations: 4

Abstract

In this study, speech to text system for homophone phrases in Indonesian was designed using an extraction method which featured Mel Frequency Cepstral Coefficient (MFCC). Feature extraction results were classified by comparing the two classifiers of Backpropagation Neural Network (BPNN) and K-Nearest Neigbour (KNN). The input data used were the recordings of each of 3 male and female respondents. The recording process was conducted for 5 seconds at a sampling frequency of 16 kHz and at channel mono. Classification results with test data to BPNN showed accuracy rates of 96.67% and 90% respectively for male and female respondents. Moreover, the level of accuracy obtained on KNN amounted to 83.33% for males and 73.33% for females.

Other files and links

Fingerprint

Mel-frequency cepstrumSciences
Speech recognitionSciences
Feature extractionSciences
Computer scienceSciences
PhraseSciences
HomophoneSciences
Artificial intelligenceSciences
Pattern recognition (psychology)Sciences
CepstrumSciences
Artificial neural networkSciences
PhilosophySciences
LinguisticsSciences