Share

Export Citation

APA
MLA
Chicago
Harvard
Vancouver
BIBTEX
RIS
Universitas Hasanuddin
Research output:Contribution to journalArticlepeer-review

Improving the Performance of Voice Lie Detection Using Mel Frequency Cepstral Coefficients and Long Short Term Memory Models

Kusumawati D.

Journal of Advanced Computational Intelligence and Intelligent Informatics

Q3
Published: 2026

Abstract

The objective of this study is to show that the combination of the Mel frequency cepstral coefficient (MFCC) and long short-term memory (LSTM) can be an effective approach for voice lie detection. To improve the performance of voice lie detection, a modified MFCC was used to extract important features in voice. An MFCC was modified by adding zero-crossing rate, audio entropy, and energy entropy parameters to detect changes in tone in each voice frame. LSTM was used to detect and classify voice-based lies. Datasets were obtained from the video recordings of the trial of a suspect. A total of 847 voice datasets were obtained after applying the time stretching augmentation technique where the audio duration was changed from 28.0 s to 4 s per video. The lie classification process was performed using the LSTM method that was equipped with additional dropout and dense layers and optimized using the adaptive moment estimation (Adam) optimizer. The results showed that the combination of the MFCC and LSTM achieved a classification accuracy level of 97% and an area under the curve value of 0.97 using epoch parameters of 200, Adam optimizer, and learning rate of 0.0001. This study concluded that the addition of zero-crossing rate, audio entropy, and energy entropy parameters to the MFCC extraction feature and the use of Adam optimizer in LSTM improved the accuracy of voice lie detection.

Access to Document

10.20965/jaciii.2026.p0113

Other files and links

Fingerprint

Mel-frequency cepstrumSciences
Computer scienceSciences
Speech recognitionSciences
Long short term memorySciences
Artificial intelligenceSciences
CepstrumSciences
Pattern recognition (psychology)Sciences
Entropy (arrow of time)Sciences
Feature extractionSciences
Artificial neural networkSciences
Energy (signal processing)Sciences
Feature (linguistics)Sciences
Audio signalSciences
Support vector machineSciences
Time–frequency analysisSciences
Discrete cosine transformSciences
Process (computing)Sciences
Moment (physics)Sciences
Frequency bandSciences