Share

Export Citation

APA
MLA
Chicago
Harvard
Vancouver
BIBTEX
RIS
Universitas Hasanuddin
Research output:Contribution to journalArticlepeer-review

Comparative Analysis of Part of Speech Tagging Methods for the Bugis Language: From Statistical to Deep Neural Approaches

Nurfadhilah E.

International Conference on Computer Control Informatics and Its Applications Ic3ina

Published: 2024Citations: 5

Abstract

Part-of-speech (POS) tagging is essential in natural language processing (NLP) that facilitates various downstream applications. However, POS tagging for low-resource languages such as Buginese remains challenging due to the scarcity of annotated data. This paper explores several POS tagging methods, including Unigram, Hidden Markov Model (HMM), Conditional Random Fields (CRF), Long Short-Term Memory (LSTM), Bidirectional LSTM (Bi-LSTM), and Gated Recurrent Unit (GRU), integrated with word embedding techniques. We present a comparative analysis of these methods based on their performance on a newly collected and annotated Bugis language dataset. Our results demonstrate that advanced neural models outperform traditional statistical methods, highlighting the potential of deep learning techniques for low-resource language processing, especially RNN, which gets a higher and significant average F1 Score of 97.54%.

Other files and links

Fingerprint

Computer scienceSciences
Artificial intelligenceSciences
Natural language processingSciences
Speech recognitionSciences
Statistical analysisSciences
Artificial neural networkSciences
Part-of-speech taggingSciences
Part of speechSciences
MathematicsSciences
StatisticsSciences