Share

Export Citation

APA
MLA
Chicago
Harvard
Vancouver
BIBTEX
RIS
Universitas Hasanuddin
Research output:Contribution to journalArticlepeer-review

Entity Extraction in Indonesian Online News Using Named Entity Recognition (NER) with Hybrid Method Transformer, Word2Vec, Attention and Bi-LSTM

Zainuddin Z.

International Journal on Informatics Visualization

Q3
Published: 2025Citations: 3

Abstract

Named Entity Recognition (NER) is a crucial task in Natural Language Processing (NLP) that identifies entities such as person names, locations, and organizations within the text. While many NER studies have concentrated on the English language, there is a significant need for further research on Indonesian NER. Indonesia presents unique challenges due to its structural complexities, polysemy, and ambiguities. Conventional machine learning and deep learning techniques have been widely applied in NER; however, more detailed exploration into integrating these methods for performance improvement is needed. This study introduces a novel hybrid model, TWBiL, which combines Transformer mechanisms, Word2Vec embeddings, Bidirectional Long Short-Term Memory (Bi-LSTM), and Attention mechanisms to enhance NER performance on Indonesian text. TWBiL harnesses the strengths of each component to generate superior word vector representations, extract intricate sentence features, and disambiguate entities contextually. Our experimental results demonstrate the effectiveness of the proposed hybrid model, revealing a significant improvement in NER performance. Specifically, TWBiL achieves an F1-Score of 85.11 on an Indonesian online news dataset, outperforming the traditional Bi-LSTM model, which achieved a score of 75.18. The results indicate that TWBiL effectively reduces ambiguity and captures context more accurately, enhancing entity recognition. Future research should priorities reducing computational time when handling larger datasets without compromising overall NER performance. This study underscores the potential of integrating advanced deep learning techniques to tackle the unique challenges of Indonesian NER, thus providing a solid foundation for further advancements in the field.

Access to Document

10.62527/joiv.9.3.2902

Other files and links

Fingerprint

Word2vecSciences
Named-entity recognitionSciences
Computer scienceSciences
Artificial intelligenceSciences
IndonesianSciences
Natural language processingSciences
TransformerSciences
Speech recognitionSciences
Pattern recognition (psychology)Sciences
EngineeringSciences
LinguisticsSciences
Task (project management)Sciences
EmbeddingSciences
PhilosophySciences
Systems engineeringSciences
VoltageSciences
Electrical engineeringSciences