NEURAL NETWORK APPROACHES FOR ACCURATE AFAN OROMO SPELL CHECKING AND CORRECTION

dc.contributor.authorAyenalem Dejene
dc.date.accessioned2025-12-16T17:10:23Z
dc.date.issued2025-09-24
dc.description.abstractAfan Oromo, a widely spoken Cushitic language, lacks advanced natural language processing (NLP) tools like spell checkers due to limited resources and linguistic expertise. Both native and non-native speakers face challenges in writing Afan Oromo correctly, partly because its Latin-based Qubee script was adopted in 1991. Traditional spell-checking methods, such as dictionary lookup and rule-based approaches, are inadequate for Afan Oromo’s highly inflectional morphology. This thesis proposes a neural network-based spell checker using a sequence-to-sequence (Seq2Seq) model with Long Short-Term Memory (LSTM) layers. A corpus of 596,948 words was collected from BBC Afan Oromoo using Sketch Engine, ensuring compliance with BBC’s terms of service. The model was trained to detect and correct spelling errors, achieving 100% error recall and 52.47% precision. This work is the first to apply neural networks to Afan Oromo spell checking, offering a scalable solution for under-resourced languages.
dc.identifier.urihttps://repository.mu.edu.et/handle/123456789/1140
dc.language.isoen
dc.publisherMekelle University
dc.subjectAfan Oromo
dc.subjectSpell Checker
dc.subjectNeural Network
dc.subjectSequence-to-Sequence
dc.subjectLSTM
dc.titleNEURAL NETWORK APPROACHES FOR ACCURATE AFAN OROMO SPELL CHECKING AND CORRECTION
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Name:
Ayenalem Dejene.pdf
Size:
925.37 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: