African GIS in Urban Planning (Technical/Methodology) | 17 June 2009
Challenges and Opportunities in Applying Natural Language Processing to African Languages in Kenya: A Methodological Approach
O, d, h, i, a, m, b, o, M, u, i, g, a, i
Abstract
Natural Language Processing (NLP) applications have shown promise in various fields including urban planning, but their adoption for African languages remains limited. The methodology involves a systematic review of existing literature on NLP for African languages, followed by an experimental design where we tested four NLP models (including LSTM) on a dataset comprising 1000 sentences from Swahili and English. Of the four models tested, LSTM showed the highest accuracy in classifying Swahili sentences, achieving an F1 score of 85% with a confidence interval of ±3% While initial results indicate potential for NLP in African languages, further research is needed to validate these findings across diverse linguistic and cultural contexts. Future studies should focus on developing more robust models that can handle the complexities of African languages' syntax and semantics. Natural Language Processing, African Languages, Urban Planning, Methodology, Kenya Model estimation used $\hat{\theta}=argmin<em>{\theta}\sum</em>i\ell(y<em>i,f</em>\theta(x<em>i))+\lambda\lVert\theta\rVert</em>2^2$, with performance evaluated using out-of-sample error.