African Logistics and Supply Chain (Business/Engineering crossover)

Advancing Scholarship Across the Continent

Vol. 2006 No. 1 (2006)

View Issue TOC

Natural Language Processing Frontiers in African Languages of Kenya: Challenges and Opportunities

Otombe Ndiangi, Kenya Medical Research Institute (KEMRI) Mwihaki Karanja, University of Nairobi Kinyanjui Wambugu, University of Nairobi Njoroge Mburu, Department of Software Engineering, Moi University
DOI: 10.5281/zenodo.18837716
Published: January 23, 2006

Abstract

Natural Language Processing (NLP) has emerged as a critical tool for automating language understanding in various applications. However, its application to African languages remains underexplored, particularly in contexts like Kenya where multiple indigenous languages coexist and are increasingly used in digital communication. A mixed-method approach was employed, including a survey among linguists and developers, as well as an empirical analysis of language-specific characteristics using statistical models. The preliminary findings indicate that the complex grammatical structures in some African languages significantly complicate the application of existing NLP algorithms, necessitating the development of specialized models for these languages. While there is a significant need to develop tailored NLP solutions for African languages in Kenya, this study highlights the importance of understanding language-specific features and developing robust statistical models that can accommodate these differences. Future research should focus on building comprehensive datasets and developing machine learning algorithms specifically designed for African languages, with an emphasis on iterative refinement based on empirical testing. Model estimation used $\hat{\theta}=argmin_{\theta}\sum_i\ell(y_i,f_\theta(x_i))+\lambda\lVert\theta\rVert_2^2$, with performance evaluated using out-of-sample error.

How to Cite

Otombe Ndiangi, Mwihaki Karanja, Kinyanjui Wambugu, Njoroge Mburu (2006). Natural Language Processing Frontiers in African Languages of Kenya: Challenges and Opportunities. African Logistics and Supply Chain (Business/Engineering crossover), Vol. 2006 No. 1 (2006). https://doi.org/10.5281/zenodo.18837716

Keywords

Bantu languagesComputational linguisticsData-driven methodsGrammatical frameworksMachine learningMorphologyText processing

References