DSpace Repository

SWAHILI TEXT CLASSIFICATION USING SUPPORT VECTOR MACHINE AND FEATURE SELECTION TO ENHANCE OPINION ANALYSIS IN KENYAN UNIVERSITIES

Show simple item record

dc.contributor.author Obiria, Peter B.
dc.contributor.author Ngigi, Phyllis
dc.contributor.author Machuke, Grace
dc.date.accessioned 2018-03-20T07:46:20Z
dc.date.available 2018-03-20T07:46:20Z
dc.date.issued 2018-01
dc.identifier.citation Obiria, P. B., Ngigi, P., & Machuke, G. (2018). Swahili Text Classification using Support Vector Machine and Feature Selection to Enhance Opinion Analysis in Kenyan Universities. International Advanced Research Journal in Science, Engineering and Technology, 5(1), 60-69. doi:10.17148/IARJSET.2018.5110 en_US
dc.identifier.issn 2393-8021 (Online)
dc.identifier.issn 2394-1588 (Print)
dc.identifier.uri http://localhost:8080/xmlui/handle/123456789/187
dc.description.abstract In Kenya’s social media, Nairobi Swahili is the norm for communication in institutions of higher learning. Extant studies dwells on standard Swahili, affording limited text classification literature for Nairobi Swahili Natural Language Processing. The research explores how social media experience provides new ways for interactions, resulting into new challenges in managing student concerns that now require new knowledge for decision-making. Students have taken advantage of social media platforms by creating virtual discussion forums, which are quickly becoming repositories of collective knowledge. Unfortunately, institutions of higher are not able to utilize the collected knowledge through these platforms. The focus of this research is to ensure knowledge generated via social media is useful through opinion mining to enable extraction, classification and storage to support decision-making. Different algorithms were tested utilizing data from popular social media; operated by students in Kenyan universities. The results showed that SVM gives the best results when used with Linear Kernels and better performance on TF-IDF with N-grams methods. An analysis on the different SVM kernel showed linear kernel to have a better performance at 80% compared to Polynomial kernel and Radial Basis Function kernels, which both stand at 57%. To choose the best feature selection method for use along with linear SVM, TF and TF-IDF were tested. TF-IDF performed better with N-grams at 83%; rendering this research both theoretical and practical significance. The research would provide fast hand information for decision support in Kenyan higher learning institutions using text-mining tools in social media en_US
dc.language.iso en en_US
dc.publisher International Advanced Research Journal in Science, Engineering and Technology (IARJSET) en_US
dc.subject Nairobi Swahili en_US
dc.subject Support Vector Machines en_US
dc.subject Feature Selection en_US
dc.subject N-grams en_US
dc.title SWAHILI TEXT CLASSIFICATION USING SUPPORT VECTOR MACHINE AND FEATURE SELECTION TO ENHANCE OPINION ANALYSIS IN KENYAN UNIVERSITIES en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account