Publication: Sentiment Classification of Sinhala Content in Social Media: An Ensemble Approach
Type:
Article
Date
2021-12-09
Journal Title
Journal ISSN
Volume Title
Publisher
IEEE
Abstract
We focus on the binary classification of Sinhala social media content in the sports domain using machine learning algorithms. In particular, we improve upon the accuracy achieved in a previous study of ours that utilized word and character N-grams. We use the base learners from that study to implement a probability-based stacking ensemble approach. This is done by creating a base learner library of 1066 base learners, using 13 different algorithms and different N-gram feature extraction methods. Different base learner combinations from the library are then stacked together to find the best stacking ensemble model. The best stacking ensemble model achieves an accuracy of 83.8% which is an improvement of over 1.5% of our previous study.
Description
Keywords
Sentiment Classification, Sinhala Content, Social Media, Ensemble Approach
Citation
P. Jayasuriya, R. Munasinghe and S. Thelijjagoda, "Sentiment Classification of Sinhala Content in Social Media: An Ensemble Approach," 2021 IEEE 16th International Conference on Industrial and Information Systems (ICIIS), 2021, pp. 140-145, doi: 10.1109/ICIIS53135.2021.9660656.
