Publication: Automatic Sinhala News Classification Approach for News Platforms
Type:
Article
Date
2020-12-18
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Institute of Electrical and Electronics Engineers Inc.
Abstract
Because of generating various news articles in
large scale, online sources moved into an automatic
categorization mechanism. This research has been conducted
using LDA topic modeling approach and using other
classification algorithms to establish a news categorization
solution. Sinhala news websites have only few news categories
and do not have any relationships or hierarchies between the
categories. Therefore, some users require to search manually
and find the necessary articles which are in those categories.
Purpose of this study is to build a news categorization model
with categorization hierarchies for Sinhala news articles. The
goals of the models are to identify the most suitable news
category for a related news article and develop hierarchies
using generated news categories and assign the news articles
according to the hierarchical structure. The final experiments
and evaluations show that the solution performs well to solve
the automatic categorization problem in Sinhala news
platforms.
Description
Keywords
Sinhala text classification, topic modeling, natural language processing, machine learning
