Publication:
Automatic Sinhala News Classification Approach for News Platforms

Thumbnail Image

Type:

Article

Date

2020-12-18

Journal Title

Journal ISSN

Volume Title

Publisher

Institute of Electrical and Electronics Engineers Inc.

Research Projects

Organizational Units

Journal Issue

Abstract

Because of generating various news articles in large scale, online sources moved into an automatic categorization mechanism. This research has been conducted using LDA topic modeling approach and using other classification algorithms to establish a news categorization solution. Sinhala news websites have only few news categories and do not have any relationships or hierarchies between the categories. Therefore, some users require to search manually and find the necessary articles which are in those categories. Purpose of this study is to build a news categorization model with categorization hierarchies for Sinhala news articles. The goals of the models are to identify the most suitable news category for a related news article and develop hierarchies using generated news categories and assign the news articles according to the hierarchical structure. The final experiments and evaluations show that the solution performs well to solve the automatic categorization problem in Sinhala news platforms.

Description

Keywords

Sinhala text classification, topic modeling, natural language processing, machine learning

Citation

Endorsement

Review

Supplemented By

Referenced By