Unsupervised Sinhala Cyberbullying Categorization

Please use this identifier to cite or link to this item: https://rda.sliit.lk/handle/123456789/2829

Title:	Unsupervised Sinhala Cyberbullying Categorization
Authors:	Chandrasena, B.G.M
Keywords:	Cyberbullying Hate Speech Machine Learning NLP (Natural Language Processing) Supervised Learning Unsupervised Learning Artificial Neural Network
Issue Date:	2021
Abstract:	The objective of unsupervised machine learning is to categorize the social media comments into a given number of pre-learned categories. The earlier studies of this domain have used many the dataset for supervised learning & introduced a large number of techniques, methodologies. A major challenge there was training labels. Although words with training comments are easy to find, separating them manually is not an easy task. Through this research, we hope to find a solution to this using unsupervised machine learning techniques. the proposed technique divides the comments into words and removed special characters, emojis, and links from the comments & categorized each comment using a keyword list of each category and similarity findings. And then this was used to categorize comments for training. The implemented method shows the same performance, by Comparison with other supervised machine learning techniques for cyberbullying. Therefore, this mechanism can be used in any other places where low-cost cyberbullying identification is needed. This also can be used to create train comments.
URI:	http://rda.sliit.lk/handle/123456789/2829
Appears in Collections:	2021

Files in This Item:

File	Description	Size	Format
THESIS--MS19810874.pdf Until 2050-12-31		1.94 MB	Adobe PDF	View/Open Request a copy
THESIS--MS19810874Abs.pdf		122.58 kB	Adobe PDF	View/Open