Please use this identifier to cite or link to this item:
https://rda.sliit.lk/handle/123456789/2829
Title: | Unsupervised Sinhala Cyberbullying Categorization |
Authors: | Chandrasena, B.G.M |
Keywords: | Cyberbullying Hate Speech Machine Learning NLP (Natural Language Processing) Supervised Learning Unsupervised Learning Artificial Neural Network |
Issue Date: | 2021 |
Abstract: | The objective of unsupervised machine learning is to categorize the social media comments into a given number of pre-learned categories. The earlier studies of this domain have used many the dataset for supervised learning & introduced a large number of techniques, methodologies. A major challenge there was training labels. Although words with training comments are easy to find, separating them manually is not an easy task. Through this research, we hope to find a solution to this using unsupervised machine learning techniques. the proposed technique divides the comments into words and removed special characters, emojis, and links from the comments & categorized each comment using a keyword list of each category and similarity findings. And then this was used to categorize comments for training. The implemented method shows the same performance, by Comparison with other supervised machine learning techniques for cyberbullying. Therefore, this mechanism can be used in any other places where low-cost cyberbullying identification is needed. This also can be used to create train comments. |
URI: | http://rda.sliit.lk/handle/123456789/2829 |
Appears in Collections: | 2021 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
THESIS--MS19810874.pdf Until 2050-12-31 | 1.94 MB | Adobe PDF | View/Open Request a copy | |
THESIS--MS19810874Abs.pdf | 122.58 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.