Publication:
AI-Powered Sinhala Character Recognition and Digital Transformation

dc.contributor.authorVidyalankara, R. A. Sumudu
dc.date.accessioned2025-04-28T06:09:14Z
dc.date.available2025-04-28T06:09:14Z
dc.date.issued2024-12
dc.description.abstractThis research focuses on developing an effective system for recognizing and converting handwritten and printed Sinhala text into digital format. As the primary language of Sri Lanka, Sinhala presents unique challenges for handwriting recognition due to its intricate strokes and complex character structures. Existing methods often fall short in accurately interpreting Sinhala characters, highlighting the need for a tailored solution. The proposed system employs Convolutional Neural Networks to classify and recognize Sinhala characters with high precision. A key innovation is error-guided preprocessing, applied iteratively to images misclassified during the initial training phase. Failed images are processed using methods such as blurriness detection, dynamic contrast adjustment, noise removal with bilateral filtering, and morphological operations for stroke enhancement. This approach ensures improved image quality and meaningful feature extraction for subsequent retraining. Additional techniques like contour analysis and gradient- based feature extraction further enhance the system's recognition capabilities. To optimize performance, strategies such as data augmentation, hyperparameter tuning, and model ensembles are explored, improving the system's adaptability and robustness. The system is evaluated on a diverse dataset of handwritten and printed Sinhala text, demonstrating significant improvements in recognition, accuracy and efficiency. Its applications include optical character recognition, document digitization, and automated form processing. This thesis contributes a comprehensive, CNN-based methodology tailored to the complexities of Sinhala script, offering a promising solution for advancing Sinhala language technologies.en_US
dc.identifier.urihttps://rda.sliit.lk/handle/123456789/4061
dc.language.isoenen_US
dc.publisherSLIITen_US
dc.subjectData Preparationen_US
dc.subjectAugmentation Techniquesen_US
dc.subjectSystem Designen_US
dc.subjectPerformance Optimizationen_US
dc.subjectprototype systemen_US
dc.subjectRobustness Testingen_US
dc.titleAI-Powered Sinhala Character Recognition and Digital Transformationen_US
dc.typeThesisen_US
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 2 of 2
Thumbnail Image
Name:
MS12906468 - AI-Powered Sinhala Character Recognition 1-09.pdf
Size:
185.79 KB
Format:
Adobe Portable Document Format
Description:
No Thumbnail Available
Name:
MS12906468 - AI-Powered Sinhala Character Recognition.pdf
Size:
4.83 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: