Research Papers - Dept of Information Technology

Permanent URI for this collectionhttps://rda.sliit.lk/handle/123456789/593

Browse

Search Results

Now showing 1 - 3 of 3
  • Thumbnail Image
    PublicationEmbargo
    Sinhala Part of Speech Tagger using Deep Learning Techniques
    (IEEE, 2022-12-21) Sathsarani, M.W.A.R.; Thalawaththa, T.P.A.B.; Galappaththi, N.K.; Danthanarayana, J.N.; Gamage, A
    Natural Language Processing (NLP) is a sub-field of Artificial Intelligence (AI) that consists of a collection of computational methods motivated by theory for the automated classification and reflection of human languages. The foundation for many sophisticated applications of NLP, including named entity recognition, sentiment analysis, machine translation, in-formation retrieval, and information processing, is laid by Part of Speech (POS) tagging, which is part of the lexical layer of NLP systems. In contrast to English, French, German, and other languages from the same geographical region, the development of high-accuracy, stable POS taggers for the Sinhala language is still in its early stages. Hence, Sinhala is identified as a low-resource language. The main objective of this research is to create a POS tagger for the Sinhala language to solve this issue. An innovative and novel strategy that has never been used with the Sinhala language has been designed. This approach has been suggested specifically to evaluate the possibility of enhancing the accuracy compared to other methodologies. So, deep learning algorithms have been applied in this study, which has a significant impact on improving tagger performance. First, highly accurate individual classifiers for primary POS tags were implemented, and then they were combined into one composite model. As expected, all individual classifiers and the final composite model have achieved a higher accuracy level. Thus, it demonstrates that the proposed solution using deep learning algorithms outperformed other methods, such as rule-based and stochastic, in terms of accuracy.
  • Thumbnail Image
    PublicationEmbargo
    Evolutionary Algorithm for Sinhala to English Translation
    (IEEE, 2019-10-08) Nugaliyadde, A; Joseph, J. K; Chathurika, W. M. T; Mallawarachchi, Y
    Machine Translation (MT) is an area in natural language processing, which focuses on translating from one language to another. Many approaches ranging from statistical methods to deep learning approaches are used in order to achieve MT. However, these methods either require a large number of data or a clear understanding about the language. Sinhala language has less digital text which could be used to train a deep neural network. Furthermore, Sinhala has complex rules, and therefore, it is harder to create statistical rules in order to apply statistical methods in MT. This research focuses on Sinhala to English translation using an Evolutionary Algorithm (EA). EA is used to identifying the correct meaning of Sinhala text and to translate it into English. The Sinhala text is passed to identify the meaning in order to get the correct meaning of the sentence. With the use of the EA the translation is carried out. The translated text is passed on to grammatically correct the sentence. This has shown to achieve accurate results.
  • Thumbnail Image
    PublicationEmbargo
    A translator from sinhala to english and english to sinhala (sees)
    (IEEE, 2012-12-12) Wijerathna, L.; Pulasinghe, K; Somaweera, W. L. S. L; Kaduruwana, S. L; Wijesinghe, Y. U; De Silva, D. I; Thellijjagoda, S
    This paper presents a rule based machine translation system which is capable of translating sentences from Sinhala to English and vice versa. This is the first Sinhala to English and English to Sinhala machine translation system which comes with features such as a Sinhalese font translator, which is capable of interpreting Sinhalese words written in English characters (Singlish) to Sinhala characters, and an English grammar and spell checker. An entered sentence to the system will be tokenized and translated according to a rule. When translating Sinhala sentences to English the user input should be in Singlish and when translating English sentences to Sinhala input should be in English. The main objective of this translator is to enable a smooth flow translation of words, sentences and paragraphs to locals as well as foreigners and thereby eliminate the language barrier. A considerable amount of rules, patterns and words of both languages were used to develop this system. With 87% accuracy this pilot machine translation system translated 500 grammatically well-structured Sinhala sentences to English and 150 grammatically well-structured English sentences to Sinhala. The system is capable of translating approximately 70 sentences in one minute.