Research Publications Authored by SLIIT Staff
Permanent URI for this communityhttps://rda.sliit.lk/handle/123456789/4195
This collection includes all SLIIT staff publications presented at external conferences and published in external journals. The materials are organized by faculty to facilitate easy retrieval.
Browse
3 results
Search Results
Publication Embargo Road Navigation System Using Automatic Speech Recognition (ASR) And Natural Language Processing (NLP)(IEEE, 2019-01-31) Withanage, P; Liyanage, T; Deeyakaduwe, N; Dias, E; Thelijjagoda, SIn a highly evolving technical era, Voice-based Navigation Systems play a major role to bridge the gap between human and machine. To overcome the difficulty in taking and understanding user's voice commands, simulating the natural language, process the route with user's turn by turn directions while mentioning key entities like street names, landmarks, point of interests, junctions and map the route in an interactive interface, we propose a user-centric roadmap navigation mobile application called “Direct Me”. The approach of generating the user preferred route, system will first convert the audio streams into text through Automatic Speech Recognizer (ASR) using Pocket Sphinx Library, followed by Natural Language Processing (NLP) by utilizing Stanford CoreNLP Framework to retrieve the navigation-associated information and process the route in the Map using Google Map API upon the user request. This system is used to provide an efficient approach to translate natural language directions to a machine-understandable format and will benefit the development of voice-based navigation-oriented humanmachine interface.Publication Open Access Snap & Hear: Comic Book Analyst for Children Having Literacy and Visual Barriers(CSEDU 2020 - 12th International Conference on Computer Supported Education, 2020) Yapa, R. B. D; Kahaduwa Arachchi, T. L; Suriyarachchi, V. S; Abegunasekara, U. D; Thelijjagoda, SComic books are very popular across the world due to the unique experience they provide for all of us in the society without any age limitation. Because of this attraction, which comic books have received, it has proved that comic literature will be able to survive in the twenty first century, even with the existence of multidimensional movie theatres as its competitors. While the biggest global filmmakers are busy with making movies from comic books, many researchers have been investigating their time on digitizing the comic stories as it is, expecting to create a new era in the comic world. But most of them have focused only on one or few components of the story. This paper is based on a research which aims to give the full experience of enjoying the comic books for everyone in the world despite of visual and literacy barriers people are having. Proposed solution comes as a web application that translates input image of a comic story into a text format and delivers it as an audio story to the user. The story will be created using extracted components such as characters, objects, speech text and balloons and considering the association among them with the use of image processing and deep learning technologies.Publication Embargo Project Bhashitha-Mobile based optical character recognition and text-to-speech system(IEEE, 2018-08-08) De Zoysa, D. S. S; Sampath, J. M; De Seram, E. M. P; Dissanayake, D. M. I. D; Wijerathna, L; Thelijjagoda, SIn the modern era when computers play a vital role in people's day today activities, visually impaired people face numerous problems when accessing printed text using existing technologies. This will rise to the need for the improvement of devices that could bring relief to this tasks that the blind people have to go beginning to end. Due to digitization of books there are many excellent attempts at building a vigorous document analysis system in industries and research labs, but this is only for those who are able to visible aided. “Bhashitha” is an android based mobile application contains OCR and TTS for Sinhala, Tamil and English languages as single product by resolving problems in existing systems. In order to make the proposed system, user needs to acquire printed document as optical image using a camera of the mobile phone. The image skew will reduce the OCR accuracy drastically due to the angle view of the document. Therefore after doing the image skew detection optical image is passing to the OCR engine to convert the image to character streams representing letters of recognized words. Finally, the converted text output is access by TTS system to convert the textual content into a voice output. Additionally, it consists audio assist system to navigate through the pages in the diligence for differently abled users. This is easier, portable and faster solution comparing to the existing systems which are made for visually impaired.
