Converting high resolution multi-lingual printed document images in to editable text using image processing and artificial intelligence

Jayakody, A; Premachandra, H. W. H; Kawanaka, H

Publication:
Converting high resolution multi-lingual printed document images in to editable text using image processing and artificial intelligence

dc.contributor.author	Jayakody, A
dc.contributor.author	Premachandra, H. W. H
dc.contributor.author	Kawanaka, H
dc.date.accessioned	2022-07-21T05:42:38Z
dc.date.available	2022-07-21T05:42:38Z
dc.date.issued	2022-06-21
dc.description.abstract	The optical character recognition technique is used to convert information, mainly printed or handwritten text in paper materials, into an electronic format that the computers can edit. According to the literature, there are few competent OCR systems for recognizing multilingual characters in the form of Sinhala and English characters together. The lack of an appropriate technology to recognize multilingual text still remains as a problem that the current research community must address, and it has been designated as the key problem for this study. The main goal of this research is to develop a multilingual character recognition system that uses character image geometry features and Artificial Neural Networks to recognize printed Sinhala and English scripts together. It is intended that the solution would be improved to cover three Sri Lanka’s most commonly spoken languages, with the addition of Tamil as a later upgrade. The primary technologies for this study were character geometry features and Artificial Neural Networks. At the moment almost an 85% of success rate has been achieved with a database containing around 800 images, which are divided into 46 characters (20 Sinhala and 26 English), and each character is represented in 20 different forms of character images. Recognition of text from printed bi-lingual documents is experimented by extracting individual character data from such printed text documents and feeding them to the system.	en_US
dc.identifier.citation	H. W. H. Premachandra, A. Jayakody and H. Kawanaka, "Converting high resolution multi-lingual printed document images in to editable text using image processing and artificial intelligence," 2022 2nd International Conference on Image Processing and Robotics (ICIPRob), 2022, pp. 1-7, doi: 10.1109/ICIPRob54042.2022.9798739.	en_US
dc.identifier.doi	10.1109/ICIPRob54042.2022.9798739	en_US
dc.identifier.issn	978-1-6654-0771-7
dc.identifier.uri	https://rda.sliit.lk/handle/123456789/2818
dc.language.iso	en	en_US
dc.publisher	IEEE	en_US
dc.relation.ispartofseries	2022 2nd International Conference on Image Processing and Robotics (ICIPRob);
dc.subject	Converting	en_US
dc.subject	high resolution	en_US
dc.subject	multi-lingual	en_US
dc.subject	printed document	en_US
dc.subject	editable text	en_US
dc.subject	image processing	en_US
dc.subject	images	en_US
dc.subject	artificial intelligence	en_US
dc.title	Converting high resolution multi-lingual printed document images in to editable text using image processing and artificial intelligence	en_US
dc.type	Article	en_US
dspace.entity.type	Publication

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Converting_high_resolution_multi-lingual_printed_document_images_in_to_editable_text_using_image_processing_and_artificial_intelligence.pdf
Size:: 982.16 KB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Research Papers - Dept of Computer Systems Engineering

Publication: Converting high resolution multi-lingual printed document images in to editable text using image processing and artificial intelligence

Files

Original bundle

License bundle

Collections

Publication:
Converting high resolution multi-lingual printed document images in to editable text using image processing and artificial intelligence