Please use this identifier to cite or link to this item:
https://rda.sliit.lk/handle/123456789/3264
Title: | Algorithmically Navigating Complex Tabular Structures in Images for Information Extraction |
Authors: | Nugawela, M Abeywardena, K. Y Mahaadikara, H |
Keywords: | Algorithmically Navigating Complex Tabular Structures Information Extraction Images |
Issue Date: | 26-Dec-2022 |
Publisher: | IEEE |
Citation: | M. Nugawela, K. Y. Abeywardena and H. Mahaadikara, "Algorithmically Navigating Complex Tabular Structures in Images for Information Extraction," 2022 3rd International Informatics and Software Engineering Conference (IISEC), Ankara, Turkey, 2022, pp. 1-6, doi: 10.1109/IISEC56263.2022.9998220. |
Series/Report no.: | 2022 3rd International Informatics and Software Engineering Conference (IISEC); |
Abstract: | Computer vision has been in the forefront of automating workflows to replace manual repetitive tasks with convenience and accuracy. Recognizing text from images of commercial documents through optical character recognition (OCR) form the initial step of most such workflows where majority of their information are in the form of complex data structures such as tables and nested tables. Although OCR technology has evolved to effectively capture text from images, there is still room for improvement in recognizing complex data structures and extracting tabular data from images. This paper proposes an algorithmic approach based on keyword detection and the position of words relative to each other in order to recognize nested structures and successfully extract tabular data into a program and human readable format, which aims to take a different approach as opposed to using machine learning models or pre-defined templates for layout recognition. Furthermore, this approach is shown to yield successful results in correctly comprehending the layout and data of nested table structures in multiple rows in a table. |
URI: | https://rda.sliit.lk/handle/123456789/3264 |
ISSN: | 978-1-6654-5995-2 |
Appears in Collections: | Department of Computer Systems Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Algorithmically_Navigating_Complex_Tabular_Structures_in_Images_for_Information_Extraction.pdf Until 2050-12-31 | 510.92 kB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.