Please use this identifier to cite or link to this item: https://rda.sliit.lk/handle/123456789/3264
Title: Algorithmically Navigating Complex Tabular Structures in Images for Information Extraction
Authors: Nugawela, M
Abeywardena, K. Y
Mahaadikara, H
Keywords: Algorithmically
Navigating Complex
Tabular Structures
Information Extraction
Images
Issue Date: 26-Dec-2022
Publisher: IEEE
Citation: M. Nugawela, K. Y. Abeywardena and H. Mahaadikara, "Algorithmically Navigating Complex Tabular Structures in Images for Information Extraction," 2022 3rd International Informatics and Software Engineering Conference (IISEC), Ankara, Turkey, 2022, pp. 1-6, doi: 10.1109/IISEC56263.2022.9998220.
Series/Report no.: 2022 3rd International Informatics and Software Engineering Conference (IISEC);
Abstract: Computer vision has been in the forefront of automating workflows to replace manual repetitive tasks with convenience and accuracy. Recognizing text from images of commercial documents through optical character recognition (OCR) form the initial step of most such workflows where majority of their information are in the form of complex data structures such as tables and nested tables. Although OCR technology has evolved to effectively capture text from images, there is still room for improvement in recognizing complex data structures and extracting tabular data from images. This paper proposes an algorithmic approach based on keyword detection and the position of words relative to each other in order to recognize nested structures and successfully extract tabular data into a program and human readable format, which aims to take a different approach as opposed to using machine learning models or pre-defined templates for layout recognition. Furthermore, this approach is shown to yield successful results in correctly comprehending the layout and data of nested table structures in multiple rows in a table.
URI: https://rda.sliit.lk/handle/123456789/3264
ISSN: 978-1-6654-5995-2
Appears in Collections:Department of Computer Systems Engineering

Files in This Item:
File Description SizeFormat 
Algorithmically_Navigating_Complex_Tabular_Structures_in_Images_for_Information_Extraction.pdf
  Until 2050-12-31
510.92 kBAdobe PDFView/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.