Publication:
Use of Interpretable Evolved Search Query Classifiers for Sinhala Documents

dc.contributor.authorHaddela, P
dc.contributor.authorHirsch, L
dc.contributor.authorBrunsdon, T
dc.contributor.authorGaudoin, J
dc.date.accessioned2022-08-24T04:59:19Z
dc.date.available2022-08-24T04:59:19Z
dc.date.issued2021-01
dc.description.abstractDocument analysis is a well matured yet still active research field, partly as a result of the intricate nature of building computational tools but also due to the inherent problems arising from the variety and complexity of human languages. Breaking down language barriers is vital in enabling access to a number of recent technologies. This paper investigates the application of document classification methods to new Sinhalese datasets. This language is geographically isolated and rich with many of its own unique features. We will examine the interpretability of the classification models with a particular focus on the use of evolved Lucene search queries generated using a Genetic Algorithm (GA) as a method of document classification. We will compare the accuracy and interpretability of these search queries with other popular classifiers. The results are promising and are roughly in line with previous work on English language datasets.en_US
dc.identifier.citationHaddela, P., Hirsch, L., Brunsdon, T., Gaudoin, J. (2021). Use of Interpretable Evolved Search Query Classifiers for Sinhala Documents. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Proceedings of the Future Technologies Conference (FTC) 2020, Volume 1. FTC 2020. Advances in Intelligent Systems and Computing, vol 1288. Springer, Cham. https://doi.org/10.1007/978-3-030-63128-4_59en_US
dc.identifier.doi10.1007/978-3-030-63128-4_59en_US
dc.identifier.issn978-3-030-63127-7
dc.identifier.urihttps://rda.sliit.lk/handle/123456789/2923
dc.language.isoenen_US
dc.publisherSpringer, Chamen_US
dc.relation.ispartofseriesFTC 2020: Proceedings of the Future Technologies Conference (FTC) 2020,;Volume 1 pp 790–804
dc.subjectEvolved Search Queriesen_US
dc.subjectGenetic Algorithmen_US
dc.subjectInterpretable text classificationen_US
dc.subjectLucene Sinhala Analyzeren_US
dc.subjectSinhala Document Classificationen_US
dc.titleUse of Interpretable Evolved Search Query Classifiers for Sinhala Documentsen_US
dc.typeArticleen_US
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
use of Interpretable.pdf
Size:
1016.03 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: