Publication: Spatio-temporal graph neural network based child action recognition using data-efficient methods: A systematic analysis
| dc.contributor.author | Mohottala, S | |
| dc.contributor.author | Gawesha, A | |
| dc.contributor.author | Kasthurirathna, D | |
| dc.contributor.author | Samarasinghe, P | |
| dc.contributor.author | Abhayaratne, C | |
| dc.date.accessioned | 2026-02-14T09:48:14Z | |
| dc.date.issued | 2025-06-03 | |
| dc.description.abstract | This paper presents implementations of child activity recognition (CAR) using spatial–temporal graph neural network (ST-GNN)-based deep learning models with the skeleton modality. Prior implementations in this domain have predominantly utilized CNN, LSTM, and other methods, despite the superior performance potential of graph neural networks. To the best of our knowledge, this study is the first to use an ST-GNN model for child activity recognition employing in-the-lab, in-the-wild, and in-the-deployment skeleton data. To overcome the challenges posed by small publicly available child action datasets, transfer learning methods such as feature extraction and fine-tuning were applied to enhance model performance. As a principal contribution, we developed an ST-GNN-based skeleton modality model that, despite using a relatively small child action dataset, achieved superior performance (94.81%) compared to implementations trained on a significantly larger (x10) adult action dataset (90.6%) for a similar subset of actions. With ST-GCN-based feature extraction and fine-tuning methods, accuracy improved by 10%–40% compared to vanilla implementations, achieving a maximum accuracy of 94.81%. Additionally, implementations with other ST-GNN models demonstrated further accuracy improvements of 15%–45% over the ST-GCN baseline. The results on activity datasets empirically demonstrate that class diversity, dataset size, and careful selection of pre-training datasets significantly enhance accuracy. In-the-wild and in-the-deployment implementations confirm the real-world applicability of the above approaches, with the ST-GNN model achieving 11 FPS on streaming data. Finally, preliminary evidence on the impact of graph expressivity and graph rewiring on the accuracy of small dataset-based models is provided, outlining potential directions for future research. The code is available at https://github.com/sankamohotttala/ST_GNN_HAR_DEML. | |
| dc.identifier.doi | https://doi.org/10.1016/j.cviu.2025.104410 | |
| dc.identifier.issn | 10773142 | |
| dc.identifier.uri | https://rda.sliit.lk/handle/123456789/4644 | |
| dc.language.iso | en | |
| dc.publisher | Elsevier Inc | |
| dc.relation.ispartofseries | Computer Vision and Image Understanding ; Volume 259 Article number 104410 | |
| dc.subject | Child action recognition | |
| dc.subject | Data-efficiency | |
| dc.subject | Data-efficient methods | |
| dc.subject | Graph neural networks | |
| dc.subject | Human action recognition | |
| dc.subject | Transfer learning | |
| dc.title | Spatio-temporal graph neural network based child action recognition using data-efficient methods: A systematic analysis | |
| dc.type | Article | |
| dspace.entity.type | Publication | |
Files
Original bundle
- Name: Spatio-temporal graph neural network based child action recognition using.pdf
- Size: 4.99 MB
- Format: Adobe Portable Document Format
License bundle
- Name: license.txt
- Size: 1.69 KB
- Format: Item-specific license agreed upon to submission