Publication:
Latent Structures in Zero-Inflated Risk Domains: An Elastic–Tweedie Synergy for Claim Forecasting

dc.contributor.authorKumarasinghe, P. B. W. S. R.
dc.contributor.authorNapagoda, N. A. D. N.
dc.date.accessioned2026-01-11T08:09:45Z
dc.date.issued2025-10-10
dc.description.abstractThe frequency of insurance claims presents a unique modeling challenge due to high-dimensional inputs, strong feature correlations, and the dominance of zero-inflated outcomes. Conventional statistical models often fall short under these conditions, failing to capture the underlying structure of complex data sets. This study proposes an advanced predictive framework integrating Elastic Net regularization and a Tweedie-distribution-based XGBoost algorithm to address these issues in the context of motor insurance. Those methodologies were applied to the French Motor Claims data set,which contains over 678,000 policies, to distill influential variables while suppressing redundancy and noise. Lasso Regression, Elastic Net and the Boruta algorithm were employed to select relevant features. Elastic Net, in particular proved effective in identifying critical predictors including Exposure, Vehicle Age, Driver Age, BonusMalus, Area, and Fuel Type by balancing sparsity and multicollinearity. Thesefeatures were used to train both standard and Tweedie-distribution-based XGBoost models. Performance was evaluated using RMSE, MAE, and R², where the Tweedie XGBoost model guided by Elastic Net-selected features achieved the highest accuracy and explanatory power. The proposed architecture not only offers superior generalization and interpretability but also exhibits robustness in modeling skewed, zero-dominated distributions inherent to claim data. Beyond predictive enhancement, this framework has practical implications for actuarial science, particularly in dynamicpricing strategies, refined segmentation, and adaptive underwriting. This approach marks a shift toward more nuanced and scalable machine learning paradigms in insurance analytics by integrating statistically grounded feature selection with distribution-aware boosting.
dc.identifier.doihttps://doi.org/10.54389/IUJF8468
dc.identifier.isbn978-624-6010-14-0
dc.identifier.issn2783 – 8862
dc.identifier.urihttps://rda.sliit.lk/handle/123456789/4500
dc.language.isoen
dc.publisherDepartment of Mathematics and Statistics, Faculty of Humanities and Sciences, SLIIT
dc.relation.ispartofseriesICActS; 20p.-25p.
dc.subjectClaim frequency prediction
dc.subjectFeature selection
dc.subjectTweedie-distribution-based XGBoost
dc.subjectElastic Net
dc.subjectLasso Regression
dc.titleLatent Structures in Zero-Inflated Risk Domains: An Elastic–Tweedie Synergy for Claim Forecasting
dc.typeArticle
dspace.entity.typePublication

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
Latent Structures in Zero-Inflated Risk Domains.pdf
Size:
554.43 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description: