Publication: Support Vector Machine Based an Efficient and Accurate Seasonal Weather Forecasting Approach with Minimal Data Quantities
Type:
Article
Date
2022-02-11
Journal Title
Journal ISSN
Volume Title
Publisher
SLIIT
Abstract
Climate change makes a big impact in our daily activities. Therefore, forecasting climate changes
prior to its actual occurrences is important. Even though highly accurate weather prediction systems
throughout the world are available, they require mass amounts of data exceeding thousands of data
points to obtain a significant accuracy. This study was aimed at proposing a Support Vector Machine
based approach to carryout seasonal weather predictions up to thirty-minute intervals, the results of
which would be considerably effective with respect to predictions carried out with models trained with
annual datasets. The model was trained utilizing a dataset corresponding to the district of Kandy which
consisted of 136 samples, 20 features, and 5 labels. By means of carrying out numerous data
preprocessing steps, the model was trained, and the relevant hyperparameters were optimized
considering the grid search algorithm to yield a maximum accuracy of 86%, once tested via the k-fold
cross validation. The performance of the Support Vector Machine was also then compared for the same
dataset with that of the K-Nearest Neighbor algorithm which consumed relatively fewer computing
resources. An optimal accuracy of 61% was observed for this model for a K-value of 27. This approach
supported the concept of a Support Vector Machine’s ability to perceive time series forecasts to a
relatively higher degree and its ability to perform effectively in higher dimensional datasets with smaller
number of samples. As per the future work, the Receiver Operating Characteristic analysis is proposed
to be carried out to evaluate the performance of the model and the dataset size is proposed to be further
enhanced to a maximum of a thousand samples to yield the best performance results.
Description
Keywords
Support Vector Machines, Principal Component Analysis, Receiver Operating Characteristic, Machine Learning, Weather Forecast, Hyperparameter Optimization
