Principal Component Analysis and Support Vector Machine on Ice
Entrainment Through a Sluice Gate
Abstract
The entrainment and accumulation of ice floes in front of the sluice
gates are closely related to the water transport efficiency and safe
operation of the channel during an ice period. A flume study is carried
out for a sluice gate with free outflow. A framework of stacking
ensemble models is used to analyze the data, which consists of a
two-level structure including the principal component analysis (PCA) and
the support vector machine (SVM) algorithms. Based on the mechanism of
ice floe accumulation, ten input characteristics of the machine learning
(ML) model are selected. The PCA method is used to eliminate redundant
information. The first principal component, with a contribution rate of
71.76%, and the second principal component, with a contribution of rate
15.64%, are extracted as the inputs of the SVM model, and the state of
the floating ice in front of the gate is used to determine the
classification labels. The 5-fold cross-validation method is used to
train the model. The training results showed that the Gaussian radial
basis functions (RBF) were the optimal kernel function. The performance
of the developed model is measured using area under curve (AUC),
accuracy (Acc) and F1-score (F1) values as statistical indicators. The
results showed that the established PCA-SVM model improves the Bernoulli
naive Bayes (Bernoulli NB) classifier and K-nearest neighbors’ algorithm
(KNN) models. It increasing the AUC value by 11% and 5%, the Acc value
by 16% and 17%, and the F1 value by 17% and 2%, respectively.