Uploaded by albertqj

Long-term prediction of sea surface chlorophyll-a concentration based on the combination of spatio-temporal features

advertisement
Water Research 211 (2022) 118040
Contents lists available at ScienceDirect
Water Research
journal homepage: www.elsevier.com/locate/watres
Long-term prediction of sea surface chlorophyll-a concentration based on
the combination of spatio-temporal features
Liu Na a, Chen Shaoyang a, *, Cheng Zhenyan b, Wang Xing a, Xiao Yun c, Xiao Li a, Gong Yanwei a,
Wang Tingting a, Zhang Xuefeng a, Liu Siqi a
a
b
c
School of Marine Science and Technology, Tianjin University, Tianjin 300072, China
College of Fisheries, Tianjin Agricultural University, Tianjin 300384, China
Xian Research Institute of Surveying and Mapping, Xian 710061, China
A R T I C L E I N F O
A B S T R A C T
Keywords:
HABs
Chl-a
CNN-LSTM model
Long-term rolling prediction
Time series analysis
Harmful algal blooms (HABs) events have a serious impact on marine fisheries and marine management. They
occur globally with high frequency and are characterized by a long duration and difficult governance. HABs
incidents have occurred in the South China Sea (SCS), and the frequency of occurrence has been on the rise in
recent decades. Predicting the long-term chlorophyll-a (Chl-a) concentration has the potential to facilitate longterm monitoring and early warning of HABs events. Currently, long-term predictions of ocean circulation and
temperature are common, while long-term predictions of marine biochemistry are still in their infancy. Tradi­
tional Chl-a prediction methods have problems, such as low accuracy and the inability to carry out long-term
predictions. This research improved the CNN-LSTM model by combining spatio-temporal features to predict
Chl-a concentrations. This model can extract both the temporal and spatial features of Chl-a, expand the dataset,
and improve the prediction accuracy and training speed. The predictions were made using a Chl-a dataset for the
Reed Tablemount in the SCS. The time series of Chl-a used was the satellite data of NASA’s official website from
January 2002 to June 2020. The results indicate that the predictions of the CNN-LSTM model are better than
those of the LSTM and SARIMA models. The five-year long-term rolling prediction of Chl-a was carried out, and
the three-year Pearson correlation coefficient reached 0.5. The novelty of this study is the realization of a threeyear long-term prediction of Chl-a concentrations. The Mann-Kendall trend test method and the least square
method were used to fit the straight line to detect the trend of the five-year predicted value and the true value,
respectively. The results indicated that the prediction value and true value of the sea surface Chl-a from 2015 to
2020 both exhibited an overall upward trend. In addition, the prediction performance of the model in large-scale
prediction is better than that in small-scale prediction.
1. Introduction
According to the Harmful Algae (Red Tide) Event Database (HAE­
DAT), in the 30 years after 1985, there were more than 8000 harmful
algal blooms (HABs) events worldwide, and the frequency of occur­
rences has gradually increased. The South China Sea (SCS) has a high
quality of seawater due to factors such as deep water and the distance
from land. However, HABs frequently occur in the SCS. Studies have
determined that between 1980 and 2003, there were more than 700
HABs occurrences in the SCS, with 170 in the 1980s and 440 in the
1990s (Wang et al., 2008). HABs in the SCS have exhibited an upward
trend in recent decades (Zhang, 2013). HABs are one of the three major
marine disasters worldwide. After an outbreak, it has the characteristics
of a long duration and difficult governance. When HABs occur, they
negatively affect fisheries (McOwen et al., 2015) and marine ecosys­
tems, causing substantial property loss and even threatening human life
and health. Therefore, it is essential to build a HABs prediction model
and a marine water quality prediction system (Rostam et al., 2021). In
the past, the unpredictability of red tide fluctuations caused difficulty in
handling HABs events. The red tide monitoring and early warning has
ushered in the "big data era", making long-term and large-scale red tide
monitoring and early warning possible.
Park et al. (2019) summarized previous studies (Seferian et al., 2014)
and demonstrated that biogeochemical factors, such as acidity, oxygen,
* Corresponding author.
E-mail address: nmdiscsy@126.com (C. Shaoyang).
https://doi.org/10.1016/j.watres.2022.118040
Received 29 September 2021; Received in revised form 9 December 2021; Accepted 2 January 2022
Available online 4 January 2022
0043-1354/© 2022 Elsevier Ltd. All rights reserved.
L. Na et al.
Water Research 211 (2022) 118040
fishery production (Chavez et al., 2003; Chassot et al., 2010; Stock et al.,
2017). Moreover, it can provide valuable support for scientific sea area
management and red tide control. Currently, long-term predictions of
ocean circulation and temperature are common, while long-term pre­
dictions of marine biochemistry are still in their infancy. Existing Chl-a
prediction studies have been mostly short-term predictions (Zhao et al.,
2017), and most of the prediction targets have been lakes (Hou et al.,
2004; Zeng et al., 2006; Chen et al., 2014a; Li et al., 2017c; Barzegar
et al., 2020, 2021) and rivers (Lee and Lee, 2018), with a lack of
long-term predictions for the multi-year Chl-a concentrations in sea
areas. The prediction algorithm that is suitable for one water source may
not be suitable for other water sources because they have different
characteristics in terms of hydrology, climate, geochemistry, and bio­
logical characteristics (Rostam et al., 2021). It is necessary to model the
medium- and long-term predictions of Chl-a, concentrations but
long-term predictions are exceptionally difficult because long-term
predictions require more historical data input (Li et al., 2017b). The
robustness of LSTMs in medium-and long-term simulations (Wang et al.,
2019) provides support for long-term predictions. Some scholars (Chen
et al., 2020; Yu et al., 2020) have predicted the long-term trends in Chl-a
concentrations, but they have not compared these results with the true
values to verify the accuracy predictions. The National Aeronautics and
Space Administration (NASA) Global Modeling and Assimilation Office
(GMAO) Subseasonal to Seasonal Forecast System (GEOS-S2S) estab­
lished a long-term global biogeochemical prediction model, conducted a
9-month long-term seasonal prediction of the global ocean Chl-a con­
centrations (Rousseaux and Gregg, 2017; Rousseaux et al., 2021), and
have continued to improve the model. In addition, Park et al. (2019)
attempted a longer time span for Chl-a predictions using the Global
Earth System Model (ESM) (Stock et al., 2014) to predict regional sea­
sonal to multi-year ocean Chl-a fluctuations, including predicting the
fluctuations in the Somali Sea up to 2–3 years in advance, but the pre­
diction time for the tropical Pacific was only 12 months. Seferian et al.
(2014) used ESM in 2014 to verify that the effective predictable range of
tropical Pacific net primary productivity (NPP) could be extended to
three years. Studies have demonstrated that the NPP is the most sensi­
tive to the surface Chl-a concentration (Lee et al., 2015) and is usually
modeled as a function of Chl-a (Westberry et al., 2008). It is inferred
from this that the prediction time of Chl-a concentration in the tropical
Pacific could also be extended to interannual scales.
The current study has improved the CNN-LSTM model. Compared
with the traditional LSTM time-series prediction model, this model adds
the process of extracting spatial data features that are highly correlated
with the target prediction area, supplements the dataset, and effectively
ameliorates the poor prediction performance of traditional LSTMs for
small data volumes. The model was tested and compared with LSTM and
SARIMA models. Then, the three-dimensional spatio-temporal charac­
teristics of the Chl-a sequence were extracted and analyzed, with a longterm rolling prediction of Chl-a concentrations over a five-year time
span. Long-term Chl-a predictions help to understand Chl-a fluctuations
and provides services for the establishment of a red tide early warning
research system.
and primary production, may be more predictable than their physical
counterparts. Therefore, the concentration of chlorophyll-a (Chl-a), a
factor that is commonly used for oceans, represents the concentration of
algae (Yang et al., 2019) as a key indicator for monitoring the degree of
seawater nutrition (Zou et al., 2020). The over-proliferation of marine
algae triggers red tides, and predicting the temporal and spatial changes
in Chl-a concentrations can provide timely insights into the algae and
marine ecological conditions to provide an early warning of red tide
disasters (Xu et al., 2019). The original Chl-a prediction model is a
simple first-order equation proposed by Vollenweider (1975). This
model does not consider other factors affecting Chl-a. There are many
factors that affect the concentration of Chl-a, such as sea surface tem­
perature, wind speed, light transmittance of the sea water, and whether
it is near the shore (Chen et al., 2011; Carneiro et al., 2014). Subse­
quently, scholars established a theoretical analysis model to predict
Chl-a concentration based on the nature of the water body (Jørgensen
et al., 1978; Wu et al., 2018). This type of model can consider the in­
teractions between various elements in the water body, but the diversity
of water quality variables in the ocean may make correct modeling or
parameterization difficult (Dutkiewicz et al., 2020). The autoregressive
integrated moving average model (ARIMA) is the most common type of
predictive model for time-series data. The ARIMA model has a simple
structure but requires that the time-series data must be stable. The
seasonal difference autoregressive moving average model (SARIMA) is
an advanced ARIMA model that can support the seasonal variables of the
time series. In the past, Chl-a determination primarily relied on-site
sampling, which has the disadvantages of high cost and slow speed
(Wu and Liu, 2012). With the development of satellite remote sensing
technology, remote sensing monitoring of Chl-a (Gitelson et al., 2007),
and remote sensing inversion (O’Reilly et al., 1998; Dall’Olmo et al.,
2003), research has become increasingly mature. With the diversifica­
tion of the means of obtaining information, a large amount of variable
water quality data can be obtained. Therefore, machine learning models
are widely used in water quality variable predictions (Ark et al., 2015;
Xiao et al., 2017), such as artificial neural networks (Lu et al., 2008;
Vilas et al., 2011; Alizadeh and Kavianpour, 2015; Sinshaw et al., 2019;
Shi et al., 2020), the random forest method (Li et al., 2017a, 2018;
Yajima and Derot, 2018), and support vector machines (Noori et al.,
2015; Park et al., 2015; Xu et al., 2015; Kisi and Parmar, 2016; Fijani
et al., 2019). In water quality predictions, machine learning focuses
more on prediction accuracy than on model structures (Elith and
Leathwick, 2009). Deep learning (DL) is a special type of machine
learning (Guo et al., 2020) that has been widely used in the field of data
prediction in recent years, and its predictions using large datasets are
better than with machine learning. DL prediction models have been
widely used in the field of marine environment in the past two years,
such as ship navigation (Liu et al., 2020), waves (Bao et al., 2020), sea
height anomalies (SSHA) (Song et al., 2020), and sea surface tempera­
tures (Xie et al., 2020), as well as a few studies on predicting ocean Chl-a
concentrations (Rostam et al., 2021). Most current studies have used
independent DL models for short-term Chl-a predictions. Researchers
have mostly used the LSTM model to predict Chl-a concentrations (Cho
et al., 2018; Zheng et al., 2021), with some using the convolutional
neural network (CNN) model (Choi et al., 2019). Yussof et al. (2021)
implemented the LSTM and CNN methods to predict HABs events on the
west coast of Sabah eight days in advance and determined that the
predictions of the LSTM model are better than those of the CNN model.
Because the LSTM method can learn the long-term dependence, which
the CNN method cannot, the LSTM method can memorize the
time-series information. However, the correlation coefficient of the
LSTM model is still very low. Barzegar et al. (2020) used the hybrid
CNN-LSTM model for the first time to perform short-term Chl-a pre­
dictions on Small Prespa Lake in Greece and verified that the hybrid
CNN-LSTM model is better than the DL independent model in predicting
Chl-a levels.
Chl-a seasonal and multi-year predictions are necessary to maintain
2. Materials and methods
2.1. Data set
The study area was located in the SCS. The SCS is a vital and
controversial part of global ocean governance. While its abundant re­
sources provide opportunities for the surrounding countries, it also
brings challenges to the environmental protection of the SCS. Sur­
rounding countries have a common responsibility for the environment in
the SCS. However, the controversy surrounding the SCS complicates its
environmental protection, and there is no complete governance system.
The balance of the SCS and the management of eutrophication of the SCS
require long-term environmental cooperation between China and the
2
L. Na et al.
Water Research 211 (2022) 118040
neighboring countries. Water pollution in the SCS predominantly in­
cludes land transportation pollution, heavy metal pollution of coral
reefs, and oil and gas development pollution. The selected dataset covers
the global oceans and SCS from 11.1◦ N–12.1◦ N, 115.8◦ E–117.5◦ E, and
covers part of the Reed Tablemount in the SCS (Fig. 1). The Reed
Tablemount is rich in oil and gas resources. Drilling mud discharge,
domestic sewage discharge, and pipeline-laying sediments in oil and gas
exploration and development activities may cause pollution to the sea.
Therefore, the environmental conditions of the sea near the Reed
Tablemount should be considered.
Using a single variable input has a higher accuracy than using mul­
tiple variable inputs (Xiao et al., 2017; Yussof et al., 2021). In this study,
the Chl-a concentration was selected as the only variable input. The SCS
is far from the mainland, and it is difficult to cover a wide range of sea
areas with actual measurements and station observations. Therefore,
remote sensing was used to obtain the long-term and large-scale
observation data. The data used in the experiment were obtained from
the official website of NASA Ocean Color (https://oceancolor.gsfc.nasa.
gov/). The downloaded data were the vector data of the satellite time
series. Because of the large number of missing values in the daily Chl-a
data, we selected the second-level Chl-a semi-monthly synthetic product
data of the modified sensor from January 2002 to June 2020. The data
comes from Terra and Aqua satellites, and the accuracy of the remote
sensing data was 1000 m. NASA’s research on the inversion of Chl-a
concentration has reached a relatively mature stage. O’Reilly et al.
(1998) obtained a model ocean chlorophyll x (OCx) for global Chl-a
Fig. 1. The location of the study area, the area covered by the grids is the research area.
3
L. Na et al.
Water Research 211 (2022) 118040
concentration inversion that is suitable for various types of remote
sensing data through experimental summaries. The global Chl-a inver­
sion algorithm is not as accurate as the regional inversion algorithm, but
because the study area is Case 1 water, the inversion is relatively reliable
(O’Reilly et al., 1998).
The downloaded Chl-a remote sensing data had a resolution of 1 km
× 1 km. First, we re-analyzed the data, divided it into grids, and
expanded it 20 times to produce a grid product with a spatial resolution
of 20 km × 20 km (Fig. 1). The value of a single grid was replaced by the
average of 400 original data points. The SCS is a sea area with sub­
stantial cloud coverage, especially in summer when the cloud coverage
can reach 80%, which could cause missing and abnormal sensor data.
Therefore, to observe the continuity of the data, it was necessary to
remove the outliers outside the interquartile range when studying the
temporal and spatial changes in the Chl-a concentration, and spline
interpolation was used for the missing values. Before entering the model,
the original data were normalized with min–max normalization to
prevent gradient changes and improve the convergence speed of the
model. The normalization and min–max scaling techniques scale be­
tween 0 and 1, executed in the scikit-learn preprocessing library.
For each grid, the dataset was divided into a training set, validation
set, and prediction set according to the time periods. In the time
dimension, the semi-monthly Chl-a data from January 2002 to June
2020 included 444 data, with 210 data from January 2002 to September
2010 as the training set, 114 data from September 2010 to June 2015 as
the validation set, and 120 data from June 2015 to June 2020 as the
prediction set.
information is stored and remember the long-term information of the
time series for future predictions. When the data flow in the network, it
can be stored, removed, and added according to whether the informa­
tion is needed, effectively coping with the problem of vanishing gradi­
ents (Shi et al., 2015). Therefore, LSTM can predict longer sequences
and widely spaced sequences (Gers et al., 2002). LSTM has advantages in
time series modeling, has strong learning and generalization abilities,
and has a good predictive effect on nonstationary data.
However, LSTM neural networks are complex, and there are many
processing parameters during the training process. In addition, LSTM
training is slow, computationally intensive, time consuming, and re­
quires advanced hardware systems. For RNN gradient disappearance
and gradient explosion problems, there is a certain degree of improve­
ment in LSTM, but this has not been completely solved. LSTM can only
extract relatively long time-series information (Weninger et al., 2015),
and the predictions for longer time series and long-term trends worsen.
2.2.3. CNN-LSTM
CNN neurons can respond to the surrounding units when processing
data and are generally used to process images. CNNs include an input
layer, convolutional layer, activation layer, pooling layer, and fully
connected layer. The CNN-LSTM model in this study combines the CNN
and LSTM. It has the advantage of CNN to extract multi-dimensional
image features and can perceive spatial information features. Howev­
er, it retains the advantage of LSTM in processing the time series. In the
CNN-LSTM model, the CNN extracts the deep features of the index. The
LSTM model is then applied to perform predictions using these deep
features (Huang et al., 2018). CNN-LSTM contains six modules: an input
layer, convolution layer, pooling layer, dropout layer, LSTM layer, and
output layer. Compared with the LSTM time series model, the con­
volutional layer, pooling layer, and dropout layer of the CNN are added.
The CNN-LSTM automatically generates a feature extractor during the
training process, and the convolution kernel performs a
multi-dimensional feature extraction on the input data for local
perception and integrates all the information through shared weights.
Nonlinear mapping is performed on the convolution output through the
activation function. Commonly used activation functions are the sig­
moid function, Tanh function, and ReLU. The convolutional layer can
transfer the shape information to the next layer in the form of the same
dimension. The data circulating in the convolutional layer are shape
data, that is, multidimensional data. The pooling layer can reduce the
dimensionality into one dimension, lose spatial information, and then
pass it to the LSTM layer. The pooling process includes max pooling and
average
pooling.
The
pooling
layer
primarily
performs
dimensionality-reduction operations on the features, reduces the num­
ber of data and parameters, and reduces overfitting.
2.2. Chl-a prediction models
2.2.1. SARIMA
Box et al. (1976) proposed the traditional ARIMA prediction model,
which is more suitable for datasets with stable sequence trends, that is,
stationary data sets. Therefore, in the prediction process, we first elim­
inated the trend and seasonal influence and converted the data into
stable data. SARIMA (p,d,q)(P,D,Q)s have seven parameters, adding
seasonal parameters to ARIMA (p,d,q).
The structure of the SARIMA model is as follows (Ma et al., 2021):
⎧
( S( d D )
( S) )
⎪
⎪
⎨ Φ(L)Ap L ∇ ∇S xi = Δq (L)Bq L εt
2
(1)
E(εt ) = 0, Var(εt ) = σ S , E(εt |εS ) = 0, S ∕
=t
⎪
⎪
⎩ E(xS εt ) = 0, S < t
In the formula, L is the delay operator, Ap (LS ) is the p-order autore­
gressive operator, Bq (LS ) is the q-order seasonal moving average oper­
ator, ∇d = (1 − L)d is the difference operation, and ∇DS = (1 − LS )d is
seasonal difference operation.
First, we determined whether the data were stationary data using the
stationarity test. Then, we determined whether the model satisfied the
residual white noise test. Finally, the autocorrelation graph (acf) and
partial autocorrelation graph (pacf) were used to determine the model
parameters.
2.3. Implementation of dl prediction models
2.3.1. Simulation environment
The experiment was carried out on a PC with the following features:
Hardware: Processor Intel(R) Core(TM) i7–7500 U CPU, 8 GB
memory, dual graphics card AMD Radeon(TM) 530, and Intel(R) HD
Graphics 620.
Software: Windows64-bit operating system, PyCharm-Professional2020.2.3 compiler, Anaconda3–2.4.1-Windows-x86_64 environment
configuration, Python3.5.6, Keras2.2.2, and TensorFlow1.10.0.
2.2.2. LSTM
The LSTM network was originally a recurrent neural network (RNN)
proposed by Hochreiter and Schmidhuber (Hochreiter and Schmid­
huber, 1997). The neurons of the traditional RNN contain self-feedback
connections, and the output is jointly determined by the input and the
previous output, making it capable of remembering. However, as the
time interval increases, the information is multiplied by the decimal
multiple times during the neuron flow process and is lost, resulting in the
disappearance of the gradient (Zeng and Zhang, 2013). The influence of
the current output on the subsequent output weakens until it disappears.
Therefore, useful information cannot be continuously remembered. The
LSTM can continuously circulate information to ensure that the
2.3.2. LSTM
When using the LSTM model to predict the Chl-a concentration, the
hidden layer length is 50, the time step is 100, the learning rate is 1e-7,
and the batch_size is set to 16. LSTM implements Adam optimization and
80 rounds of sample training. The model predicts one time step at a time,
and when performing multi-step prediction, the previous prediction
output is added and iterated to the next sequence for the next prediction
process.
4
L. Na et al.
Water Research 211 (2022) 118040
2.3.3. CNN-LSTM
2.3.3.3. CNN-LSTM prediction implementation. The sea surface is
divided into grids in space, and the Chl-a data are condensed into one
observation value at each grid point. The Chl-a value of a single grid
point is correlated with the adjacent grid points, and the related grid
data constitute the spatial data. The Chl-a value at each grid point is a
time series, and the Chl-a value has a correlation (dependency) with the
historical data, which constitutes the time dimension data. The time
dimension data and spatial data constitute the 3D time-series grid data.
In the CNN-LSTM prediction process, we selected the historical 3D timeseries grid data of the Chl-a concentration to predict the future 3D timeseries grid data and scroll forward through the window to make the next
prediction. Taking the D42 grid as an example, the D42 grid needs to be
tested for mutual information before the prediction. It was found that
the D43 and E42 grids adjacent to the D42 grid had a strong correlation
with the D42 grid (greater than 0.7). Therefore, this study input the
three-dimensional Chl-a time-series data of the three grid areas of D42,
D43, and E42. In the single-step prediction process, 30 time steps of the
sample are used to predict the next Chl-a data of the target grid. Sliding
forward one step at a time predicts the next data point in the target grid
area. In the rolling prediction process, the predicted data are added to
the sample, and the following 72 time steps (three years) of the Chl-a
concentration data are used to predict the subsequent Chl-a concentra­
tion data. As the number of iterations increases, the prediction accuracy
gradually decreases (Fig. 3).
2.3.3.1. Mutual information. When using the CNN-LSTM model to pre­
dict the Chl-a concentration, a grid mutual information analysis is per­
formed first. Mutual information is a measure of the statistical
correlation between two random variables that can measure the corre­
lation between two events. For two discrete variables X and Y, the
calculation is defined as follows:
(
)
∑∑
p(x, y)
I(X; Y) =
p(x, y)log
dxdy
(2)
p(x)p(y)
y∈Y x∈X
where I(X; Y)is the mutual information entropy; p(x, y) is the joint
probability distribution function of the discrete variables X and Y, and
p(x)p(y) are the marginal probability distribution functions of the
discrete variables X and Y, respectively.
If I(X; Y)=0, the variables X and Y are independent of each other. If
I(X; Y)=1, the variables X and Y are completely correlated.
2.3.3.2. CNN-LSTM training implementation. By adding a convolutional
layer, we extract the overall features of the surrounding grid area data
that are highly related to the current grid area. The model uses the
convolutional layer (Conv1D) of the unified kernel initializer. The size of
the added convolution kernel is 64, and the sigmoid activation function
is used. There are no learning parameters in the pooling process. Before
the data enters the LSTM layer, if the number of neurons is too large, it
may lead to over-learning and overfitting. Therefore, this study adds a
dropout layer and randomly removes some neurons to reduce over­
fitting, with a dropout rate of 0.001. The LSTM layer selects the tanh
activation function and 64 hidden neural units. In the LSTM layer, the
kernel layer sets the glorot_uniform initialization value, the weight
initialization uses the Zeros method, and the activation function of the
loop step is hard_sigmoid. The learning rate of the CNN-LSTM model is
set to 0.001, the batchsize is set to 8, and the training epoch is set to 150.
During the runtime, the model uses the loss function of the MSE and
performs Adam optimization (Fig. 2).
2.4. Inspection index
2.4.1. Prediction accuracy test
The root mean square error (MSE) and mean absolute error (MAE)
were calculated to test the Chl-a prediction level. The smaller the MSE
and MAE, the closer the predicted value is to the true value. A value of
0 indicates the highest accuracy. The Pearson correlation coefficient (r)
was calculated to test the rolling prediction level of the long-term Chl-a.
The r metric represents the level of linear regression between the
observed and predicted values. The larger the Pearson correlation co­
efficient, the higher the correlation between the predicted and original
Fig. 2. Program running structure diagram.
5
L. Na et al.
Water Research 211 (2022) 118040
Fig. 3. The prediction process for Chl-a in D42 grid area (a) single-step prediction process and (b) rolling prediction process.
values. When r is equal to 1, the strongest correlation is observed.
n ∑
i− 1
∑
N
1 ∑
MSE =
(yi − fi )2
N i=1
(3)
N
1 ∑
MAE =
|(yi − fi )|
N i=1
(4)
∑N
− y)(fi − f )
1
2 ∑N
2 ]2
i=1 (yi − y)
i=1 (fi − f )
r = [∑
N
i=1 (yi
S=
i=2
)
(
sign xi − xj
(6)
j=1
In the equation, the value of sign(x) is as follows:
⎧
⎨1 x > 0
sign(x) = 0 x = 0
⎩
− 1x<0
(7)
S is normal distribution, E(S) = 0, Var(S) = n(n − 1)(2n + 5)/18.
Define statistics:
/√̅̅̅̅̅̅̅̅̅̅̅̅̅̅
⎧
Var(S) S > 0
⎨ (S − 1)
Z=
(8)
0S=/
0 √̅̅̅̅̅̅̅̅̅̅̅̅̅̅
⎩
(S + 1)
Var(S) S < 0
(5)
where N is the size of the prediction sample, yi is the predicted value, fi is
the true value, y is the average value of the predicted sequence, and f is
the average value of the actual sequence.
In the equations, E(Sk ) is the mean of Sk , and Var(Sk ) is the variance
of Sk .
If Z > 0, the X series shows an upward trend, and if Z < 0, the X series
shows a downward trend. The significance level α is given, and Z1− α/2
can be obtained from the normal distribution table. If |Z| > |Z1− α/2 |,
2.4.2. Trend test
2.4.2.1. Mann-Kendall test. The Mann-Kendall method (M-K method) is
a non-parametric statistical test method.
First, for the time series X{x1, x2, ⋯xn}, we define the test statistic S:
6
L. Na et al.
Water Research 211 (2022) 118040
then the sequence exhibits a significant trend change at the significance
level α.
(
)
(9)
P = 2 × 1 − ∂|Z|
(D42-D43-E42-E41) combination to predict the Chl-a concentration of
the D42 grid area, compared the prediction effects of the two combi­
nations, and selected the optimal combination. We conducted mutual
information entropy experiments on all the grids (Fig. 5). Each grid area
has a strong correlation with its surrounding area. When predicting the
target grid, we selected different combinations of the target grid and its
surrounding grid areas for the comparison experiments and selected the
combination method with the best prediction.
where P is the probability of the statistical trend, and ∂|Z| can be obtained
from the normal distribution table. When P < 1− α, the trend change was
not significant; when P > 1− α, the trend changed significantly.
2.4.2.2. Least squares method. The least-squares method fits a straight
line to determine the change trend of the sequence.
y = kx + b
3.1.2. Parameter analysis
The timestep is the key parameter of the model. When predicting a
sequence, we usually need to determine the optimal timestep, that is,
how many historical values are used to predict future values. MAE (mg/
m3) and MSE (mg/m3) represent the prediction errors. For the prediction
of the Chl-a concentration in the D42 area of the target grid, we chose
two combinations (D42-D43-E42) and (D42-D43-E42-E41), set the
different timestep values, respectively, compared the sizes of MAE and
MSE, and selected the best combination method and best timestep value
(Table 1).
When predicting the Chl-a concentration in the D42 grid area, the
(D42-D43-E42) combination method was selected to have the smallest
prediction errors and best model performance (Table 1). The overall
error of the (D42-D43-E42-E41) combination method is higher than that
of the (D42-D43-E42) combination, and the prediction was poor. The
reason for this result may be that the correlation between the E41 and
D42 grids is weak, and the mutual information entropy is less than 0.6.
Both the MAE and MSE values, as well as the error, were the smallest
when the timestep value was set to 30. When the timestep value was set
to 70, the MAE and MSE errors were relatively small but slightly larger
than those with the timestep of 30, and the prediction was better. The
experiments demonstrated that within the timestep range of 100, with
the increase in the timestep, the MAE and MSE values exhibited periodic
changes. When the timestep was 30 or 70, the model provided the best
prediction when compared with the historical Chl-a concentrations in
the SCS. This may be because the Chl-a concentration presents seasonal
and periodic changes.
(10)
The least-squares method determines the trend change of the
sequence using the first-order coefficient k of the straight line. If k > 0,
the sequence exhibits an upward trend. If k = 0, then the sequence is a
stationary sequence. When k < 0, the sequence exhibits a downward
trend.
3. Results and discussion
3.1. Index analysis
3.1.1. Mutual information
Before testing the CNN-LSTM model, we first clarified the correlation
of the grid area, selected multiple grids with a strong correlation with
the target grid, extracted features, and predicted the Chl-a concentration
of the target area. The mutual information entropy was calculated to
express the correlation of the grid area. The D42 grid area is used as an
example to calculate the mutual information entropy of the D42 grid and
other grids.
The grid area of D42 has the highest correlation with D43 and E42,
and the mutual information entropy is 0.705 and 0.772, respectively,
which are both greater than 0.7 (Fig. 4). D42 has a high correlation with
the E41 grid, and the mutual information entropy is approximately 0.6.
For the prediction, we selected the (D42-D43-E42) combination and the
Fig. 4. Mutual information entropy of D42 grid.
7
L. Na et al.
Water Research 211 (2022) 118040
Fig. 5. Regional correlation: (a) Mutual information entropy between all grids and (b) Masking result of mutual information entropy less than 0.7.
Table 1
MAE and MSE values of different network combinations and different timestep tests.
Timestep
MAE (mg/m3)
MSE (mg/m3)
D42-D43-E42
D42-D43-E42-E41
D42-D43-E42
D42-D43-E42-E41
10
20
30
40
50
60
70
80
90
100
6.91e-3
1.10e-2
1.09e-4
2.78e-4
2.26e-2
9.88e-3
7.28e-4
1.97e-4
6.51e-3
9.46e-3
7.30e-5
1.71e-4
8.39e-3
1.09e-2
1.19e-4
2.15e-4
2.32e-2
9.83e-3
8.23e-4
1.69e-4
9.30e-3
8.86e-3
1.38e-4
1.43e-4
8.01e-3
1.67e-2
1.08e-4
4.06e-4
2.15e-2
1.60e-2
7.38e-4
4.07e-4
1.22e-2
1.27e-2
2.69e-4
2.93e-4
160e-2
155e-2
4.10e-4
4.17e-4
By comparing the MAE and MSE errors, we determined the best value
of the network combination method and timestep. For the prediction of
the D42 grid area sequence, when selecting the combination (D42-D43E42) and set the time step to 30 or 70, the error was small, and the model
performance was good.
The epoch represents the number of training rounds for a training
sample. The timesteps were selected as 30 and 70, respectively, and
different epochs were selected based on experience to test the best
Table 2
MAE and MSE values of different timesteps and different Epochs.
Timestep
MAE (mg/m3)
MSE (mg/m3)
30
70
30
70
10
50
100
150
200
500
800
1000
2.45e-2
2.32e-2
8.53e-4
7.65e-4
7.27e-3
9.62e-3
1.00e-4
1.74e-4
6.51e-3
8.00e-3
7.30e-5
1.08e-4
6.29e-3
6.64e-3
6.80e-5
7.10e-5
5.56e-3
7.25e-3
6.10e-5
8.70e-5
5.13e-3
7.05e-3
5.60e-5
8.50e-5
576e-3
6.24e-3
6.90e-5
6.90e-5
6.36e-3
7.36e-3
8.20e-5
9.10e-5
8
L. Na et al.
Water Research 211 (2022) 118040
performance of the model (Table 2).
When the timesteps were 30 and 70, the errors of MAE and MSE were
not significantly different. When epoch was less than 100, the error was
large, indicating that the model was not fully trained. When the epoch
was greater than 500, the values of MAE and MSE began to increase,
indicating that the model was over-fitting. Increasing the epoch will also
lead to a long training time and high cost. Through experiments, we
found that after training approximately 150 times, the model stabilized.
Therefore, we set the epoch as 150 to provide the best model (Table 2).
Through the parameter analysis, we selected the (D42-D43-E42)
combination, set the timestep to 30 and epoch to 150, and as with the
prediction model of the D42 sequence, predicted the target grid D42
sequence and calculated the predicted loss curve (Fig. 6). After 150
rounds of training, the loss of the training set and the loss of the test set
were both below 0.005.
All the grid areas were predicted, and the true and predicted values
of the Chl-a concentration in June 2015 were selected for display
(Fig. 7). The CNN-LSTM model has high accuracy in predicting the Chl-a
in the study area, the error between the predicted value and the true
value was small, and the Chl-a concentration change trend in the entire
area was consistent. This shows that our model performs well in terms of
the regional Chl-a predictions. The same model with the same parame­
ters could be applied to the Chl-a prediction of the entire region, with
strong spatial stability and universal applicability. After training on all
the grid regions, the average MAE of the study area prediction was
1.20e-2 mg/m3, and the average MSE was 2.99e-4 mg/m3.
The grid resolution was 20 km × 20 km, and the standard deviation
and coefficient of variation for calculating a single grid area were small.
The average standard deviation was 0.0158, and the average coefficient
of variation was 0.201. This demonstrates that a remote sensing reso­
lution of 1 km and a grid resolution of 20 km × 20 km can accurately
analyze the characteristics of temporal and spatial changes in Chl-a
concentrations. The Chl-a content in the study area gradually
increased from west to east and from north to south (Fig. 7). The reason
may be due to the existence of Xiongnan Jiao in the north of the study
area and the Nares Bank in the west. The Chl-a concentration near the
coral reef beach is lower than that of the open ocean, which is consistent
with the results of Yahel et al. (1998). Yahel et al. (1998) believed that
the abundance and Chl-a concentration of phytoplankton near coral
reefs are 15–65% lower than that of the adjacent open waters, and the
decline in Chl-a near coral reefs is usually related to an increase in the
degradation products.
3.2. Comparison of three models
In this study, three models were used to predict the grid area dataset:
SARIMA, LSTM, and CNN-LSTM. We used the D42 area as an example to
make the predictions, compared the prediction performance of the three
models, and calculated the 95% confidence interval of the predicted
value.
For the SARIMA model, the stationarity test was first performed on
the time series, and the P value was 6.30e-10, which is less than the
significance level of 0.05, indicating that the time series was a stationary
series, and there was no need to carry out the difference operation, that
is, d = 0. The white noise test result was p = 3.40e-28, which is less than
the significance level of 0.05. We use the autocorrelation graph (acf) and
partial autocorrelation graph (pacf) to determine p = 4 and q = 4. The
concentration of Chl-a in the SCS exhibits seasonal changes with a halfyear cycle, so s = 12 was used.
3.2.1. Single-step prediction
Three models were used to predict the D42 sequence and determine
the confidence interval (Fig. 8).
The predicted value of the CNN-LSTM model best fit the true value
curve, and the confidence interval was the narrowest, indicating that the
predicted value was the closest to the true value. The LSTM model was
second, and the SARIMA model provided the worst prediction (Fig. 8). It
can be seen from the figure that the prediction interval is included in the
true value interval, which may be that the predictive data is not sensitive
to extreme values and human influences, such as government
Fig. 6. The loss curve of Chl-a prediction in the D42 grid area.
9
L. Na et al.
Water Research 211 (2022) 118040
Fig. 7. Comparison of Kriging interpolation results between true and predicted values in the study area: (a) True Chl-a value in June 2015 and (b) Predictive Chl-a
value in June 2015.
management (Yu et al., 2020).
Affected by complex geophysical and chemical effects, Chl-a changes
in the SCS have complex and multiscale characteristics. The Reed
Tablemount is located in the Nansha area of the central basin of the SCS.
The Chl-a concentration is generally low, maintained at approximately
0.1 mg/m3. The seasonal Chl-a time series of the Reed Tablemount
waters has a bimodal structure annually, with a time scale and period of
approximately 6 months. The Chl-a concentration is the highest in
winter, approximately 0.15 mg/m3, and the lowest in summer
(approximately 0.075 mg/m3). This is primarily because the SCS
monsoon is the key influencing factor for Chl-a changes in Liletan waters
(Yu et al., 2020). The strong monsoon and complex topography cause
the SCS to have a significant seasonal circulation system. In winter, the
northeast monsoon affects the sea, and the circulation in the SCS has a
cyclonic structure. Vertical mixing is significant, transporting nutrients
from the lower layer upward and promoting the growth of phyto­
plankton on the sea surface. In summer, driven by the southwest
monsoon, the SCS circulation presents an anticyclonic structure. The
summer monsoon is not as strong as the winter monsoon, which limits
the vertical mixing of the seawater. The increase in the sea surface
temperature is also an crucial factor in the decrease in the Chl-a con­
centration during summer (Chen et al., 2020). The increase in temper­
ature limits the growth and reproduction of phytoplankton; therefore,
the Chl-a content is lower than that in winter. The Chl-a concentrations
were high in 2007 and 2010 when La Niña phenomena occurred and
decreased significantly in 2002, 2006, 2009, and 2015 with the El Niño
phenomena. This is because when an El Niño phenomenon occurs, the
water temperature rises abnormally and the monsoons weaken, which
inhibits the diffusion of nutrients and reduces the amount of phyto­
plankton. When the La Niña phenomenon occurred, the opposite was
observed.
The prediction errors for the three models were calculated (Table 3).
The prediction error of The CNN-LSTM model had the smallest predic­
tion error and provided the best prediction of Chl-a concentrations in the
SCS.
3.2.2. Multi-step prediction
Three models were used to perform 1-year, 2-year, 3-year, 4-year,
and 5-year long-term rolling predictions on the D42 grid area and
calculate the Pearson correlation coefficient between the predicted
value and the true value (Table 4). The larger the Pearson correlation
coefficient, the higher the correlation between the predicted and true
values. The experiments verified that when the time step of the CNNLSTM model was 72, the correlation coefficient between the predicted
value and the true value was the largest, and the prediction performance
was the best. Seferian et al. (2014) determined that primary production
in tropical regions could be predicted three years in advance. Through
the experiments, we concluded that the long-term prediction of Chl-a
concentrations in the SCS can be up to three years.
As the predictions progressed, the Pearson correlation coefficient
decreased exponentially, indicating that the predictions worsen with an
increase in time. The performance of the CNN-LSTM model in predicting
the long-term Chl-a concentrations was significantly better than that of
the LSTM and SARIMA models (Table 4).
3.3. Long-term rolling prediction
We used the CNN-LSTM model to predict the long-term Chl-a se­
quences of all the grid regions. Then, the correlations between the
predicted and true values were calculated (Fig. 9).
10
L. Na et al.
Water Research 211 (2022) 118040
Fig. 8. Comparison of three model predictions: (a) CNN-LSTM prediction, (b) LSTM prediction, and (c) SARIMA prediction.
Table 3
Errors predicted by the three models.
MAE (mg/m3)
MSE (mg/m3)
CNN-LSTM
LSTM
SARIMA
6.29e-3
6.80e-5
1.82e-2
5.56e-4
2.34e-2
8.84e-4
Table 4
Long-term rolling prediction performance (r) of the three models in the D42 grid
area.
LSTM
CNN-LSTM
SARIMA
11
1 year
2 year
3 year
4 year
5 year
0.639
0.706
0.235
0.334
0.636
0.181
0.272
0.499
0.245
0.241
0.322
0.215
0.176
0.0248
0.105
L. Na et al.
Water Research 211 (2022) 118040
Fig. 9. Five-year Pearson correlation coefficient when using CNN-LSTM for long-term rolling prediction.
It can be seen from Fig. 9 that the prediction of all the grid areas of
the dataset gradually decreases with an increase in time. In the next
year, the Pearson correlation coefficient exceeds 0.8, indicating that the
predictions for one year are ideal. When predicting the five-year Chl-a
concentration, the Pearson correlation coefficient is reduced to below
0.4, the predicted value and the true value are quite different, and the
prediction accuracy is low. The variation in the prediction performance
for different grid areas indicates the need to fine-tune the model pa­
rameters or identifies the influence of human activities.
To express the level of CNN-LSTM long-term more accurately pre­
dictions, we calculated the regional average of the MAE, MSE, and
Pearson correlation coefficients of the CNN-LSTM long-term predictions
(Table 5).
As the prediction time increased, the average MAE and MSE errors of
the predicted regions gradually increased, and the Pearson correlation
coefficient gradually decreased. The prediction error of the three-year
Chl-a value was relatively small, and the Pearson correlation coeffi­
cient reached 0.5 (Table 5). Therefore, the CNN-LSTM model could
extend the Chl-a prediction to three years. The experiments demon­
strated that the prediction accuracy of the CNN-LSTM model is not only
significantly higher than the LSTM and SARIMA models, but it can also
be applied to long-term rolling predictions.
The study used the M-K trend test method and the least square
method to fit a straight line to analyze the monotonic change trend of the
Chl-a time series. The Chl-a sequence was divided into subsets in units of
years, and the M-K test statistic Z was obtained. The significance level
(α) was set to 0.05, and P is the probability of the statistical trend. Using
the Mann-Kendall trend test method and the least squares method, the
five-year long-term predicted values and true values of all grids in the
study area were analyzed (Table 6).
The M-K test indicates that from 2015 to 2020, the interannual
changes in Chl-a in the study area exhibited a slight upward trend, which
was not significant (P < 1-α). The least square test also established that
Chl-a in the study area demonstrated an upward trend (Table 6). This is
consistent with the results of previous studies (Palacz et al., 2011; Chen
et al., 2014b). From the monotonic trend analysis, it was concluded that
the Chl-a concentration in the central and southern SCS exhibited an
upward trend. The P is relatively low, which may be because the data
were retrieved using remote sensing, and the lack of comparison with
the measured voyage data resulted in a variations between the obtained
data and the true values. In addition, there were many missing data, and
the sequence obtained by interpolation produced errors compared with
the true values.
The trend analysis of a single grid demonstrates that the true Chl-a
concentration of the grids other than the H42 grid exhibited an up­
ward trend. However, the grid trend change in the predicted value has
both upward and downward trends, indicating that the long-term
Table 6
Grid average of the Chl-a concentration trends in the study area.
Table 5
Average performance indicators for the grid.
MAE (mg/m3)
MSE (mg/m3)
r
1 year
2 year
3 year
4 year
5 year
1.94e-2
6.10e-4
6.74e-1
2.02e-2
6.50e-4
5.42e-1
2.18e-2
7.50e-4
4.92e-1
2.38e-2
8.90e-4
3.65e-1
2.68e-2
1.23e-3
2.42e-1
Trend test
M-K test
The least square method
12
Z
P
k
b
True value
Predicted value
9.56e-1
3.65e-1
9.41e-5
1.11e-1
2.67e-1
4.60e-1
2.55e-5
1.19e-1
L. Na et al.
Water Research 211 (2022) 118040
Chl-a concentration prediction experiment was carried out using the
CNN-LSTM model. The long-term prediction of the Chl-a seasonal data
was carried out, and the Chl-a prediction time was extended to three
years.
This research improved the CNN-LSTM model to achieve long-term
predictions on a small sample dataset. The traditional LSTM timeseries prediction model performs poorly on small sample datasets. The
improvement of the CNN-LSTM prediction model in this study can
effectively address this problem. This method is based on the idea of
combining spatio-temporal features of data to expand the dataset, and it
performs well on small sample datasets. Compared with the LSTM and
SARIMA models, the results indicate that the prediction accuracy of the
CNN-LSTM model is much higher than that of the LSTM and SARIMA
models, and the training speed is also faster than that of the two models.
A long-term prediction experiment on Chl-a concentrations was carried
out, and it was found that the Pearson correlation coefficient (r) reached
0.674, 0.542, 0.492 for one year, two years, and three years, respec­
tively. A trend analysis was conducted on the predictive data and true
values from 2015 to 2020, and it was found that the Chl-a concentration
exhibited an upward trend from 2015 to 2020. In addition, the inspec­
tion effect is better for large-scale areas, and the model provides a better
predictions. This indicates that the model may be applicable to largerscale sea areas and has the potential to predict Chl-a concentrations in
global seas.
prediction for the grid is volatile and has certain errors. This may be
because the predicted data were added to the sequence to predict the
next value during the prediction, and the accuracy of the prediction will
decrease annually. In addition, factors such as human activity trajec­
tories and submerged reefs also affect the prediction accuracy of a single
grid area. The Reed Tablemount area is rich in oil and gas resources, and
some oil and gas development activities in the surrounding countries
may affect the predicted level of Chl-a. Areas with more human devel­
opment activities may affect the change in the Chl-a concentration,
leading to the inapplicability and affecting the predictive performance
of the model. Furthermore, there numerous submerged reefs in the Reed
Tablemount, which may affect the distribution of Chl-a in different grid
areas. The long-term prediction of a single grid has errors, but for the
entire region, the long-term prediction of regional Chl-a concentrations
is consistent with the actual situation. Therefore, this study analyzed
whether the model performs better in large-scale predictions than for a
single grid. This results indicate that the model has the potential to be
applied to a wider area, for example, other waters of the SCS, and even
global waters.
From the perspective of the overall study area, both the predicted
and true values show an upward trend on a long-term scale. This in­
dicates that the long-term predictions of Chl-a concentrations are ideal
for the entire study area. The overall Chl-a concentration on the surface
of the Lile Beach exhibited an upward trend. This may be due to the
influence of wind speed (Yu et al., 2020). The annual increase in wind
speed in the SCS promotes the vertical mixing of seawater, which is
beneficial to the growth and reproduction of phytoplankton, thereby
promoting an increase in the Chl-a concentration (Jiang et al., 2019).
Furthermore, a gradual increase in human activities may cause SCS
pollution. The number of red tides in the SCS in recent decades has also
exhibited an upward trend, which is consistent with the upward trend of
the Chl-a concentration. This indicates that the increase in the Chl-a
concentration may be related to the occurrence of HABs events. There­
fore, the long-term prediction of Chl-a has the potential to provide
long-term monitoring and early warning of HABs in the SCS. Due to the
complexity around the SCS, HABs governance in the SCS also has the
characteristics of fragmentation and lag. It is necessary to increase in­
vestments in marine monitoring and predictions, establish long-term
Chl-a marine monitoring and predicting networks, and effectively pro­
vide early warnings of HABs. Chl-a and many other water quality fac­
tors, such as sea temperature, organic carbon, yellow substances, and
heavy metals, constitute vital indicators for marine water quality
monitoring and predictions. Currently, based on the research on the
temporal and spatial changes in Chl-a concentrations, we determined
through some extended experiments that the improved CNN-LSTM
model can also be applied to other seasonal seawater elements. The
next step is to carry out relevant predicting research on other seawater
elements, establish a multi-element coupling model to form a global
long-term prediction system for marine biochemistry, and participate in
global governance.
Declaration of Competing interest
The authors declare that they have no known competing financial
interests or personal relationships that could have appeared to influence
the work reported in this paper.
Acknowledgments
The work is supported by Tianjin Philosophy and Social Science
Planning Project of China (No. TJKS20XSX-015), the National Social
Science Foundation of China (No. 20VHQ002) and the National Natural
Science Foundation of China (No. 41701480).
Supplementary materials
Supplementary material associated with this article can be found, in
the online version, at doi:10.1016/j.watres.2022.118040.
References
Alizadeh, M.J., Kavianpour, M.R., 2015. Development of wavelet-ANN models to predict
water quality parameters in Hilo Bay, Pacific Ocean. Mar. Pollut. Bull. 98, 171–178.
https://doi.org/10.1016/j.marpolbul.2015.06.052.
Ark, Y., Cho, K.H., Park, J., Cha, S.M., Kim, J.H., 2015. Development of early-warning
protocol for predicting chlorophyll-a concentration using machine learning models
in freshwater and estuarine reservoirs. Korea. Sci. Total Environ. 502, 31–41.
https://doi.org/10.1016/j.scitotenv.2014.09.005.
Bao, S.D., Meng, J.M., Sun, L.N., Liu, Y.X., 2020. Detection of ocean internal waves based
on Faster R-CNN in SAR images. J. Oceanol. Limnol. 38, 55–63. https://doi.org/
10.1007/s00343-019-9028-6.
Barzegar, R., Aalami, M.T., Adamowski, J., 2020. Short-term water quality variable
prediction using a hybrid CNN-LSTM deep learning model. Stoch. Env. Res. Risk A.
34, 415–433. https://doi.org/10.1007/s00477-020-01776-2.
Barzegar, R., Aalami, M.T., Adamowski, J., 2021. Coupling a hybrid CNN-LSTM deep
learning model with a boundary corrected maximal overlap discrete wavelet
transform for multiscale lake water level forecasting. J. Hydrol. 598, 126196 https://
doi.org/10.1016/j.jhydrol.2021.126196.
Box, G.E.P., Jenkins, G.M., Reinsel, G.C., 1976. Time series analysis forecasting and
control - Rev. J. Time Ser. Anal. 31, 238–242. https://doi.org/10.2307/3150485.
Carneiro, F.M., Nabout, J.C., Vieira, L.C.G., Roland, F., Bini, L.M., 2014. Determinants of
chlorophyll a concentration in tropical reservoirs. Hydrobiologia 740, 89–99.
https://doi.org/10.1007/s10750-014-1940-3.
Chassot, E., Bonhommeau, S., Dulvy, N.K., Mélin, F., Watson, R., Gascuel, D., Le Pape, O.,
2010. Global marine primary production constrains fisheries catches. Ecol. Lett. 13,
495–505. https://doi.org/10.1111/j.1461-0248.2010.01443.x.
4. Conclusion
Through spatio-temporal trend analyses, this study found that the
Chl-a concentrations on the surface of the Reed Tablemount have been
increasing. Furthermore, the number of HABs events in the SCS in recent
decades has also increased on an interannual scale, consistent with the
rising trend of Chl-a concentrations. This indicates that the increase in
the Chl-a concentration may be related to the occurrence of HABs
events. Therefore, the long-term prediction of Chl-a seasonality has the
potential to provide long-term monitoring and early warning of HABs
and support for fishery management, protection of marine endangered
species and marine ecosystem health, and global ocean governance.
However, the long-term predictions of Chl-a are still in its infancy.
Currently, to the best of the authors’ knowledge, the longest prediction
time for tropical oceans in related studies is 12 months. In this study, a
13
L. Na et al.
Water Research 211 (2022) 118040
Li, X., Sha, J., Wang, Z.L., 2017c. Chlorophyll-A prediction of lakes with different water
quality patterns in China based on hybrid neural networks. Water (Basel) 9, 524.
https://doi.org/10.3390/w9070524.
Li, X., Sha, J., Wang, Z.L., 2018. Application of feature selection and regression models
for chlorophyll-a prediction in a shallow lake. Environ. Sci. Pollut. R. 25,
19488–19498. https://doi.org/10.1007/s11356-018-2147-3.
Liu, Y.C., Duan, W.Y., Huang, L.M., Duan, S.L., Ma, X.W., 2020. The input vector space
optimization for LSTM deep learning model in real-time prediction of ship motions.
Ocean Eng. 213, 107681 https://doi.org/10.1016/j.oceaneng.2020.107681.
Lu, Z.J., Zhu, L., Pei, H.P., Wang, Y., 2008. The model of chlorophyll a concentration
forecast in the West Lake based on wavelet analysis and BP neural networks. Acta
Chim. Sinica 28, 4965–4973. CNKI:SUN:STXB.0.2008-10-044.
Ma, S.Q., Liu, Q.Y., Zhang, Y.D., 2021. A prediction method of fire frequency: based on
the optimization of SARIMA model. PLoS ONE 16, e0255857. https://doi.org/
10.1371/journal.pone.0255857.
McOwen, C.J., Cheung, W.W., Rykaczewski, L.R.R., Watson, R.A., Wood, L.J., 2015. Is
fisheries production within large marine ecosystems determined by bottom-up or
top-down forcing? Fish Fish 16, 623–632. https://doi.org/10.1111/faf.12082.
Noori, R., Yeh, H.D., Abbasi, M., Kachoosangi, F.T., Moazami, S., 2015. Uncertainty
analysis of support vector machine for online prediction of five-day biochemical
oxygen demand. J. Hydrol. 527, 833–843. https://doi.org/10.1016/j.
jhydrol.2015.05.046.
O’Reilly, J.E., Maritorena, S., Mitchell, B.G., Siegel, D.A., Carder, K.L., Garver, S.A.,
Kahru, M., McClain, C., 1998. Ocean color chlorophyll algorithms for SeaWiFS.
J. Geophys. Res-Oceans 103, 24937–24953. https://doi.org/10.1029/98JC02160.
Palacz, A.P., Xue, H.J., Armbrecht, C., Zhang, C.Y., Chai, F., 2011. Seasonal and interannual changes in the surface chlorophyll of the South China Sea. J. Geophys. ResOceans 116, 015. https://doi.org/10.1029/2011JC007064.
Park, J.Y., Stock, C.A., Dunne, J.P., Yang, X.S., Rosati, A., 2019. Seasonal to multiannual
marine ecosystem prediction with a global Earth system model. Science 365. https://
doi.org/10.1126/science.aav6634, 284-+.
Park, Y., Cho, K.H., Park, J., Cha, S.M., Kim, J.H., 2015. Development of early-warning
protocol for predicting chlorophyll-a concentration using machine learning models
in freshwater and estuarine reservoirs. Korea. Sci. Total Environ. 502, 31–41.
https://doi.org/10.1016/j.scitotenv.09.005.
Rostam, N.A.P., Malim, N.H.A.H., Abdullah, R., Ahmad, A.L., Ooi, B.S., Chan, D.J.C.,
2021. A complete proposed framework for coastal water quality monitoring system
with algae predictive model. IEEE Access 9, 108249–108265. https://doi.org/
10.1109/ACCESS.2021.3102044.
Rousseaux, C.S., Gregg, W.W., 2017. Forecasting ocean chlorophyll in the equatorial
pacific. front. Mar. Sci. 4, 236. https://doi.org/10.3389/fmars.2017.00236.
Rousseaux, C.S., Gregg, W.W., Ott, L., 2021. Assessing the skills of a seasonal forecast of
chlorophyll in the global pelagic oceans. Remote Sens-Basel 13, 1051. https://doi.
org/10.3390/rs13061051.
Seferian, R., Bopp, L., Gehlen, M., Swingedouw, D., Mignot, J., Guilyardi, E.,
Servonnat, J., 2014. Multiyear predictability of tropical marine productivity. P. Natl.
Acad. Sci. U.S.A. 111, 11646–11651. https://doi.org/10.1073/pnas.1315855111.
Shi, B.G., Bai, X., Yao, C., 2015. An end-to-end trainable neural network for image-based
sequence recognition and its application to scene text recognition. IEEE T. Pattern
Anal. 39, 2298–2304. https://doi.org/10.1109/TPAMI.2016.2646371.
Shi, S.X., Wang, L., Yu, X., Xu, L.Y., 2020. Application of long term and short term
memory neural network in prediction of chlorophyll a concentration. Acta Oceanol.
Sin. 42, 134–142. https://doi.org/10.3969/j.issn.0253-4193.2020.02.014.
Sinshaw, T.A., Surbeck, C.Q., Yasarer, H., Najjar, Y., 2019. Artificial neural network for
prediction of total nitrogen and phosphorus in US lakes. J. Environ. Eng. 145,
04019032 https://doi.org/10.1061/(ASCE)EE.1943-7870.0001528.
Song, T., Jiang, J., Li, W., Xu, D., 2020. A Deep Learning Method With Merged LSTM
Neural Networks for SSHA Prediction. IEEE J-Stars 13, 2853–2860. https://doi.org/
10.1109/JSTARS.2020.2998461.
Stock, C.A., Dunne, J.P., John, J.G., 2014. Global-scale carbon and energy flows through
the marine planktonic food web: an analysis with a coupled physical-biological
model. Prog. Oceanogr. 120, 1–28. https://doi.org/10.1016/j.pocean.2013.07.001.
Stock, C.A., John, J.G., Rykaczewski, R.R., Asch, R.G., Cheung, W.W.L., Dunne, J.P.,
Friedland, K.D., Lam, V.W.Y., Sarmiento, J.L., Watson, R.A., 2017. Reconciling
fisheries catch and ocean productivity. P. Natl. Acad. Sci. U.S.A. 114, E1441–E1449.
https://doi.org/10.1073/pnas.1610238114.
Vilas, L.G., Spyrakos, E., Palenzuela, J.M.T., 2011. Neural network estimation of
chlorophyll a from MERIS full resolution data for the coastal waters of Galician rias
(NW Spain). Remote Sens. Environ. 115, 524–535. https://doi.org/10.1016/j.
rse.2010.09.021.
Vollenweider, R.A., 1975. Input-output models with special reference to the phosphorus
loading concept in limnology. Schweizerische Zeitschrift Hydrol. 37, 53–84. https://
doi.org/10.1007/bf02505178.
Wang, S.F., Tang, D.L., He, F.L., Fukuyo, Y.S., Azanza, R.V., 2008. Occurrences of
harmful algal blooms (HABs) associated with ocean environments in the South China
Sea. Hydrobiologia 596, 79–93. https://doi.org/10.1007/s10750-007-9059-4.
Wang, Y., Xu, C., Zhang, S., Yang, L., Wang, Z., Zhu, Y., Yuan, J., 2019. Development and
evaluation of a deep learning approach for modeling seasonality and trends in handfoot-mouth disease incidence in mainland. China. Sci. Rep-Uk 9, 1–15. https://doi.
org/10.1038/s41598-019-44469-9.
Weninger, F., Bergmann, J., Schuller, B.W., 2015. Introducing CURRENNT: the munich
open-source CUDA recurrent neural network toolkit. J. Mach. Learn. Res. 16,
547–551. https://doi.org/10.5555/2789272.2789289.
Westberry, T., Behrenfeld, M.J., Siegel, D.A., Boss, E., 2008. Carbon-based primary
productivity modeling with vertically resolved photoacclimation. Global
Biogeochem. Cy. 22, GB2024. https://doi.org/10.1029/2007GB003078.
Chavez, F.P., Ryan, J., Lluch-Cota, S.E., Niquen, C.M., 2003. From anchovies to sardines
and back: multidecadal change in the Pacific Ocean. Science 299, 217–221. https://
doi.org/10.1126/science.1075880.
Chen, B.Z., Liu, H.B., Xiao, W.P., Wang, L., Huang, B.Q., 2020. A machine-learning
approach to modeling picophytoplankton abundances in the South China Sea. Prog.
Oceanogr. 189, 102456 https://doi.org/10.1016/j.pocean.2020.102456.
Chen, M.J., Li, J., Dai, X., Sun, Y., Chen, F.Z., 2011. Effect of phosphorus and
temperature on chlorophyll a contents and cell sizes of Scenedesmus obliquus and
Microcystis aeruginosa. Limnology 12, 187–192. https://doi.org/10.1007/s10201010-0336-y.
Chen, Q., Rui, H., Li, W., Zhang, Y., 2014a. Analysis of algal Bloom risk with
uncertainties in lakes by integrating self-organizing map and fuzzy information
theory. Sci. Total Environ. 482, 318–324. https://doi.org/10.1016/j.
scitotenv.2014.02.096.
Chen, X.Y., Pan, D., Bai, Y., He, X.Q., Wang, T.Y., 2014b. Are the trends in the surface
chlorophyll opposite between the South China Sea and the Bay of Bengal? Remote
Sens-Basel. 9240, 924019 https://doi.org/10.1117/12.2067584.
Cho, H., Choi, U.J., Park, H., 2018. Deep learning application to time series prediction of
daily chlorophyll-a concentration. WIT Trans. Ecol. Environ. 215, 157–163. https://
doi.org/10.2495/EID180141.
Choi, J., Kim, J., Won, J., Min, O., 2019. Modelling chlorophyll-a concentration using
deep neural networks considering extreme data imbalance and skewness. In: 2019
21st International Conference on Advanced Communication Technology (ICACT).
IEEE, pp. 631–634. https://doi.org/10.23919/ICACT.2019.8702027.
Dall’Olmo, G., Gitelson, A.A., Rundquist, D.C., 2003. Towards a unified approach for
remote estimation of chlorophyll-a in both terrestrial vegetation and turbid
productive waters. Geophys. Res. Lett. 30, 1938–1941. https://doi.org/10.1029/
2003GL018065.
Dutkiewicz, S., Cermeno, P., Jahn, O., Follows, M.J., Hickman, A.E., Taniguchi, D.A.A.,
Ward, B., 2020. Dimensions of marine phytoplankton diversity. Biogeosciences 17,
609–634. https://doi.org/10.5194/bg-17-609-2020.
Elith, J., Leathwick, J.R., 2009. Species distribution models: ecological explanation and
prediction across space and time. Annu. Rev. Ecol. Evol. S. 40, 677–697. https://doi.
org/10.1146/annurev.ecolsys.110308.120159.
Fijani, E., Barzegar, R., Deo, R., Tziritis, E., Konstantinos, S., 2019. Design and
implementation of a hybrid model based on two-layer decomposition method
coupled with extreme learning machines to support real-time environmental
monitoring of water quality parameters. Sci. Total Environ. 648, 839–853. https://
doi.org/10.1016/j.scitotenv.2018.08.221.
Gers, F.A., Schraudolph, N.N., Schmidhuber, J., 2002. Learning precise timing with
LSTM recurrent networks. J. Mach. Learn. Res. 3, 115–143. https://doi.org/
10.1162/153244303768966139.
Gitelson, A.A., Schalles, J.F., Hladik, C.M., 2007. Remote chlorophyll-a retrieval in
turbid, productive estuaries: chesapeake Bay case study. Remote Sens. Environ. 109,
464–472. https://doi.org/10.1016/j.rse.2007.01.016.
Guo, Q.H., Jin, S.C., Li, M., Yang, Q.L., Xu, K.X., Ju, Y.Z., Zhang, J., Xuan, J., Liu, J.,
Su, Y.J., Xu, Q., Liu, Y., 2020. Application of deep learning in ecological resource
research:theories, methods, and challenges. Sci. China Earth Sci. 63, 1457–1474.
https://doi.org/10.1007/s11430-019-9584-9.
Hochreiter, S., Schmidhuber, J., 1997. Long short-term memory. Neural Comput. 9,
1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735.
Hou, G.X., Song, L.R., Liu, J.T., Xiao, B.D., Liu, Y.D., 2004. Modeling of cyanobacterial
blooms in hypereutrophic Lake Dianchi. China. J. Freshwater Ecol. 19, 623–629.
https://doi.org/10.1080/02705060.2004.9664743.
Huang, M., Tian, D., Liu, H., Zhang, C., Yi, X., Cai, J., Ruan, J., Zhang, T., Kong, S.,
Ying, G., 2018. A hybrid fuzzy wavelet neural network model with self-adapted
fuzzy-means clustering and genetic algorithm for water quality prediction in rivers.
Complexity 2018, 1–11. https://doi.org/10.1155/2018/8241342.
Jiang, B., Wei, Y.L., Ding, J., Zhang, R., Liu, Y.X., Wang, X.Y., Fang, Y.Z., 2019. Trends of
sea surface wind energy over the South China Sea. J. Oceanol. Limnol. 37,
1510–1522. https://doi.org/10.1007/s00343-019-8307-6.
Jørgensen, S.E., Mejer, H., Friis, M., 1978. Examination of a lake model. Ecol. Model. 4,
253–278. https://doi.org/10.1016/0304-3800(78)90010-8.
Kisi, O., Parmar, K.S., 2016. Application of least square support vector machine and
multivariate adaptive regression spline models in long term prediction of river water
pollution. J. Hydrol. 534, 104–112. https://doi.org/10.1016/j.jhydrol.2015.12.014.
Lee, S., Lee, D., 2018. Improved prediction of harmful algal blooms in four major South
Korea’s rivers using deep learning models. Int. J. Env. Res. Pub. He. 15, 1322.
https://doi.org/10.3390/ijerph15071322.
Lee, Y.J., Matrai, P.A., Friedrichs, M.A.M., Saba, V.S., Antoine, D., Ardyna, M.,
Asanuma, I., Babin, M., Belanger, S., Benoit-Gagne, M., Devred, E., FernandezMendez, M., Gentili, B., Hirawake, T., Kang, S.H., Kameda, T., Katlein, C., Lee, S.H.,
Lee, Z.P., Melin, F., Scardi, M., Smyth, T.J., Tang, S., Turpie, K.R., Waters, K.J.,
Westberry, T.K., 2015. An assessment of phytoplankton primary productivity in the
Arctic Ocean from satellite ocean color/in situ chlorophyll-a based models.
J. Geophys. Res-Oceans. 120, 6508–6541. https://doi.org/10.1002/2015JC011018.
Li, B., Yang, G.S., Wan, R.R., Hormann, G., Huang, J.C., Fohrer, N., Zhang, L., 2017a.
Combining multivariate statistical techniques and random forests model to assess
and diagnose the trophic status of Poyang Lake in China. Ecol. Indic. 83, 74–83.
https://doi.org/10.1016/j.ecolind.2017.07.033.
Li, X., Peng, L., Yao, X., Cui, S., Hu, Y., You, C., Chi, T., 2017b. Long short-term memory
neural network for air pollutant concentration predictions: method development and
evaluation. Environ. Pollut. 231, 997–1004. https://doi.org/10.1016/j.
envpol.2017.08.114.
14
L. Na et al.
Water Research 211 (2022) 118040
453. IOP Publishing Ltd, 012002. https://doi.org/10.1088/1755-1315/453/1/
012002.
Yu, Z.Y., Yang, K., Luo, Y., Shang, C.X., 2020. Spatial-temporal process simulation and
prediction of chlorophyll-a concentration in Dianchi Lake based on wavelet analysis
and long-short term memory network. J. Hydrol. 582, 124488 https://doi.org/
10.1016/j.jhydrol.2020.124488.
Yussof, F.N., Maan, N., Reba, M.N.M., 2021. LSTM networks to improve the prediction of
harmful algal blooms in the west coast of Sabah. Int. J. Env. Res. Pub. He. 18, 7650.
https://doi.org/10.3390/ijerph18147650.
Zeng, X., Zhang, Y., 2013. Development of recurrent neural network considering
temporalspatial input dynamics for freeway travel time modeling. Comput-Aided.
Civ. Inf. 28, 359–371. https://doi.org/10.1111/mice.12000.
Zeng, Y., Yang, Z.F., Liu, J.L., 2006. Prediction of the concentration of chlorophyll-alpha
for Liuhai urban lakes in Beijing City. J. Environ. Sc. 18, 827–831. CNKI:SUN:
HJKB.0.2006-04-035.
Zhang, Q.T., 2013. Review on the Annual Variation of Red Tides in China Sea.
Environmental Monitoring in China 29, 98–102. https://doi.org/10.19316/j.
issn.1002-6002.2013.05.019.
Zhao, W.X., Zhou, B., Liu, H.L., Li, H., Jiang, D.G., Ji, M., 2017. BP neural network-based
short-term prediction of chlorophyll concentration inmainstreamof Haihe River.
Water Resour. Hydropower Eng. 48, 134–140. https://doi.org/10.13928/j.cnki.
wrahe.2017.11.023.
Zheng, L., Wang, H.P., Liu, C., Zhang, S.R., Ding, A.Z., Xie, E., Li, J., Wang, S.R., 2021.
Prediction of harmful algal blooms in large water bodies using the combined EFDC
and LSTM models. J. Environ. Manage. 295, 113060 https://doi.org/10.1016/j.
jenvman.2021.113060.
Zou, W., Zhu, G.W., Cai, Y.J., Vilmi, A., Xu, H., Zhu, M.Y., Gong, Z.J., Zhang, Y.L., Qin, B.
Q., 2020. Relationships between nutrient, chlorophyll a and Secchi depth in lakes of
the Chinese eastern plains ecoregion: implications for eutrophication management.
J. Environ. Manage. 260, 109923 https://doi.org/10.1016/j.jenvman.2019.109923.
Wu, Y., Liu, S., 2012. Modeling of land use and reservoir effects on nonpoint source
pollution in a highly agricultural basin. J. Environ. Monitor. 14, 2350–2361. https://
doi.org/10.1039/c2em30278k.
Wu, Z., Wang, X., Chen, Y., Cai, Y., Deng, J., 2018. Assessing river water quality using
water quality index in Lake Taihu Basin. China. Sci. Total Environ. 612, 914–922.
https://doi.org/10.1016/j.scitotenv.2017.08.293.
Xiao, X., He, J.Y., Huang, H.M., Miller, T.R., Christakos, G., Reichwaldt, E.S.,
Ghadouani, A., Lin, S.P., Xu, X.H., Shi, J.Y., 2017. A novel single-parameter
approach for forecasting algal blooms. Water Res. 108, 222–231. https://doi.org/
10.1016/j.watres.2016.10.076.
Xie, J., Zhang, J., Yu, J., Xu, L., 2020. An adaptive scale sea surface temperature
predicting method based on deep learning with attention mechanism. IEEE Geosci.
Remote S. 17, 740–744. https://doi.org/10.1109/LGRS.2019.2931728.
Xu, G.C., Li, P., Lu, K.X., Zhan, T.T., Zhang, J.X., Ren, Z.P., Wang, X.K., Yu, K.X., Shi, P.,
Cheng, Y.T., 2019. Seasonal changes in water quality and its main influencing factors
in the Dan River basin. Catena 173, 131–140. https://doi.org/10.1016/j.
catena.2018.10.014.
Xu, Y.F., Ma, C.Z., Liu, Q., Xi, B.D., Qian, G.R., Zhang, D.Y., Huo, S.L., 2015. Method to
predict key factors affecting lake eutrophication - A new approach based on Support
Vector Regression model. Int. Biodeter. Biodegr. 102, 308–315. https://doi.org/
10.1016/j.ibiod.2015.02.013.
Yahel, G., Post, A.F., Fabricius, K., Marie, D., Vaulot, D., Genin, A., 1998. Phytoplankton
distribution and grazing near coral reefs. Limnol. Oceanogr. 43, 551–563. https://
doi.org/10.4319/lo.1998.43.4.0551.
Yajima, H., Derot, J., 2018. Application of the Random Forest model for chlorophyll-a
forecasts in fresh and brackish water bodies in Japan, using multivariate long-term
databases. J. Hydroinform. 20, 206–220. https://doi.org/10.2166/hydro.2017.010.
Yang, X., Huang, M.T., Bai, K.Y., 2019. Simulation system of lake eutrophication
evolution based on RS & GIS technology–a case study in Wuhan East Lake. In: 2019
5th International Conference on Green Materials and Environmental Engineering,
15
Download