Capabilities and Limitations of Neural Networks in Snow Cover 1. Introduction

advertisement
Capabilities and Limitations of Neural Networks in Snow Cover
Mapping from Passive Microwave Data
Juan C. Arevalo ( NOAA/CREST Graduate student) jc@ce.ccny.cuny.edu, Hosni Ghedira (Assistant Professor) ghedira@ce.ccny.cuny.edu, Reza Khanbilvardi (Professor) rk@ccny.cuny.edu.
The City College of the City University of New York. Convent Avenue at 140th St, Steinman Hall, New York, NY. 10031.
2. Study Area, Data Acquisition and Remote Sensing Data
1. Introduction
SSM/I Images
Snow-cover parameters are being increasingly used as input to hydrological models.
Having an accurate estimation of the snow cover characteristics during the snowmelt
season is indispensable for an efficient hydrological modeling and for an improved
snowmelt runoff forecasts.
Passive microwave remote sensing techniques have been investigated by numerous
researchers using various sensors and have been demonstrated to be effective for
monitoring snow pack parameters such as spatial and temporal distribution, snow
water equivalent (SWE), depth, and snow condition (wet/dry state). However, the snow
products derived from passive microwave sensors are usually limited by the relatively
low resolution, especially when the purpose is to use this product as input for
hydrological models. Moreover, the accuracy of passive-microwave-based maps is
usually affected by the presence of vegetation.
In this project, we used an adaptive neural network system to generate the spatial
distribution of snow accumulation from multi-channel SSM/I data in the Northern
Midwest of the United States. Five SSM/I channels were used in this experiment (19H,
19V, 22V, 37V, and 85V). The normalized difference vegetation index (NDVI) is used
from the AVHRR to quantify the vegetation dynamic in the snow mapping process. Six
snow days with high snow accumulation have been selected during the 2001/2002
winter season to train and test the neural network system. The snow depths and NDVI
values have been compiled and gridded into 25 km x 25 km grid to match the final
SSM/I resolution. To ensure an accurate selection of training pixels, different
approaches have been tested by varying the selection criteria of snow pixels. The final
results have shown the importance of these selection criterions on the neural network
performance.
3. Data Acquisition: Vegetation
AVHRR Image
The Normalized Difference Vegetation Index (NDVI) has been derived from
the visible and the near-infrared channels of NOAA-AVHRR Sensor over our
study area.
8-Km, 10-days composite
image, January 21-31 1994.
NDVI
48.7
Jan17 19H
Jan18 19H
Jan24 19H
Jan25 19H
Six days have been selected during the
2001/2002 winter season (01/16, 01/17,
01/18, 01/23, 01/24, and 01/25).
The following illustrations show the
channel 19H of SSM/I satellite with 25
km resolution for the six selected days.
Source of Data:
Ground data from National Oceanic and
Atmospheric Administration, NOAA
SSM/I data from DMSP SSM/I Pathfinder daily
EASE-Grid brightness temperatures, January
2002. Boulder, CO: National Snow and Ice Data
Center, NSIDC (Armstrong, R.L., K.W. Knowles,
M.J. Brodzik and M.A. Hardman. 1994,
updated current year).
The study area is located in the Northern Midwest of the United States within 110˚37’48’’W - 102˚02’24’’W
and 48˚42’36’’N - 40˚43’48’’N.. The passive microwave data from the NOAA/NASA Pathfinder Program
Special Sensor Microwave/Imager (SSM/I) Level 3 Equal Area Scalable Earth-Grid (EASE-Grid) Brightness
Temperatures F13 satellite is used in both ascending and descending orbits. These images provide
measurements of the brightness temperature in seven channels with different frequencies and polarizations (19
V, 19 H, 22 V, 37V, 37 H, 85 V, and 85 H).
Jan23 19H
Original gridded data in Northern
Hemisphere
projection
with
coordinates: 119˚44’W - 99˚57’W
and 49˚36’W - 34˚34’W
4. Artificial Neural Network
6. Neural Network Results
8. Snow Cover Maps
The available training data, has been divided into three subsets:
The graph below shows the accuracy variation of 100 neural network trained with different initial configurations. Threshold 0.6 for the
approach 4, which yields the net with the highest accuracy. That net was used to simulate the corresponding snow maps.
The following color images represent two snow-cover maps for each selected day
generated from the artificial neural network output. Each map contains 34 X 30
pixels with spatial resolution of 25 km.
• The first one is the learning set, whish is used for computing and updating the network weights.
• The second subset is the validation set, which is used for stopping the training by monitoring
the validation error during the training process.
• The third subset is the test set that is not used during the training process, and it is only used to
assess the classification accuracy and to compare between different classifiers and different
network configurations.
25 Km – NDVI Image
Jan16 19H
Study Area, Covered by SSM/I
34x30 pixels
Approach 4. Five channels,Tb + standard deviation of NDVI. Threshold 0.6
100
%
5 Channels
90
5 Channels + St Dev NDVI
48.7
80
48.7
70
46.7
No
Cover
age
46.7
60
Network Architecture : [5:10:10:1]
44.7
50
The 190 training pixels have been
set as follow:
42.6
Input layer
2 hidden layers
40
Output layer
30
Learning set
40.7
1110.6
108.8
106.5
104.4
90 pixels
20
102.0
10
Validation set
The AVHRR data was acquired from the
Distributed Active Archive Center (DAAC)
located at Goddard Space Flight Center,
NASA.
45 pixels
0
1
Test set
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21 22
23 24
25 26
27 28 29
30
31 32
33 34
35 36
37 38 39
40
41 42
43 44
45 46
47 48
49
50
51 52
53
54
55
56
57
58
59 60
61 62 63
64
65 66
67 68
69
70
71
72
73
74
75
76
77
78
79
80
81 82
83 84
85 86
87 88 89
90
91 92
93 94
95 96
97 98 99 100
Run
50 pixels
The overall accuracy varies between 60 and 80 %, having a fairly stable pattern. However, some high and low peaks have been
observed; 86 % and 52% being the highest and lowest accuracy. The Kappa coefficient, which assess the agreement in the
classification between the snow and non-snow pixels, has a very irregular pattern.
25 Km – St Dv. NDVI Image
48.7
The original 8 Km NDVI values have
been gridded into a regular grid of 25 Km
over the study area.
2
46.7
42.6
110.6
108.8
106.5
104.4
22 V
102.0
%
90
70
60
37 V
50
40
30
20
85 V
10
0
Latitud
49 [N]
0
0.2
0.4
0.6
0.8
47
46
1
During the simulation process, a continuous range from zero to one will
be produced by the output neuron. We have introduced a threshold value
(between 0 and 1) to decide if the pixel will be classified as snow or nosnow pixel.
The optimal threshold value cannot be identified with certainty without
measuring its effect on the overall accuracy of the neural network
classification. In this project, the threshold value has been varied from 0.2
to 0.8. The effect of the decision threshold on classification accuracy of
each class is illustrated in the following figure:
Threshold
48
Graph shows the pattern and difference from the
truth data; snow/no-snow and the ANN result
The input layer size may vary depending
on the approach used; from 5 to 7 neurons.
Actual snow depth from the truth data and the ANN result
45
1
42
ANN
0.8
0.8
5. Neural Network Approaches
Snow - Snow
Non snow – Non snow
41
Threshold 0.4
Results obtained for the four approaches (number of
input channels): 1. the 5 brightness temperature (Tb)
channels; 2. the 5 Tb plus NDVI; 3. the 5 Tb plus NDVI
and standard deviation, and 4. the 5 Tb plus the
standard deviation of NDVI.
40
%
39
95
90
85
38
108
107
106
105
104
103
102
101
100
99
98
Longitud [W]
80
75
70
Threshold
0.6
Overall accuracy
Kappa coefficient
0.4
0.6
Threshold
43
1
ANN
44
109
40.7
110.6
Jan 17
44.7
42.6
40.7
108.8
106.5
106.4
102.0
110.6
48.7
48.7
46.7
46.7
44.7
44.7
42.6
42.6
108.8
106.5
104.4
10.2.0
108.8
106.5
104.4
102.0
108.8
106.5
104.4
102.0
108.8
106.5
104.4
102.0
108.8
106.5
104.4
102.0
108.8
106.5
104.4
102.0
40.7
40.7
110.6
108.8
106.5
104.4
102.0
110.6
48.7
48.7
46.7
46.7
44.7
44.7
42.6
42.6
100
40.7
50
110
No
Snow
42.6
For each vector of five brightness temperatures presented to the input layer, a value equal to one will be assigned in the output layer
if the presented vector correspond to a snow pixel. Otherwise, a value equal to zero will be assigned to the corresponding vector.
19 V
A total of 195 ground stations covering the study area have been identified
for this experiment. The figure below shows the distribution of the ground
stations over the study area (red rectangle).
111
Snow
Effect of the decision threshold on classification
Ground data distribution
112
44.7
19 H
80
113
Jan 16
44.7
The standard deviation values have been
measured during the gridding process to
quantify the vegetation homogeneity for
each pixel.
114
The 5 channels
maps (first
column) represent
the simulation
generated by
simulating the
trained neural
network in the
approach 1,
(threshold = 0.4).
46.7
The other maps (2nd
column) represent
the simulation
results of neural
network trained with
the approach 4 by
using 5 SSM/I
channels plus the
standard deviation
of the NDVI as
input. For this
configuration, the
best performance
was obtained with a
threshold equal to
0.6.
Jan 18
40.7
40.7
110.6
Jan 23
108.8
106.5
104.4
102.0
110.6
48.7
48.7
46.7
46.7
44.7
44.7
42.6
42.6
40.7
40.7
110.6
108.8
106.5
104.4
102.0
110.6
0.4
48.7
48.7
46.7
46.7
44.7
44.7
42.6
42.6
0.2
0.2
0
0
0
0
65
1
1
2
3
4
5
6
7
8
9
10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25
Truth data – snow depth [in]
Truth data
Jan 24
60
Ground snow map
Threshold 6
55
50
%
45
48.7
Only pixels with ground
stations
inside
their
boundaries are considered
for
the
training
and
validation
of
Neural
Network. A total of 165
pixels satisfy this criterion.
46.7
44.7
42.6
95
40
90
35
85
30
80
25
75
20
70
5Tb
NDVI
S-NDVI
StDv
65
60
55
50
Highest Overall accuracy
45
40
Average overall accuracy and standard deviation
Average Kappa coefficient and its standard deviation
35
30
25
20
40.7
110.6
5Tb
108.8
106.5
104.4
102.0
NDVI
S-NDVI
StDv
A total of 100 runs of the neural
network for each approach have
been performed.
The following graphs show for each
approach the average accuracy
and the Kappa coefficient for the
100-runs and their corresponding
standard deviation. These results
have shown that the addition of
the NDVI standard deviation
(homogeneity factor) improves the
snow identification accuracy.
7. Confusion matrices
110.6
As a part of the assessment of the capabilities and limitations of neural networks for snow mapping, the neural network output has
been evaluated with a confusion matrix that was computed for each approach. The overall accuracy and Kappa coefficient were
measured. The following matrices correspond to the net giving the highest accuracy out of 100 runs.
5 Channels
S
NS
S
0.95
0.24
NS
0.05
0.76
Accuracy = 84 Kappa = 68.4
5 Channels + NDVI
S
NS
S
0.96
0.17
NS
0.04
0.83
Accuracy = 90 Kappa = 79.67
40.7
40.7
5 Channels+NDVI+St Dev. NDVI
5 Channels+St Dev. NDVI
S
NS
S
0.88
0.08
NS
0.13
0.92
Accuracy = 90 Kappa = 79.94
S
NS
S
0.80
0.08
NS
0.20
0.92
Accuracy = 86 Kappa = 72.00
Jan 25
108.8
106.5
104.4
102.0
110.6
48.7
48.7
46.7
46.7
44.7
44.7
42.6
42.6
40.7
40.7
110.6
108.6
106.5
104.4
102.0
110.6
Download