Report

advertisement
Twin Cities Urban Lakes Project
Ann Krogman
FR 5262 Section 001
Objectives
In 2002 the Sterner Lab was interested in investigating the biodiversity of lakes in the
urban environment at a landscape level. Fortunately, the Twin Cities in Minnesota were an ideal
area to study urban lakes as the 7-county metro area is dotted with lakes as small as storm water
holding ponds to as large as Lake Minnetonka. It was hypothesized that land-use surrounding
the lakes would impact biodiversity, specifically; increases in impervious surface would yield
lower biodiversity. To conduct the study, 100 lakes within the 7-county Twin Cities Metro Area
were selected via random stratified sampling. The metro area was broken into three concentric
circles. Twenty-five lakes were selected from the inner circle or urban ring, twenty-five lakes
were selected from the second, or suburban ring, and finally fifty lakes were selected from the
outermost, or non-urban ring. The lakes were located by generating XY coordinates and then
going into the field to find the water body nearest to the set of XY coordinates. Selected lakes
ranged in size from .003 to 5,667 hectares. Along with sampling the phyto and zooplankton
within each of the lakes, water chemistry parameters such as phosphorus, nitrogen, dissolved
oxygen, dissolved organic carbon, and chlorophyll a were also sampled.
In order to determine the land-use surrounding each of the lakes, the Sterner lab worked
with Dr. Fei Yuan who was a student of Dr. Marv Bauer in 2002. Dr. Yuan classified the area
surrounding each of the lakes by finding the area of lake and determining the radius of a circle of
equal area to the lake. She then classified land-use around each lake in 0.5, 1, 2, and 3 radius
buffers. Land-use was classified into 12 categories: High intensity urban, low intensity urban,
transportation, crops, grass, conifer, deciduous, water meadow, low shrubs, wetland, and
impervious.
I came into the lab to work on the Twin Cities Urban Lakes Project as the biodiversity
project was called in the fall of 2009. In the spring of 2009 my advisor and I worked together to
analyze the water chemistry data. We used data mining in Statistica to create regression trees in
Statistica in order to determine what land-use type had the most impact on different water quality
parameters. Through this analysis we discovered that percent crops had the biggest impact on
total phosphorus. High percent crop meant high phosphorus and low percent crop meant lower
phosphorus. After percent crop, the next most important land-use on total phosphorus was lawn.
Lawn greater than 49.9 percent meant higher phosphorus and lawn less than 49.9 percent meant
lower phosphorus.
My advisor and I found these results to be very interesting and relevant. In 2004, the
state of Minnesota banned the use of residential phosphorus based fertilizers in the 7-county
Twin Cities Metro Area (TCMA). With this ban in place, we hypothesized that lawn should no
longer be the most important land-use controlling total phosphorus concentrations in urban lakes
with low crops. This hypothesis was generated because we anticipated that without excess
phosphorus being applied to urban lawns, runoff from lawns into urban lakes would not be as
significant a contributor of phosphorus.
Looking at phosphorus in the urban system is very important. In freshwater systems,
phosphorus is not only an essential nutrient for plant growth, it is often the limiting nutrient of
the system (Schindler, 1977; Lee et al, 1978). If phosphorus is the limiting nutrient, when more
phosphorus is added to freshwater systems, increased algal and vegetative growth is expected.
These excess blooms have been documented in numerous studies (Carpenter, 1998; Findlay and
Kasian, 1987). This eutrophication can result in anoxic conditions, fish kills, and reduced
recreational value. In the TCMA alone, 163 lakes are listed on the Clean Water Act 303d list for
impairment due to excess nutrients/eutrophication (MPCA, 2010). The impacts of
eutrophication are an estimated $2.2 billion annually in the United States (Dodds et al, 2009).
Urban systems are considered a major contributor of phosphorus to surface water bodies
(Osborne and Wiley, 1986). The urban environment is very heterogeneous. The varying land
cover classes transport large quantities of phosphorus to receiving water bodies via surface
runoff and sediment transport to surface waters by storm water removal systems (Walsh et al,
2005). The natural flow pathways which are altered by impervious surfaces and man-made
sewers increase the speed with which water and nutrients are removed from the landscape and
transported to receiving water bodies.
To test our hypothesis, during the summer of 2010, I re-sampled the 64 lakes which were
classified as low crop in 2002. I followed the 2002 procedures and collected the same water
chemistry parameters. In order to re-run our regression trees, the land-use surrounding the lakes
needs to be updated for 2010 as well.
The goals of my project were to initially update the land-use in the .5, 1, 2, and 3r buffers
surrounding the 100 lakes in our study. I intended to classify the percent land-use type and make
a chart similar to the one produced by Dr. Yuan in the appendix. I also wanted to make the
classification as accurate as the one done by Dr. Yuan in 2002. Her total percent accuracy was
93.3 and she had a kappa statistic of 91.6. After discussion with the lab TA, the goals of the
study were refined to include classification of the 3r buffer only surrounding the fourteen lakes
in our study located in Anoka County. The lakes in this subset ranged in size from .003 ha to
94.7 ha.
Materials
For this project I used four images. Two of the images were 30 meter resolution, 7 band,
LandSat 5 TM images. These images were obtained from http://glovis.usgs.gov/ and these
images were from July 2002 and July 2010. They are from path 27 and row 29. There was no
cloud cover in the July 2002 image but there was 14% cloud cover present in the July 2010
image. This cloud cover did not block out any important portions of the image but it did make
the image overall hazier by reducing the digital number values as compared to the 2002 image.
The other two images were obtained from www.mngeo.state.mn.us/ and the images were
obtained by the National Agricultural Imagery Program (NAIP). They are 1 meter resolution
with red, green and blue bands. The two images are from summer 2003 and summer 2010.
In addition to these images the guidance for this study was found in Yuan et al 2005 from
Remote Sensing of the Environment. Data was made available from the Sterner lab existing data
set for the Twin Cities Urban Lakes Project. Google Earth was also used to locate lakes on the
images. All of the analysis for the project was done using Erdas Imagine 2010 and ArcMap in
ArcGIS 9.3.1.
Procedures
When I started the project, I believed that the methods for the entire updated
classification would be located in the Yuan et al, 2005 paper. After beginning the project,
however, I realized that the paper was the methodology for a land-use classification of the entire
7-county Twin Cities Metro Area. There were no specific instructions or documentation on how
the full TCMA classification became the percent land-use classifications found in the chart in the
lab data. I had scheduled a meeting to talk with Dr. Yuan for October 29, 2010. Unfortunately,
the death of my uncle on October 26, 2010 necessitated that I reschedule my meeting with her. I
was not able to meet with her again until December 10, 2010. At the meeting she did not
remember many specific details about the classification because it was six years ago. Therefore,
I had to develop the protocol for classifying the areas in the 3r buffer around the lake.
My initial plan of action was to subset all images to include only the areas within the 3
radius buffer zone around each lake, classify all images using unsupervised classification,
determine percent accuracy for all images using NAIP data for reference, compare the percent
accuracies between the 30 m and 1 m resolution, and do change detection between the 30 m 2002
and 2010 images. I needed to classify all of the images prior to comparison between images in
order to account for the 14% haze over the July 2010 LandSat image. I also wanted “to” and
“from” changes so thematic change detection made sense. The goal of doing change detection
and comparing accuracy among the 30 meter and 1 meter resolution images was to get at the
differences in land-cover which were due to changes in land-use and the difference which were
due to classification error.
I chose to make some changes to the protocol used by Yuan et al. Major differences
included using primarily 1 meter data instead of 30 meter data because of the small size of many
of the lakes. The minimum mapping unit of 30 meters was too big to accurately describe the
land surrounding the lakes. I also decided to classify into only five classes. In Yuan et al, landuse classification is broken into 7 classes and in the lab data file 12 classes are present. I felt that
based on ground referencing the sites and the needs of the analysis that lawn, impervious, trees,
wetland, and water are the only classes necessary.
To begin with I added the XY coordinates of the center points of each lake in ArcMap. I
then buffer each center point by 3 times the radius of the lake. The buffer distances used by Dr.
Yuan were obtained from the lab excel file. Then I merged all of the buffered files. I added the
merged buffer file in Erdas and copied them to an area of interest (AOI) layer and extracted them
from the .sid (NAIP 1-meter) county file.
Unfortunately, this did not work for two primary reasons. The first was that the center
points from the files were not at the center points of the lakes. Additionally and even more
importantly, when the AOI file was extracted from the .sid none of the .sid file went with it so I
just had empty circles and no raster after the extract.
To deal with these issues, I loaded all of the county rasters into Erdas in different
viewers. I added an inquire cursor to the 2010 NAIP image and linked all views. I added an
inquire box and used Google Earth to locate all of the lakes. For each lake I recorded the center
XY coordinates in meters. I then used the subset feature to cut out a box about five times the
size of each lake in each view. The coordinates of the inquire box were used for each subset.
The subsets were then opened in ArcMap. In ArcMap, I used the extract by circle feature with
the new center point coordinates and existing buffer radius to extract the area of interest for each
lake. I did this for all the images of the same lake in order so that all subsetted images were
loaded in the viewer simultaneously. This allowed me to overlay the images and check for
geometric correctness. The images overlaid well. After creating circular subsets for each of the
lakes in each of the images I attempted to use the MosaicPro from 2D feature in Erdas to put the
images back together. This worked for the 30m resolution images but did not work for the 1m
resolution images. The 1m resolution imagery was too large to save and took about four hours to
process. I mapped a new network drive on the lab computer and reran the merge of 1 meter data.
After spending the time doing the merge, it turned out the 1 meter merged images were too large
to work with, the unclassified areas between lakes made it difficult to pan from one lake to
another, and that I needed statistics on individual lakes so the classifications needed to be done
on the subsetted images.
For classification, I originally tried a supervised classification on the 1 meter NAIP
imagery but with no IR band to detect water, the classifier was confused and made major errors
classifying water and wetlands. I then decided to do an unsupervised classification with many
classes. I tried different numbers of classes ranging from 100 to 10. In the end it appeared that
60 classes were optimal for the 1m NAIP images. After completing the classification, however, I
realized that the water within the lake and surrounding the lake of interest were classified both as
water. The water in the lake should not be considered part of the classification so it needed to be
classified as a distinct class that could be eliminated in the analysis. To exclude the water I
opened the July 2010 1 m NAIP imagery in ArcMap. Then I digitized the exterior boundary of
the lake using the polygon drawing tool. I then converted the drawing to a feature (shapefile)
and extracted the shapefile from the 1 meter circular subset. I exported the extract as an image
and opened it in Erdas. In a separate viewer, I loaded the digitized shapefile and a new AOI
layer. I copied the shapefile into the AOI layer and saved the AOI. In order to give the lake a
unique spectral signature, I then radiometrically corrected the extracted image by rescaling. I
rescaled from digital number value 254 to 254 and rescaled by the just saved AOI. This
produces a white water body. To add the newly rescaled water body to the 1m circular mosaic, I
used the MosaicPro from 2D feature. For this process I also tried to recode the circular subset
directly from an AOI file but without the extracted subset this process was not successful when
mosaicked.
After I did this for each of the 2010 1m images, I once again did the 60 class
unsupervised classification. Unfortunately, this time the thematic recode did not properly work.
A possible source of the failure was a repeated message to close attribute editor prior to saving
and before reopening for the recode. This message appeared both on the Saint Paul and
Minneapolis campuses. I asked a lab tech to help me understand the error message but he could
not understand the problem either. To get around the error message after the classification, I had
to force save by closing the classification and then reopen for the recode. After I assigned new
values to the classes and clicked apply and then close, when I reopened the image there was
typically only the unclassified class and then one or two random old classes like trees 1 or
wetland 3. Because this recode was now saved, all of my 60 previous classes were lost and the
classification had to be done over again from scratch. By visual interpretation however it
appeared that the 60 class unsupervised classification with the excluded water did a good job of
distinguishing between the five classes. The main problem with this classification was that
approximately two classes for each image were separate classes for the shadows of trees and
buildings.
For the 30 m resolution images the 60 class unsupervised classification was not optimal.
Due to the low pixel numbers in many of the lakes, when I ran the 60 class unsupervised on the
merged image, some lakes would only have 1 pixel classified and would be very difficult to find
and identify for the classification. I found that a 20 class classification worked better by
increasing the pixel values per class. In the lakes with higher numbers of pixels especially, there
were still mixed classes with the 20 class scheme. To reduce some of the confusion and
eliminate the lakes from being part of the classification, I needed to digitize the 30m lakes and
rescale them as well. Unfortunately, the lakes were less defined in the 30m resolution imagery
so digitization was not possible. I tried to mosaic the rescaled 1 meter lakes with the 30 meter
data and that did not work because they contained different numbers of layers.
Results and Discussion
Because of all of the issues with the classifications, I was not able to meet any of my
goals but I did exert a lot of effort and I learned a lot. I also learned that I have a lot more to
learn and do. I need to complete these classifications in order to complete my Masters project so
I will continue to work on this project until it is complete.
Because the classifications were never properly recoded, I was not able to perform an
accuracy assessment. When I do finish the classification process I will do an accuracy
assessment using 50 randomly distributed points in each of the five classes. The fact that I
digitized the water features of my lake will impact my accuracy and hopefully improve the
accuracy. There may be confusion however because for many of the lakes the boundaries
between wetland and lake were quite fuzzy and I used my memory from being on the lakes this
summer and my best judgment to digitize the lake boundary while maintaining what I considered
to be wetland. My accuracy assessment mechanism will differ from the assessment mechanism
used by Yuan et al. For their assessment, they randomly placed approximately 363 polygons
containing at least 100 pixels throughout the metro area and then assessed the accuracy of each
pixel. This form of assessment is not possible for my areas due to the low pixel values in many
of my subsets.
As I have already stated there were many problems with my classifications. The biggest
unresolved problem is the issue with the recode error. I need to figure that out so that I can
continue on with the project. Other problems I encountered were the sheer size of the 1m
resolution images, I was not set up to deal with needing that much space and I lost time and data
because of it, and the finickiness of Erdas, for example, MosaicPro for 2D would only open
about once every four times I attempted to open it.
All of the analysis for my Masters project up to this point has been done using the
classification made in 2002 based on the 30m resolution images. After looking at the images and
the pixels in these buffers especially around the small lakes, I am now questioning whether the
percent land-use classifications that my initial analysis has been done with are believable. This
project has opened my eyes to making sure that I understand the data that I am working with in a
project and not just accepting that it is the proper data for my project. I will see after doing
change detection and comparing the accuracy assessment if my gut instincts about 30m
resolution being too coarse are supported.
References
Journals and Websites
Carpenter S., Caraco, N., Correll, D., Howarth, R., Sharpley, A., and V. Smith. 1998. Nonpoint
pollution of surface waters with phosphorus and nitrogen. Ecological Applications 8:559-568
Dodds, W., Bouska, W., Eitzmann, J., Pilger, T., Pitts, K., Riley, A., Schloesser, J., and D.
Thornbrugh. 2009. Eutrophication of U.S. Freshwaters: Analysis of Potential Economic
Damages 43(1):12-19
Findlay, D. and S. Kasian. 1987. Phytoplankton community responses to nutrient addition in
lake-226, Experimental Lakes Area, Northwest Ontario. Canadian Journal of Fisheries and
Aquatic Sciences 44: 25-46
Lee, G., Rast, W., and R. Jones. 1978. Eutrophication of water bodies: Insights for an age-old
problem. Environmental Science and Technology 12(8):900-908
[MPCA] Minnesota Pollution Control Agency. 2010. Inventory of all Impaired Waters.
Available online at: http://www.pca.state.mn.us/index.php/water/water-types-andprograms/minnesotas-impaired-waters-and-tmdls/assessment-and-listing/303d-list-of-impairedwaters.html?menuid=&redirect=1. Accessed 5 November 2010.
Osborne, L. and M. Wiley. 1986. Empirical Relationships Between Land Use/Cover and Stream
Water Quality in an Agricultural Watershed. Journal of Environmental Management 26:9-27
Schindler, DW. 1977. Evolution of Phosphorus Limitation in Lakes. Science 195: 260-262
Walsh, C., Roy, A., Feminella, J., Cottingham, P., Groffman, P., and R. Morgan. 2005. The
Urban stream syndrome: current knowledge and the search for a cure. J N Am Benthol Soc
24:706-723.
Yuan, F., Sawaya, K., Loeffelholz, B., and M. Bauer. 2005. Land cover classification and change
analysis of the Twin Cities (Minnesota) Metropolitan Area by multitemporal Landsat remote
sensing. Remote Sensing of Environment 98: 317-328
Forest Resource Lab pdfs
5262
Lab 2 – Introduction to Imagine and Image Data
Lab 5 – Unclassified Images
Lab 6 – Classified Images
5131
Lab 1
Lab 2
Image Acquisition
LandSat http://glovis.usgs.gov/
National Agricultural Imagery Program www.mngeo.state.mn.us/
Appendices
Appendix 1. 100 sites selected through the Twin Cities Metro Area
Appendix 2. Decision tree for total phosphorus using 2002 land-use classifications and 2002
summer total phosphorus values
Lake ID
1-25
2-12
2-13
2-16
2-18
N-00
N-02
N-12
N-17
N-25
N-26
N-35
N-41
N-54
High
Low
Intensity Intensity
Low
Urban
Urban
Transportation Crops Grass Conifer Deciduous Water Meadow Shrubs W
25
67
1
0
0
0
0
7
0
0
36
40
12
2
0
4
2
1
1
0
22
29
3
2
0
2
15
0
27
0
37
18
18
0
0
3
16
0
8
0
0
3
3
0
0
39
47
0
0
0
13
7
0
31
0
22
18
0
9
0
0
0
0
0
0
6
63
0
19
0
0
11
0
33
0
0
56
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
9
0
1
8
11
71
0
32
29
3
13
0
0
0
0
23
0
13
25
0
63
0
0
0
0
0
0
4
11
3
17
0
4
16
17
26
0
10
40
7
12
2
1
5
11
11
0
Appendix 3. 2002 Percent land cover classifications
Land cover
class
Agriculture
Forest
Grass
Urban
Water
Wetland
Overall
accuracy
Kappa statistic
2002
Producer’s
95.8
97.3
98.1
89.6
95.9
81.9
User’s
96.4
88.8
78.9
99.8
96.7
84.3
93.2
91.6
Appendix 4. Yuan et al accuracy assessment results for 2002
Appendix 5. A comparison of lake N-12 using 30 meter resolution LandSat data (left) and 1
meter NAIP data (right)
Appendix 6. Digitized lake boundary and merged lake and land-use image.
Appendix 7. Lake 2-12 with a rescaled lake. Classified using a 60 class unsupervised classifier.
Download