these early results were obtained using one year’s set of fia field data, distance: euclidean...

1
O verallaccuracies for forestcover type classification,722 subplots 0.3 0.35 0.4 0.45 0.5 0.55 0.6 0.65 0.7 0.75 0.8 1 2 3 4 5 6 7 8 9 10 Num berofnearestneighbors used Overallaccuracy Upland,March-April-May-July images Lowland,March-April-May-July images Upland,M arch-A pril-M ay im ages Lowland,M arch-A pril-M ay im ages Upland,own clusterexcluded,4 dates Lowland,own clusterexcluded,4 dates Upland,own clusterexcluded,3 dates Lowland,own clusterexcluded,3 dates C ross-validation R M S E's for volum e,696 subplots 50 55 60 65 70 75 80 85 90 95 1 2 3 4 5 6 7 8 9 10 Num ber ofnearestneighbors used RM SE,m 3/ha S tandard deviation ofdata M ean ofdata RM S E ,allsubplots used RM S E,ow n clusterexcluded RM S E ,subplots closerthan 40 m eters notused RM S E,sim ple location optim ization used These early results were obtained using one year’s set of FIA field data, DISTANCE: EUCLIDEAN WEIGHTING FUNCTION: NO WEIGHTS. NUMBER OF PLOTS: 696 NUMBER OF BANDS: 34 NUMBER OF NEIGHBORS: 1 VOLUME CROSS-VALIDATION RMSE (ALL) = 81.1354 m3 ,126.495 % VOLUME ALL MIN AND MAX =0 and 384.71 VOLUME ALL MEAN = 64.1413 VOLUME STANDARD DEVIATION (ALL) = 70.0947 VOLUME BIAS (ALL) = -1.40832 Class\Ref 0-40 41-80 8 1-120 121-160 >160 User’s acc 0-40 217 54 35 12 23 0.6363 41-80 47 35 17 16 10 0.28 81-120 40 14 24 12 15 0.2285 121-160 15 20 13 7 8 0.1111 >160 17 10 9 11 15 0.2419 Prod.acc 0.645833 0.263158 0.244898 0.12069 0.211268 -------------------------------------------------------------------- VOLUME CROSS-VALIDATION RMSE (LOWLAND) = 66.624 m3 ,154.966 % NUMBER OF PLOTS (LOWLAND) = 164 VOLUME LOWLAND MIN AND MAX = 0 and 254.07 VOLUME LOWLAND MEAN = 42.9926 VOLUME STANDARD DEVIATION (LOWLAND) = 53.712 VOLUME BIAS (LOWLAND) = -2.87561 Class\Ref 0-40 41-80 8 1-120 121-160 >160 User’s acc 0-40 82 8 12 3 5 0.7454 41-80 7 7 3 0 1 0.3888 81-120 9 4 6 3 1 0.2608 121-160 4 0 2 0 0 0 >160 4 1 1 0 1 0.1666 Prod.acc 0.773585 0.350 0.250 0.000 0.125 -------------------------------------------------------------------- VOLUME CROSS-VALIDATION RMSE (UPLAND) = 86.8592 m3 ,122.924 % NUMBER OF PLOTS (UPLAND) = 532 VOLUME UPLAND MIN AND MAX = 0 and 384.71 VOLUME UPLAND MEAN = 70.6608 VOLUME STANDARD DEVIATION (UPLAND) = 73.2422 VOLUME BIAS (UPLAND) = -0.0196241 Class\Ref 0-40 41-80 8 1-120 121-160 >160 User’s acc 0-40 134 42 23 9 20 0.5877 41-80 42 29 13 16 7 0.271 81-120 27 14 20 9 13 0.2409 121-160 12 18 11 7 9 0.1228 >160 15 10 7 11 14 0.2456 Prod.acc 0.582609 0.256637 0.27027 0.134615 0.222222 Wall-to-wall extension of Forest Inventory and Wall-to-wall extension of Forest Inventory and Analysis: Analysis: K-nearest neighbor estimation and classification 3. Early results concerning accuracy, imagery and data Reija Haapanen, Alan Ek, Marvin Bauer, Kali Sawaya -- Department of Forest Resources, University of Minnesota • The new FIA 4-subplot data has some special features due to the close proximity of subplots; in forest cover type classification, the nearest neighbors tend to be in the same cluster. When use of neighboring subplots is prohibited, the accuracies fall to 30% • Of the satellite imagery tested (March, April, May and July combined), the first three dates tend to carry enough information for cover type classification • Upland-lowland stratification improves forest cover type classification Example of program outputs showing volume accuracies only 696 subplots for our image. The volume accuracies are quite low partly due to the small number of plots. However, by using simple optimization of the field plot locations, the errors drop considerably. Field measurements Large-area statistics & error estimations Preprocessed image & field data combined Preprocessed field data Statistics for small area (e.g., county) Thematic forest maps Image analysis: estimation & classification Preprocessed image data Original satellite image Preprocessing New FIA 4-subplot field cluster design Spectral band 1 of Landsat 7 ETM+ satellite image A georeferenced Landsat 7 ETM+ image, clouds excluded. The black dots represent FIA field plot clusters The Minnesota land use/land cover classification Cover type classification Post-processing kNN’ kNN’ 2. Flow of the kNN process and materials used • The kNN estimation and classification method is a simple, but very powerful way to extend a wide range of field data to landscapes • The method assigns a pixel the field data of the most similar pixel, for which actual field plot data exists. The similarity is defined in terms of the feature space (e.g., Euclidean distance between satellite image pixels) • kNN retains the full range of variability inherent in the sample Forest/ non-forest stratification Research funding provided by: The USDA Forest Service Forest Inventory and Analysis program (FIA) has been conducting state-by-state and ultimately nationwide forest inventories for decades. Yet these field plot based inventories have not been able to produce precise county and local estimates and useful operational maps. Additionally, traditional satellite-based forest classifications have been unable to match detailed forest type identification with ground based survey definitions to provide for interpolation and extrapolation of FIA data. Precise classification has been limited to general or aggregate classes of little use for improving inventory precision and providing truly useful operational forest maps. The k-nearest neighbors approach (kNN), adapted from the Finnish Forest Research Institute, offers a means of applying satellite and GIS data so as to impute forest cover type, timber volume, and other FIA data from field plots surrounding large or small areas on the basis of the spectral characterization of neighboring pixels. To the extent that such post- stratification can be successfully applied, the 1. Rationale Photo courtesy of Dave Hansen

Upload: cordelia-hood

Post on 24-Dec-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: These early results were obtained using one year’s set of FIA field data, DISTANCE: EUCLIDEAN WEIGHTING FUNCTION: NO WEIGHTS. NUMBER OF PLOTS: 696 NUMBER

Overall accuracies for forest cover type classification, 722 subplots

0.3

0.35

0.4

0.45

0.5

0.55

0.6

0.65

0.7

0.75

0.8

1 2 3 4 5 6 7 8 9 10

Number of nearest neighbors used

Ove

rall

accu

racy

Upland, March-April-May-Julyimages

Lowland, March-April-May-Julyimages

Upland, March-April-May images

Lowland, March-April-May images

Upland, own cluster excluded, 4dates

Lowland, own cluster excluded, 4dates

Upland, own cluster excluded, 3dates

Lowland, own cluster excluded, 3dates

Cross-validation RMSE's for volume, 696 subplots

50

55

60

65

70

75

80

85

90

95

1 2 3 4 5 6 7 8 9 10

Number of nearest neighbors used

RM

SE

, m

3/h

a

Standard deviation of data

Mean of data

RMSE, all subplots used

RMSE, own cluster excluded

RMSE, subplots closer than 40meters not used

RMSE, simple locationoptimization used

These early results were obtained using one year’s set of FIA field data,

DISTANCE: EUCLIDEANWEIGHTING FUNCTION: NO WEIGHTS.NUMBER OF PLOTS: 696NUMBER OF BANDS: 34NUMBER OF NEIGHBORS: 1VOLUME CROSS-VALIDATION RMSE (ALL) = 81.1354 m3 ,126.495 %VOLUME ALL MIN AND MAX =0 and 384.71VOLUME ALL MEAN = 64.1413VOLUME STANDARD DEVIATION (ALL) = 70.0947VOLUME BIAS (ALL) = -1.40832

Class\Ref 0-40 41-80 8 1-120 121-160 >160 User’s acc 0-40 217 54 35 12 23 0.6363 41-80 47 35 17 16 10 0.28 81-120 40 14 24 12 15 0.2285121-160 15 20 13 7 8 0.1111 >160 17 10 9 11 15 0.2419Prod.acc 0.645833 0.263158 0.244898 0.12069 0.211268-------------------------------------------------------------------- VOLUME CROSS-VALIDATION RMSE (LOWLAND) = 66.624 m3 ,154.966 %NUMBER OF PLOTS (LOWLAND) = 164VOLUME LOWLAND MIN AND MAX = 0 and 254.07VOLUME LOWLAND MEAN = 42.9926VOLUME STANDARD DEVIATION (LOWLAND) = 53.712VOLUME BIAS (LOWLAND) = -2.87561

Class\Ref 0-40 41-80 8 1-120 121-160 >160 User’s acc0-40 82 8 12 3 5 0.745441-80 7 7 3 0 1 0.388881-120 9 4 6 3 1 0.2608121-160 4 0 2 0 0 0>160 4 1 1 0 1 0.1666Prod.acc 0.773585 0.350 0.250 0.000 0.125--------------------------------------------------------------------VOLUME CROSS-VALIDATION RMSE (UPLAND) = 86.8592 m3 ,122.924 %NUMBER OF PLOTS (UPLAND) = 532VOLUME UPLAND MIN AND MAX = 0 and 384.71VOLUME UPLAND MEAN = 70.6608VOLUME STANDARD DEVIATION (UPLAND) = 73.2422VOLUME BIAS (UPLAND) = -0.0196241

Class\Ref 0-40 41-80 8 1-120 121-160 >160 User’s acc0-40 134 42 23 9 20 0.587741-80 42 29 13 16 7 0.27181-120 27 14 20 9 13 0.2409121-160 12 18 11 7 9 0.1228>160 15 10 7 11 14  0.2456Prod.acc 0.582609 0.256637 0.27027 0.134615 0.222222

Wall-to-wall extension of Forest Inventory and Analysis:Wall-to-wall extension of Forest Inventory and Analysis:K-nearest neighbor estimation and classification

3. Early results concerning accuracy, imagery and data

Reija Haapanen, Alan Ek, Marvin Bauer, Kali Sawaya -- Department of Forest Resources, University of Minnesota

• The new FIA 4-subplot data has some special features due to the close proximity of subplots; in forest cover type classification, the nearest neighbors tend to be in the same cluster. When use of neighboring subplots is prohibited, the accuracies fall to 30%

• Of the satellite imagery tested (March, April, May and July combined), the first three dates tend to carry enough information for cover type classification• Upland-lowland stratification improves forest cover type classification

Example of program outputs showing volume accuracies

only 696 subplots for our image. The volume accuracies are quite low partly due to the small number of plots. However, by using simple optimization of the field plot locations, the errors drop considerably.

Field measurements

Large-area statistics &

error estimations

Preprocessed image & field data combined

Preprocessed field data

Statistics for small area

(e.g., county)

Thematicforest maps

Image analysis:estimation & classification

Preprocessed image data

Original satellite image

Preprocessing

New FIA 4-subplot field cluster design

Spectral band 1 of Landsat 7 ETM+satellite image

A georeferenced Landsat 7 ETM+ image, cloudsexcluded. The black dots represent FIA field plot clusters

The Minnesota land use/land cover classification

Cover type classification

Post-processing

‘‘kNN’kNN’

2. Flow of the kNN process and materials used

• The kNN estimation and classification method is a simple, but very powerful way to extend a wide range of field data to landscapes

• The method assigns a pixel the field data of the most similar pixel, for which actual field plot data exists. The similarity is defined in terms of the feature space (e.g., Euclidean distance between satellite image pixels)

• kNN retains the full range of variability inherent in the sample

Forest/ non-forest

stratification

Research funding provided by:

The USDA Forest Service Forest Inventory and Analysis program (FIA) has been conducting state-by-state and ultimately nationwide forest inventories for decades.  Yet these field plot based inventories have not been able to produce precise county and local estimates and useful operational maps.  Additionally, traditional satellite-based forest classifications have been unable to match detailed forest type identification with ground based survey definitions to provide for interpolation and extrapolation of FIA data.  Precise classification has been limited to general or aggregate classes of little use for improving inventory precision and providing truly useful operational forest maps.  

The k-nearest neighbors approach (kNN), adapted from the Finnish Forest Research Institute, offers a means of applying satellite and GIS data so as to impute forest cover type, timber volume, and other FIA data from field plots surrounding large or small areas on the basis of the spectral characterization of neighboring pixels.  To the extent that such post-stratification can be successfully applied, the method offers agencies and industry a) greater precision at survey unit to local levels of estimation, and b) detailed inventory attributes within type polygons over large areas.   In fact, our researchers can map virtually any FIA plot attribute.  The method produces estimates and maps according to the actual inventory classifications and definitions rather than an abstract set that must later be reinterpreted.

1. Rationale

Photo courtesy of Dave Hansen