Impact of training and validation sample selection on classification accuracy and accuracy assessment when using reference polygons in object-based classification

Cited by: 71

Authors:

Zhen, Zhen [1]

Quackenbush, Lindi J. [2]

Stehman, Stephen V. [1]

Zhang, Lianjun [1]

Affiliations:

[1] SUNY Syracuse, Coll Environm Sci & Forestry, Dept Forest & Nat Resources Management, Syracuse, NY 13210 USA

[2] SUNY Syracuse, Coll Environm Sci & Forestry, Dept Environm & Resource Engn, Syracuse, NY 13210 USA

Source:

INTERNATIONAL JOURNAL OF REMOTE SENSING | 2013 / Vol. 34 / Issue 19

Keywords:

LIDAR DATA; IMAGERY; DESIGN;

DOI:

10.1080/01431161.2013.810822

Chinese Library Classification (CLC) number:

TP7 [Remote sensing technology];

Discipline classification codes:

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

Abstract:

Reference polygons are homogeneous areas that aim to provide the best available assessment of ground condition that the user can identify. Delineation of such polygons provides a convenient and efficient approach for researchers to identify training and validation data for supervised classification. However, the spatial dependence of training and validation data should be taken into account when the two data sets are obtained from a common set of reference polygons. We investigate the effect on classification accuracy and the accuracy estimates derived from the validation data when training and validation data are obtained from four selection schemes. The four schemes combine two sampling designs (simple random and systematic) with two methods for splitting sample points between training and validation: validation points placed in separate polygons from training points, or validation and training points split within the same polygons. A supervised object-based classification of the study region was repeated 30 times for each selection scheme. The selection scheme did not impact classification accuracy, but estimates of overall (OA), user's (UA), and producer's (PA) accuracies produced from the validation data overestimated accuracy for the study region by about 10%. The degree of overestimation was slightly greater when the validation sample points were allowed to be in the same polygons as the training data points. These results suggest that accuracy estimates derived from splitting training and validation within a limited set of reference polygons should be regarded with suspicion. To be fully confident in the validity of the accuracy estimates, additional validation sample points selected from the region outside the reference polygons may be needed to augment the validation sample selected from the reference polygons.
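The two splitting methods named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the point records (dicts with a `polygon` key), the function names, the 50/50 split fraction, and the toy data are all assumptions made for illustration. The sketch covers only the training/validation split and assumes the sample points have already been selected within the reference polygons by some design (simple random or systematic).

```python
import random
from collections import defaultdict


def split_by_polygon(points, train_fraction=0.5, seed=0):
    """Separate-polygon split: all points of a polygon go to one set only."""
    rng = random.Random(seed)
    polygon_ids = sorted({p["polygon"] for p in points})
    rng.shuffle(polygon_ids)
    n_train = round(len(polygon_ids) * train_fraction)
    train_polygons = set(polygon_ids[:n_train])
    train = [p for p in points if p["polygon"] in train_polygons]
    valid = [p for p in points if p["polygon"] not in train_polygons]
    return train, valid


def split_within_polygons(points, train_fraction=0.5, seed=0):
    """Same-polygon split: each polygon contributes points to both sets."""
    rng = random.Random(seed)
    by_polygon = defaultdict(list)
    for p in points:
        by_polygon[p["polygon"]].append(p)
    train, valid = [], []
    for polygon_id in sorted(by_polygon):
        members = list(by_polygon[polygon_id])
        rng.shuffle(members)
        n_train = round(len(members) * train_fraction)
        train.extend(members[:n_train])
        valid.extend(members[n_train:])
    return train, valid


# Hypothetical toy data: 6 reference polygons with 4 sample points each.
points = [{"id": i, "polygon": i // 4} for i in range(24)]

train_a, valid_a = split_by_polygon(points, seed=1)
train_b, valid_b = split_within_polygons(points, seed=1)
```

In the first split no polygon appears in both sets, whereas in the second every polygon supplies both training and validation points, which is the situation the abstract associates with slightly greater overestimation of accuracy.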


Pages: 6914-6930

Number of pages: 17
