Impact of training and validation sample selection on classification accuracy and accuracy assessment when using reference polygons in object-based classification

被引:71
|
作者
Zhen, Zhen [1 ]
Quackenbush, Lindi J. [2 ]
Stehman, Stephen V. [1 ]
Zhang, Lianjun [1 ]
机构
[1] SUNY Syracuse, Coll Environm Sci & Forestry, Dept Forest & Nat Resources Management, Syracuse, NY 13210 USA
[2] SUNY Syracuse, Coll Environm Sci & Forestry, Dept Environm & Resource Engn, Syracuse, NY 13210 USA
关键词
LIDAR DATA; IMAGERY; DESIGN;
D O I
10.1080/01431161.2013.810822
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Reference polygons are homogenous areas that aim to provide the best available assessment of ground condition that the user can identify. Delineation of such polygons provides a convenient and efficient approach for researchers to identify training and validation data for supervised classification. However, the spatial dependence of training and validation data should be taken into account when the two data sets are obtained from a common set of reference polygons. We investigate the effect on classification accuracy and the accuracy estimates derived from the validation data when training and validation data are obtained from four selection schemes. The four schemes are composed of two sampling designs (simple random and systematic) and two methods for splitting sample points between training and validation (validation points in separate polygons from training points and validation points and training points split within the same polygons). A supervised object-based classification of the study region was repeated 30 times for each selection scheme. The selection scheme did not impact classification accuracy, but estimates of overall (OA), user's (UA), and producer's (PA) accuracies produced from the validation data overestimated accuracy for the study region by about 10%. The degree of overestimation was slightly greater when the validation sample points were allowed to be in the same polygons as the training data points. These results suggest that accuracy estimates derived from splitting training and validation within a limited set of reference polygons should be regarded with suspicion. To be fully confident in the validity of the accuracy estimates, additional validation sample points selected from the region outside the reference polygons may be needed to augment the validation sample selected from the reference polygons.
引用
收藏
页码:6914 / 6930
页数:17
相关论文
共 50 条
  • [31] An Object-Based Linear Weight Assignment Fusion Scheme to Improve Classification Accuracy Using Landsat and MODIS Data at the Decision Level
    Guan, Xudong
    Liu, Gaohuan
    Huang, Chong
    Liu, Qingsheng
    Wu, Chunsheng
    Jin, Yan
    Li, Yafei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (12): : 6989 - 7002
  • [32] Model selection and assessment for classification using validation
    Jaworski, W
    ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, PT 1, PROCEEDINGS, 2005, 3641 : 481 - 490
  • [33] Impact of Training Set Size on Object-Based Land Cover Classification: A Comparison of Three Classifiers
    Myburgh, Gerhard
    van Niekerk, Adriaan
    INTERNATIONAL JOURNAL OF APPLIED GEOSPATIAL RESEARCH, 2014, 5 (03) : 49 - 67
  • [34] A DYNAMIC HIERARCHICAL FEATURE SELECTION METHOD FOR OBJECT-BASED CLASSIFICATION OF WETLANDS
    Mahdavi, Sahel
    Salehi, Bahram
    Amani, Meisam
    Granger, Jean
    Brisco, Brian
    Huang, Weimin
    2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 570 - 573
  • [35] Accuracy Improvements to Pixel-Based and Object-Based LULC Classification with Auxiliary Datasets from Google Earth Engine
    Qu, Le'an
    Chen, Zhenjie
    Li, Manchun
    Zhi, Junjun
    Wang, Huiming
    REMOTE SENSING, 2021, 13 (03)
  • [36] Auxiliary datasets improve accuracy of object-based land use/land cover classification in heterogeneous savanna landscapes
    Hurskainen, P.
    Adhikari, H.
    Siljander, M.
    Pellikka, P. K. E.
    Hemp, A.
    REMOTE SENSING OF ENVIRONMENT, 2019, 233
  • [37] Impacts of Sample Design for Validation Data on the Accuracy of Feedforward Neural Network Classification
    Foody, Giles M.
    APPLIED SCIENCES-BASEL, 2017, 7 (09):
  • [38] Accuracy Assessment of Object Oriented and Knowledge Base Image Classification using P-Trees
    Seetha, M.
    Sunitha, K. V. N.
    Parameswari, D. V. Lalitha
    Ravi, G.
    2010 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2010), VOL 5, 2010, : 760 - 763
  • [39] Improvement of Hydrological Network Model Using Object-based Classification based from InfoGain Feature Selection
    Bentir, Sarah Alma P.
    Ballado, Alejandro H., Jr.
    Balan, Ariel Kelly D.
    Lazaro, Jose B.
    2017 IEEE 9TH INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (IEEE HNICEM), 2017,
  • [40] Effect of training-sample size and classification difficulty on the accuracy of genomic predictors
    Vlad Popovici
    Weijie Chen
    Brandon D Gallas
    Christos Hatzis
    Weiwei Shi
    Frank W Samuelson
    Yuri Nikolsky
    Marina Tsyganova
    Alex Ishkin
    Tatiana Nikolskaya
    Kenneth R Hess
    Vicente Valero
    Daniel Booser
    Mauro Delorenzi
    Gabriel N Hortobagyi
    Leming Shi
    W Fraser Symmans
    Lajos Pusztai
    Breast Cancer Research, 12