A Survey of Sampling Methods for Hyperspectral Remote Sensing: Addressing Bias Induced by Random Sampling

Bibliographic Details
Published in: Remote Sensing vol. 17, no. 8 (2025), p. 1373
Main Author: Decker, Kevin T
Other Authors: Borghetti, Brett J
Published: MDPI AG
Description
Abstract: Identified as early as 2000, the challenges involved in developing and assessing remote sensing models with small datasets remain, with one key issue persisting: the misuse of random sampling to generate training and testing data. This practice often introduces a high degree of correlation between the sets, leading to an overestimation of model generalizability. Despite the early recognition of this problem, few researchers have investigated its nuances or developed effective sampling techniques to address it. Our survey highlights that mitigation strategies to reduce this bias remain underutilized in practice, distorting the interpretation and comparison of results across the field. In this work, we introduce a set of desirable characteristics to evaluate sampling algorithms, with a primary focus on their tendency to induce correlation between training and test data, while also accounting for other relevant factors. Using these characteristics, we survey 146 articles, identify 16 unique sampling algorithms, and evaluate them. Our evaluation reveals two broad archetypes of sampling techniques that effectively mitigate correlation and are suitable for model development.
ISSN: 2072-4292
DOI: 10.3390/rs17081373
Source: Advanced Technologies & Aerospace Database