In this paper, we face the problem of collecting training samples for regression problems under an active learning perspective. In particular, we propose various active learning strategies specifically developed for regression approaches based on Gaussian processes (GPs) and support vector machines (SVMs). For GP regression, the first two strategies are based on the idea of adding samples that are dissimilar from the current training samples in terms of covariance measure, while the third one uses a pool of regressors in order to select the samples with the greater disagreements between the different regressors. Finally, the last strategy exploits an intrinsic GP regression outcome to pick up the most difficult and hence interesting samples to label. For SVM regression, the method based on the pool of regressors and two additional strategies based on the selection of the samples distant from the current support vectors in the kernel-induced feature space are proposed. The experimental results obtained on simulated and real data sets show that the proposed strategies exhibit a good capability to select samples that are significant for the regression process, thus opening the way to the active learning approach for remote-sensing regression problems.
Active learning methods for biophysical parameter estimation / Pasolli, Edoardo; Melgani, Farid; Alajlan, Naif; Bazi, Yakoub. - In: IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING. - ISSN 0196-2892. - 50:10 PART2(2012), pp. 4071-4084. [10.1109/TGRS.2012.2187906]
Active learning methods for biophysical parameter estimation
Pasolli, Edoardo;
2012
Abstract
In this paper, we face the problem of collecting training samples for regression problems under an active learning perspective. In particular, we propose various active learning strategies specifically developed for regression approaches based on Gaussian processes (GPs) and support vector machines (SVMs). For GP regression, the first two strategies are based on the idea of adding samples that are dissimilar from the current training samples in terms of covariance measure, while the third one uses a pool of regressors in order to select the samples with the greater disagreements between the different regressors. Finally, the last strategy exploits an intrinsic GP regression outcome to pick up the most difficult and hence interesting samples to label. For SVM regression, the method based on the pool of regressors and two additional strategies based on the selection of the samples distant from the current support vectors in the kernel-induced feature space are proposed. The experimental results obtained on simulated and real data sets show that the proposed strategies exhibit a good capability to select samples that are significant for the regression process, thus opening the way to the active learning approach for remote-sensing regression problems.File | Dimensione | Formato | |
---|---|---|---|
Pasolli_2012.pdf
solo utenti autorizzati
Tipologia:
Documento in Post-print
Licenza:
Accesso privato/ristretto
Dimensione
1.43 MB
Formato
Adobe PDF
|
1.43 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.