Accurate monitoring of crop phenology is essential for precision agriculture and yield forecasting. However, satellite-derived time series often suffer from inherent noise, such as residual atmospheric effects and mixed pixels, as well as a frequent lack of ground-truth data in agriculture. In response, this study proposes an algorithm to define the type of spectral signatures for the principal phenological stages of crops, using them as the foundation for training supervised machine learning classification models. The algorithm was developed using Fuzzy C-Means (FCM) clustering to identify the spectral signature reference groups in winter wheat across the Burgos region (Spain) during the 2020 and 2021 growing seasons. To enhance cluster independence and biological coherence, a multi-step filtering process was implemented, including spectral purity (membership degree, SAM, and SAMder) and temporal coherence filters. The filtered and labeled dataset (80% original Burgos dataset) was used to train supervised classification models (KNN and XGBoost). The models’ reliability was verified through three wheat tests (remaining 20%), labeled using other clustering techniques, and an independent barley dataset from diverse geographic locations (Valladolid and Soria). The filtering process significantly improved cluster stability by removing outliers and transition spectral signatures. The supervised models demonstrated exceptional performance; the KNN model slightly outperformed XGB, achieving a mean Accuracy of 0.977, a Kappa of 0.967, and an F1-score of 0.977 in the wheat external test. Furthermore, the model showed, when applied to barley, that its phenological spectral signatures are equivalent in shape to those of wheat, with an Accuracy of 0.965 and an F1-score of 0.974. In addition, it was verified that the type spectral signatures remain the same regardless of the location. This study presents a robust classification tool capable of labeling four key phenological stages (tillering, stem elongation, ripening, and senescence) without ground truth. By effectively removing inherent satellite noise, the proposed methodology produces organized, cleaned datasets. This structured foundation is critical for future research integrating spectral signatures with harvester data to develop high-precision yield prediction models.
Building similarity graph...
Analyzing shared references across papers
Loading...
Arizo-García et al. (Tue,) studied this question.
www.synapsesocial.com/papers/69d894526c1944d70ce0546e — DOI: https://doi.org/10.3390/app16073598
Patricia Arizo-García
Sergio Castiñeira-Ibáñez
Beatriz Ricarte
Applied Sciences
Universitat Politècnica de València
Building similarity graph...
Analyzing shared references across papers
Loading...