Research output: Contribution to journal › Article › peer-review
Spatial and temporal classification and prediction of aspen probability in boreal forests using machine learning algorithms. / Трошин, Дмитрий Сергеевич; Файзулин, Максим; Мирин, Денис Моисеевич.
In: Environmental Monitoring and Assessment, Vol. 197, No. 5, 527, 01.05.2025.Research output: Contribution to journal › Article › peer-review
}
TY - JOUR
T1 - Spatial and temporal classification and prediction of aspen probability in boreal forests using machine learning algorithms
AU - Трошин, Дмитрий Сергеевич
AU - Файзулин, Максим
AU - Мирин, Денис Моисеевич
PY - 2025/5/1
Y1 - 2025/5/1
N2 - Mapping and classifying the probability of occurrence of Populus tremula L. (aspen) in boreal forests is a complex task for sustainable forest management and biodiversity conservation. As a key broadleaved species in the taiga region, aspen supports both forestry and local biodiversity habitats. This study introduces a methodology to predict aspen presence using Sentinel- 2 satellite data and machine learning, combining binary classification (presence/absence) with probability estimation. We utilized spectral features (e.g., NDVI, EVI, SAVI) extracted from Sentinel- 2 imagery, employing a logistic regression model to classify aspen occurrence. To assess feature importance, we applied Permutation Importance (PI) and SHAP, enhancing model interpretability and ensuring transparency in identifying influential factors for forest management applications. Results revealed the significant role of spectral features in determining aspen growth probability. SAVI exhibited a strong effect on classification accuracy due to its soil correction capability, while EVI and NDVI proved highly important in summer, reflecting seasonal vegetation dynamics. High EVI values often indicate complex vegetation and conifer biomass, whereas aspen, with its distinct canopy and phenology, shows lower EVI compared to conifers. NDVI, tied to aspen’s photosynthetic activity, remained a reliable indicator in mixed taiga forests. The model achieved an overall accuracy of 94.77% with XGBoost and 95.03% with Random Forest across all seasons, demonstrating robust performance. This reliable aspen distribution data aids forestry planning, such as harvesting, and the algorithm automates inventorying and mapping aspen stands, reducing reliance on labor-intensive ground surveys.
AB - Mapping and classifying the probability of occurrence of Populus tremula L. (aspen) in boreal forests is a complex task for sustainable forest management and biodiversity conservation. As a key broadleaved species in the taiga region, aspen supports both forestry and local biodiversity habitats. This study introduces a methodology to predict aspen presence using Sentinel- 2 satellite data and machine learning, combining binary classification (presence/absence) with probability estimation. We utilized spectral features (e.g., NDVI, EVI, SAVI) extracted from Sentinel- 2 imagery, employing a logistic regression model to classify aspen occurrence. To assess feature importance, we applied Permutation Importance (PI) and SHAP, enhancing model interpretability and ensuring transparency in identifying influential factors for forest management applications. Results revealed the significant role of spectral features in determining aspen growth probability. SAVI exhibited a strong effect on classification accuracy due to its soil correction capability, while EVI and NDVI proved highly important in summer, reflecting seasonal vegetation dynamics. High EVI values often indicate complex vegetation and conifer biomass, whereas aspen, with its distinct canopy and phenology, shows lower EVI compared to conifers. NDVI, tied to aspen’s photosynthetic activity, remained a reliable indicator in mixed taiga forests. The model achieved an overall accuracy of 94.77% with XGBoost and 95.03% with Random Forest across all seasons, demonstrating robust performance. This reliable aspen distribution data aids forestry planning, such as harvesting, and the algorithm automates inventorying and mapping aspen stands, reducing reliance on labor-intensive ground surveys.
KW - Boreal forests
KW - Machine learning
KW - Populus tremula
KW - Prediction accuracy
KW - Space data
KW - Temporal data
UR - https://link.springer.com/article/10.1007/s10661-025-13985-9
UR - https://www.mendeley.com/catalogue/94345c68-d05e-3164-9d54-02e35d2348e5/
U2 - 10.1007/s10661-025-13985-9
DO - 10.1007/s10661-025-13985-9
M3 - Article
VL - 197
JO - Environmental Monitoring and Assessment
JF - Environmental Monitoring and Assessment
SN - 0167-6369
IS - 5
M1 - 527
ER -
ID: 144399598