Standard

Spatial and temporal classification and prediction of aspen probability in boreal forests using machine learning algorithms. / Трошин, Дмитрий Сергеевич; Файзулин, Максим; Мирин, Денис Моисеевич.

In: Environmental Monitoring and Assessment, Vol. 197, No. 5, 527, 01.05.2025.

Research output: Contribution to journalArticlepeer-review

Harvard

APA

Vancouver

Author

BibTeX

@article{5ea74b98d6a740fc998ae9a2899877a3,
title = "Spatial and temporal classification and prediction of aspen probability in boreal forests using machine learning algorithms",
abstract = "Mapping and classifying the probability of occurrence of Populus tremula L. (aspen) in boreal forests is a complex task for sustainable forest management and biodiversity conservation. As a key broadleaved species in the taiga region, aspen supports both forestry and local biodiversity habitats. This study introduces a methodology to predict aspen presence using Sentinel- 2 satellite data and machine learning, combining binary classification (presence/absence) with probability estimation. We utilized spectral features (e.g., NDVI, EVI, SAVI) extracted from Sentinel- 2 imagery, employing a logistic regression model to classify aspen occurrence. To assess feature importance, we applied Permutation Importance (PI) and SHAP, enhancing model interpretability and ensuring transparency in identifying influential factors for forest management applications. Results revealed the significant role of spectral features in determining aspen growth probability. SAVI exhibited a strong effect on classification accuracy due to its soil correction capability, while EVI and NDVI proved highly important in summer, reflecting seasonal vegetation dynamics. High EVI values often indicate complex vegetation and conifer biomass, whereas aspen, with its distinct canopy and phenology, shows lower EVI compared to conifers. NDVI, tied to aspen{\textquoteright}s photosynthetic activity, remained a reliable indicator in mixed taiga forests. The model achieved an overall accuracy of 94.77% with XGBoost and 95.03% with Random Forest across all seasons, demonstrating robust performance. This reliable aspen distribution data aids forestry planning, such as harvesting, and the algorithm automates inventorying and mapping aspen stands, reducing reliance on labor-intensive ground surveys.",
keywords = "Boreal forests, Machine learning, Populus tremula, Prediction accuracy, Space data, Temporal data",
author = "Трошин, {Дмитрий Сергеевич} and Максим Файзулин and Мирин, {Денис Моисеевич}",
year = "2025",
month = may,
day = "1",
doi = "10.1007/s10661-025-13985-9",
language = "English",
volume = "197",
journal = "Environmental Monitoring and Assessment",
issn = "0167-6369",
publisher = "Springer Nature",
number = "5",

}

RIS

TY - JOUR

T1 - Spatial and temporal classification and prediction of aspen probability in boreal forests using machine learning algorithms

AU - Трошин, Дмитрий Сергеевич

AU - Файзулин, Максим

AU - Мирин, Денис Моисеевич

PY - 2025/5/1

Y1 - 2025/5/1

N2 - Mapping and classifying the probability of occurrence of Populus tremula L. (aspen) in boreal forests is a complex task for sustainable forest management and biodiversity conservation. As a key broadleaved species in the taiga region, aspen supports both forestry and local biodiversity habitats. This study introduces a methodology to predict aspen presence using Sentinel- 2 satellite data and machine learning, combining binary classification (presence/absence) with probability estimation. We utilized spectral features (e.g., NDVI, EVI, SAVI) extracted from Sentinel- 2 imagery, employing a logistic regression model to classify aspen occurrence. To assess feature importance, we applied Permutation Importance (PI) and SHAP, enhancing model interpretability and ensuring transparency in identifying influential factors for forest management applications. Results revealed the significant role of spectral features in determining aspen growth probability. SAVI exhibited a strong effect on classification accuracy due to its soil correction capability, while EVI and NDVI proved highly important in summer, reflecting seasonal vegetation dynamics. High EVI values often indicate complex vegetation and conifer biomass, whereas aspen, with its distinct canopy and phenology, shows lower EVI compared to conifers. NDVI, tied to aspen’s photosynthetic activity, remained a reliable indicator in mixed taiga forests. The model achieved an overall accuracy of 94.77% with XGBoost and 95.03% with Random Forest across all seasons, demonstrating robust performance. This reliable aspen distribution data aids forestry planning, such as harvesting, and the algorithm automates inventorying and mapping aspen stands, reducing reliance on labor-intensive ground surveys.

AB - Mapping and classifying the probability of occurrence of Populus tremula L. (aspen) in boreal forests is a complex task for sustainable forest management and biodiversity conservation. As a key broadleaved species in the taiga region, aspen supports both forestry and local biodiversity habitats. This study introduces a methodology to predict aspen presence using Sentinel- 2 satellite data and machine learning, combining binary classification (presence/absence) with probability estimation. We utilized spectral features (e.g., NDVI, EVI, SAVI) extracted from Sentinel- 2 imagery, employing a logistic regression model to classify aspen occurrence. To assess feature importance, we applied Permutation Importance (PI) and SHAP, enhancing model interpretability and ensuring transparency in identifying influential factors for forest management applications. Results revealed the significant role of spectral features in determining aspen growth probability. SAVI exhibited a strong effect on classification accuracy due to its soil correction capability, while EVI and NDVI proved highly important in summer, reflecting seasonal vegetation dynamics. High EVI values often indicate complex vegetation and conifer biomass, whereas aspen, with its distinct canopy and phenology, shows lower EVI compared to conifers. NDVI, tied to aspen’s photosynthetic activity, remained a reliable indicator in mixed taiga forests. The model achieved an overall accuracy of 94.77% with XGBoost and 95.03% with Random Forest across all seasons, demonstrating robust performance. This reliable aspen distribution data aids forestry planning, such as harvesting, and the algorithm automates inventorying and mapping aspen stands, reducing reliance on labor-intensive ground surveys.

KW - Boreal forests

KW - Machine learning

KW - Populus tremula

KW - Prediction accuracy

KW - Space data

KW - Temporal data

UR - https://link.springer.com/article/10.1007/s10661-025-13985-9

UR - https://www.mendeley.com/catalogue/94345c68-d05e-3164-9d54-02e35d2348e5/

U2 - 10.1007/s10661-025-13985-9

DO - 10.1007/s10661-025-13985-9

M3 - Article

VL - 197

JO - Environmental Monitoring and Assessment

JF - Environmental Monitoring and Assessment

SN - 0167-6369

IS - 5

M1 - 527

ER -

ID: 144399598