基于机器学习的广域净生态系统碳交换分析与模拟

Analysis and Simulation of Widespread Net Ecosystem Exchange Utilizing Machine Learning

  • 摘要: 净生态系统碳交换量(NEE)是生态系统碳收支的关键指标,对理解碳循环机制和制定气候变化应对策略至关重要。该研究基于2003-2010年7个广域分布的森林和草地生态系统碳水通量站点数据,运用随机森林(RF)、梯度提升决策树(GBDT)、极度提升树(XGBoost)、LightGBM模型及线性回归5种机器学习模型,结合Pearson相关性分析和地理探测器,系统分析了影响NEE年际与季节变化的关键环境驱动因素。研究旨在评估这些模型在预测NEE年际和季节变化方面的适用性,并为模型优化提供理论支持。结果表明: 2003-2010年间,总体NEE碳汇能力显著下降(Slop=17.14,P<0.05),仅西双版纳站和海北站碳汇能力呈增强趋势(SlopXSBN=-2.61和Slop HBGCT=-5.64);季节上,夏季NEE与其他季节有显著差异,而春季草地生态系统的碳汇能力呈显著增强趋势(Slop=-0.74,P<0.05)。年际变化的关键驱动因子包括大气压、土壤含水量、辐射和风速,而季节变化的主要驱动因子是温度、土壤含水量和土壤温度。模型比较中,RF模型预测NEE年际变化的准确度和精确度表现最佳(R2=0.94);季节NEE预测中,RF同样具有较高的预测精确度,但LightGBM模型和XGBoost模型在春季和冬季的预测中最为准确。该研究通过结合地理探测器的空间统计和机器学习模型的关键因子识别技术,为揭示NEE时空变化模式及其驱动机制提供了新的视角和方法。

     

    Abstract: The Net Ecosystem Exchange (NEE) serves as a critical metric for assessing ecosystem carbon budgets, offering valuable insights into carbon cycling mechanisms and informing strategies for climate change mitigation. This study leverages ground-based ChinaFLUX data collected from seven widespread forest and grassland ecosystems between 2003 and 2010. Five machine learning models-Random Forest (RF), Gradient Boosting Decision Tree (GBDT), Extreme Gradient Boosting (XGBoost), LightGBM model and linear regression, were employed in conjunction with Pearson correlation analysis and geographic detector analysis to systematically investigate the key environmental drivers influencing the interannual and seasonal variations of NEE. The primary objectives of this study are to evaluate the applicability of these models in predicting NEE variations across different temporal scales and to provide theoretical foundations for model optimization. The findings reveal a significant decline in the overall ecosystem carbon sink capacity, as indicated by NEE, from 2003 to 2010 (Slope=17.14, P<0.05). However, the carbon sink capacities at the Xishuangbanna and Haibei sites exhibited an upward trend (SlopeXSBN=-2.61 and Slope HBGCT=-5.64). Seasonal analysis highlighted pronounced disparities in NEE during summer compared to other seasons. Notably, grassland ecosystems demonstrated a marked increase in carbon sequestration capacity during spring, with a slope of -0.74 and a P-value less than 0.05. The primary drivers of interannual variability were identified as atmospheric pressure, soil moisture, radiation, and wind speed, while seasonal fluctuations were predominantly influenced by temperature, soil moisture and soil temperature. Among the models evaluated, the RF model demonstrated the highest accuracy and precision in predicting interannual NEE (R2=0.94). For seasonal predictions, the RF model also exhibited strong performance, while the LightGBM model and XGBoost model were particularly accurate for spring and winter predictions, respectively. By integrating spatial statistics from the geographical detector with key factor identification techniques from machine learning models, this study offers a novel perspective and methodological framework for elucidating the spatiotemporal patterns of NEE and its underlying driving mechanisms.

     

/

返回文章
返回