Chinese Antarctic Center of Surveying and Mapping, Wuhan University, Wuhan, 430070, China.
State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, 430070, China.
Environ Sci Pollut Res Int. 2022 May;29(22):33205-33217. doi: 10.1007/s11356-021-17513-3. Epub 2022 Jan 13.
Correlations between socioeconomic factors and poverty in regression models do not reflect actual relationships, especially when data exhibit patterns of spatial heterogeneity. Spatial regression models can estimate the relationships between socioeconomic factors and poverty in defined geographical areas, explaining the imbalanced distribution of poverty, but the relationships between these factors and poverty are not always linear however, and conventional simple linear local regression models do not accurately capture these nonlinear relationships. To fill this gap, we used a local regression method, geographically weighted random forest regression (GW-RFR), that integrates a spatial weight matrix (SWM) and random forest (RF). The GW-RFR evaluates the spatial variations in the nonlinear relationships between variables. A county-level poverty data set of China was employed to estimate the performance of the GW-RFR against the random forest (RF). In this poverty application, the value of [Formula: see text] was 0.128 higher than that of the RF, the NRMSE value was 1.6% lower than the RF, and the MAE value was 0.295 lower than the RF. These results showed that the relationship between poverty factors and poverty varies with space at the county level in China, and the GW-RFR was suitable for dealing with nonlinear relationships in local regression analysis.
回归模型中社会经济因素与贫困之间的相关性并不能反映实际关系,特别是在数据呈现出空间异质性模式时。空间回归模型可以估计特定地理区域内社会经济因素与贫困之间的关系,解释贫困的不平衡分布,但这些因素与贫困之间的关系并不总是线性的,然而,传统的简单线性局部回归模型并不能准确捕捉这些非线性关系。为了填补这一空白,我们使用了一种局部回归方法,即地理加权随机森林回归(GW-RFR),它集成了空间权重矩阵(SWM)和随机森林(RF)。GW-RFR 评估了变量之间非线性关系的空间变化。我们采用了中国的一个县级贫困数据集来评估 GW-RFR 对随机森林(RF)的性能。在这个贫困应用中,[公式:见文本]的值比 RF 高 0.128,NRMSE 值比 RF 低 1.6%,MAE 值比 RF 低 0.295。这些结果表明,中国县级贫困因素与贫困之间的关系存在空间差异,GW-RFR 适合处理局部回归分析中的非线性关系。