热带气象学报2026,Vol.42Issue(1):132-152,21.DOI:10.16032/j.issn.1004-4965.2026.011
基于因果分析的机器学习模型在空气质量预报中的构建及评估——以广州市为例
Establishment and Evaluation of Machine Learning Models Based on Causal Analysis in Air Quality Forecasting:A Case Study of Guangzhou
摘要
Abstract
To address the increasing challenge of persistent pollution and rising ozone(O₃)levels in Guangzhou,this study developed advanced air quality forecasting models using machine learning techniques.Based on environmental monitoring and meteorological observation data,the Liang-Kleeman information flow was used to conduct causal analysis on factors affecting the concentrations of atmospheric pollutants such as CO,NO2,O3,PM2.5,PM10,and SO2.Using the Random Forest(RF),Extreme Gradient Boosting(XGBoost),and Long Short-Term Memory Neural Network(LSTM)algorithms for integrated modeling,five distinct pollutant concentration forecasting models(RF,XG,LSTM,and the integrated models MIX1 and MIX2)were constructed.These models forecast pollutant concentrations,which were then used to calculate the Air Quality Index(AQI)and identify the primary pollutant.The results show that the integrated models(MIX1 and MIX2)generally outperform the single ones(RF,XGBoost,and LSTM models).For pollutant concentration forecasting,the MIX1 model was optimal for CO,NO2,and O3,while the MIX2 model performed best for PM10,PM2.5,and SO2.For air quality forecasting,the MIX2 model was superior for 1-2 day forecasts,whereas the MIX1 model was optimal for 3-7 day forecasts.The accuracy rates for primary pollutant by the MIX1 and MIX2 models for 1-7 day forecast were 71.26%-83.33%and 73.71%-81.11%,respectively.The models showed high reliability,with accuracy rates for primary pollutant identification ranging from 71.26%to 83.33%for MIX1 and 73.71%to 81.11%for MIX2 across the 1-7 day forecast,providing valuable tools for environmental authorities to implement targeted air pollution control measures.关键词
机器学习/因果分析/空气质量预报Key words
machine learning/causal analysis/air quality forecasting分类
天文与地球科学引用本文复制引用
云翔,张璐瑶,刘子菁,翟志宏,蔡顺明,朱丽媛,何耀斌..基于因果分析的机器学习模型在空气质量预报中的构建及评估——以广州市为例[J].热带气象学报,2026,42(1):132-152,21.基金项目
广东省气象局科技项目(GRMC2023M10、GRMC2025Q42) (GRMC2023M10、GRMC2025Q42)
中国气象局气象干部学院科研项目(2025CMATCQN16) (2025CMATCQN16)
广州市气象学会科学技术研究项目(Z202326)共同资助 (Z202326)