Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Reading LevelReading Level
-
Content TypeContent Type
-
YearFrom:-To:
-
More FiltersMore FiltersItem TypeIs Full-Text AvailableSubjectPublisherSourceDonorLanguagePlace of PublicationContributorsLocation
Done
Filters
Reset
7,413
result(s) for
"Pollution forecasting"
Sort by:
Application of the XGBoost Machine Learning Method in PM2.5 Prediction: A Case Study of Shanghai
2020
Air quality forecasting is crucial to reducing air pollution in China, which has detrimental effects on human health. Atmospheric chemical-transport models can provide air pollutant forecasts with high temporal and spatial resolution and are widely used for routine air quality predictions (e.g., 1–3 days in advance). However, the model’s performance is limited by uncertainties in the emission inventory and biases in the initial and boundary conditions, as well as deficiencies in the current chemical and physical schemes. As a result, experimentation with several new methods, such as machine learning, is occurring in the field of air quality forecasting. This study combined hourly PM
2.5
mass concentration forecasts from an operational air quality numerical prediction system (WRF-Chem) at the Shanghai Meteorological Service (SMS) with comprehensive near-surface measurements of air pollutants and meteorological conditions to develop a machine learning model that estimates the daily PM
2.5
mass concentration in Shanghai, China. With correlation coefficients that are higher by 50–100% and a standard deviation that is lower by 14–24 µg m
–3
, the machine learning model provides significantly better daily forecasting of PM
2.5
than the WRF-Chem model. Thus, this research offers a new technique for enhancing air quality forecasting in China.
Journal Article
PM2.5 forecasting for an urban area based on deep learning and decomposition method
by
Abdul Malek, Marlinda
,
Zaini, Nur’atiah
,
Ahmed, Ali Najah
in
639/166/986
,
639/705/1042
,
704/172/4081
2022
Rapid growth in industrialization and urbanization have resulted in high concentration of air pollutants in the environment and thus causing severe air pollution. Excessive emission of particulate matter to ambient air has negatively impacted the health and well-being of human society. Therefore, accurate forecasting of air pollutant concentration is crucial to mitigate the associated health risk. This study aims to predict the hourly PM2.5 concentration for an urban area in Malaysia using a hybrid deep learning model. Ensemble empirical mode decomposition (EEMD) was employed to decompose the original sequence data of particulate matter into several subseries. Long short-term memory (LSTM) was used to individually forecast the decomposed subseries considering the influence of air pollutant parameters for 1-h ahead forecasting. Then, the outputs of each forecast were aggregated to obtain the final forecasting of PM2.5 concentration. This study utilized two air quality datasets from two monitoring stations to validate the performance of proposed hybrid EEMD-LSTM model based on various data distributions. The spatial and temporal correlation for the proposed dataset were analysed to determine the significant input parameters for the forecasting model. The LSTM architecture consists of two LSTM layers and the data decomposition method is added in the data pre-processing stage to improve the forecasting accuracy. Finally, a comparison analysis was conducted to compare the performance of the proposed model with other deep learning models. The results illustrated that EEMD-LSTM yielded the highest accuracy results among other deep learning models, and the hybrid forecasting model was proved to have superior performance as compared to individual models.
Journal Article
Comparative Analysis of Multiple Deep Learning Models for Forecasting Monthly Ambient PM2.5 Concentrations: A Case Study in Dezhou City, China
by
Guo, Qingchun
,
He, Zhenfang
in
Air pollution
,
Air pollution forecasting
,
artificial neural network
2024
Ambient air pollution affects human health, vegetative growth and sustainable socio-economic development. Therefore, air pollution data in Dezhou City in China are collected from January 2014 to December 2023, and multiple deep learning models are used to forecast air pollution PM2.5 concentrations. The ability of the multiple models is evaluated and compared with observed data using various statistical parameters. Although all eight deep learning models can accomplish PM2.5 forecasting assignments, the precision accuracy of the CNN-GRU-LSTM forecasting method is 34.28% higher than that of the ANN forecasting method. The result shows that CNN-GRU-LSTM has the best forecasting performance compared to the other seven models, achieving an R (correlation coefficient) of 0.9686 and an RMSE (root mean square error) of 4.6491 μg/m3. The RMSE values of CNN, GRU and LSTM models are 57.00%, 35.98% and 32.78% higher than that of the CNN-GRU-LSTM method, respectively. The forecasting results reveal that the CNN-GRU-LSTM predictor remarkably improves the performances of benchmark CNN, GRU and LSTM models in overall forecasting. This research method provides a new perspective for predictive forecasting of ambient air pollution PM2.5 concentrations. The research results of the predictive model provide a scientific basis for air pollution prevention and control.
Journal Article
Machine learning framework for forecasting air pollution: Evaluating seasonal and climatic influences in Istanbul, Turkey
by
Elhaty, Ismail A.
,
Al-Najjar, Hazem
,
AL-Rousan, Nadia
in
Accuracy
,
Air Pollutants - analysis
,
Air pollution
2025
Air pollution, driven by seasonal and meteorological variations, poses a significant threat to public health and urban sustainability. Despite numerous forecasting approaches, the influence of seasonal patterns on air pollutant levels remains underexplored. This study presents a computational framework utilizing the Nonlinear Autoregressive network with Exogenous inputs (NARX) model to predict concentrations of key pollutants SO₂, PM₁₀, NO, NOX, and O₃ in Esenyurt, one of the most industrialized districts in Istanbul, Turkey. Through systematic feature selection techniques, the study determines the most influential seasonal factors for each pollutant, reducing model complexity while improving predictive accuracy. The developed framework exhibits substantial improvements in predictive performance, with the optimal models achieving high determination coefficients (up to R² = 0.965 for O₃) and low error metrics across training and validation datasets. Particularly, the inclusion of seasonal variables considerably improved prediction accuracy for NO, NO₂, and PM₁₀, while SO₂ predictions performed best when utilizing comprehensive seasonal indicators. These results demonstrate that seasonal dynamics play a crucial role in governing pollutant behavior and highlight the importance of incorporating such variables in forecasting models. This research contributes significantly to the field by advancing methodological approaches in air quality prediction while providing an adaptable model for policymakers and environmental agencies to implement in proactive pollution management strategies. Through examination of seasonal dependencies in air pollutant patterns, the study delivers a practical tool for urban planning and public health applications in rapidly expanding metropolitan regions.
Journal Article
Air Pollution Forecasting Using Artificial and Wavelet Neural Networks with Meteorological Conditions
by
Meng, Jingjing
,
Liu, Jiazhen
,
Chen, Yongjin
in
Air pollution
,
Air pollution forecasting
,
Air quality
2020
Air quality forecasting is a significant method of protecting public health because it provides early warning of harmful air pollutants. In this study, we used correlation analysis and artificial neural networks (ANNs; including wavelet ANNs [WANNs]) to identify the linear and nonlinear associations, respectively, between the air pollution index (API) and meteorological variables in Xi’an and Lanzhou. Evaluating twelve algorithms and nineteen network topologies for the ANN and WANN models, we discovered that the optimal input variables for an API forecasting model were the APIs from the 3 preceding days and sixteen selected meteorological factors. Additionally, the API could be accurately predicted based solely on the value recorded 3 days earlier. Based on the correlation coefficients between the air pollution index of the targeted day and the tested variables, the API displayed the closest relationship with the API 1 day earlier as well as stronger correlations with the average temperature, average water vapor pressure, minimum temperature, maximum temperature, API 2 days earlier, and API 3 days earlier. When Bayesian regularization was applied as a training algorithm, the WANN and ANN models accurately reproduced the APIs in both Xi’an and Lanzhou, although the WANN model (R = 0.8846 for Xi’an and R = 0.8906 for Lanzhou) performed better than the ANN (R = 0.8037 for Xi’an and R = 0.7742 for Lanzhou) during the forecasting stage. These results demonstrate that WANNs are effective in short-term API forecasting because they can recognize historic patterns and thereby identify nonlinear relationships between the input and output variables. Thus, our study may provide a theoretical basis for environmental management policies.
Journal Article
Improving 3-day deterministic air pollution forecasts using machine learning algorithms
by
Zhang, Zhiguo
,
Engardt, Magnuz
,
Ma, Xiaoliang
in
Additives
,
Air pollution
,
Air pollution forecasting
2024
As air pollution is regarded as the single largest environmental health risk in Europe it is important that communication to the public is up to date and accurate and provides means to avoid exposure to high air pollution levels. Long- and short-term exposure to outdoor air pollution is associated with increased risks of mortality and morbidity. Up-to-date information on present and coming days' air quality helps people avoid exposure during episodes with high levels of air pollution. Air quality forecasts can be based on deterministic dispersion modelling, but to be accurate this requires detailed information on future emissions, meteorological conditions and process-oriented dispersion modelling. In this paper, we apply different machine learning (ML) algorithms – random forest (RF), extreme gradient boosting (XGB), and long short-term memory (LSTM) – to improve 1, 2, and 3 d deterministic forecasts of PM10, NOx, and O3 at different sites in Greater Stockholm, Sweden. It is shown that the deterministic forecasts can be significantly improved using the ML models but that the degree of improvement of the deterministic forecasts depends more on pollutant and site than on what ML algorithm is applied. Also, four feature importance methods, namely the mean decrease in impurity (MDI) method, permutation method, gradient-based method, and Shapley additive explanations (SHAP) method, are utilized to identify significant features that are common and robust across all models and methods for a pollutant. Deterministic forecasts of PM10 are improved by the ML models through the input of lagged measurements and Julian day partly reflecting seasonal variations not properly parameterized in the deterministic forecasts. A systematic discrepancy by the deterministic forecasts in the diurnal cycle of NOx is removed by the ML models considering lagged measurements and calendar data like hour and weekday, reflecting the influence of local traffic emissions. For O3 at the urban background site, the local photochemistry is not properly accounted for by the relatively coarse Copernicus Atmosphere Monitoring Service ensemble model (CAMS) used here for forecasting O3 but is compensated for using the ML models by taking lagged measurements into account. Through multiple repetitions of the training process, the resulting ML models achieved improvements for all sites and pollutants. For NOx at street canyon sites, mean squared error (MSE) decreased by up to 60 %, and seven metrics, such as R2 and mean absolute percentage error (MAPE), exhibited consistent results. The prediction of PM10 is improved significantly at the urban background site, whereas the ML models at street sites have difficulty capturing more information. The prediction accuracy of O3 also modestly increased, with differences between metrics. Further work is needed to reduce deviations between model results and measurements for short periods with relatively high concentrations (peaks) at the street canyon sites. Such peaks can be due to a combination of non-typical emissions and unfavourable meteorological conditions, which are rather difficult to forecast. Furthermore, we show that general models trained using data from selected street sites can improve the deterministic forecasts of NOx at the station not involved in model training. For PM10 this was only possible using more complex LSTM models. An important aspect to consider when choosing ML algorithms is the computational requirements for training the models in the deployment of the system. Tree-based models (RF and XGB) require fewer computational resources and yield comparable performance in comparison to LSTM. Therefore, tree-based models are now implemented operationally in the forecasts of air pollution and health risks in Stockholm. Nevertheless, there is big potential to develop generic models using advanced ML to take into account not only local temporal variation but also spatial variation at different stations.
Journal Article
An ensemble learning based hybrid model and framework for air pollution forecasting
by
Chang, Yue-Shan
,
Chiao, Hsin-Ta
,
Huang, Yo-Ping
in
Air pollution
,
Air pollution forecasting
,
Air quality
2020
As advance of economy and industry, the impact of air pollution has gradually gained attention. In order to predict air quality, there were many studies that exploited various machine learning techniques to build predictive model for pollutant concentration or air quality prediction. However, enhancing the prediction performance always is the common problem of existing studies. Traditional templates based on machine learning and deep learning methods, such as GBTR (gradient boosted tree regression), SVR (support vector machine-based regression), and LSTM (long short-term memory), are most promising approaches to address these problems. Some previous researches showed that ensemble learning technology can improve predictive performance of other domains. In order to improve the accuracy of forecasting, in this paper, we propose a hybrid model and framework to improve the forecasting accuracy of air pollution. We not only exploit stacking-based ensemble learning scheme with Pearson correlation coefficient to calculate the correlation between different machine learning models to integrate various forecasting models together, but also construct a framework based on Spark+Hadoop machine learning and TensorFlow deep learning framework to physically integrate these models to demonstrate the next 1 to 8 h’ air pollution forecasting. We also conduct experiments and compare the result with GBTR, SVR, LSTM, and LSTM2 (version 2) models to demonstrate the proposed hybrid model’s predictive performance. The experimental results show that the hybrid model is superior to the existing models used for predicting air pollution.
Journal Article
Air Pollution Prediction with Multi-Modal Data and Deep Neural Networks
by
Kalajdziski, Slobodan
,
Trajkovik, Vladimir
,
Zdravevski, Eftim
in
accuracy
,
Air monitoring
,
Air pollution
2020
Air pollution is becoming a rising and serious environmental problem, especially in urban areas affected by an increasing migration rate. The large availability of sensor data enables the adoption of analytical tools to provide decision support capabilities. Employing sensors facilitates air pollution monitoring, but the lack of predictive capability limits such systems’ potential in practical scenarios. On the other hand, forecasting methods offer the opportunity to predict the future pollution in specific areas, potentially suggesting useful preventive measures. To date, many works tackled the problem of air pollution forecasting, most of which are based on sequence models. These models are trained with raw pollution data and are subsequently utilized to make predictions. This paper proposes a novel approach evaluating four different architectures that utilize camera images to estimate the air pollution in those areas. These images are further enhanced with weather data to boost the classification accuracy. The proposed approach exploits generative adversarial networks combined with data augmentation techniques to mitigate the class imbalance problem. The experiments show that the proposed method achieves robust accuracy of up to 0.88, which is comparable to sequence models and conventional models that utilize air pollution data. This is a remarkable result considering that the historic air pollution data is directly related to the output—future air pollution data, whereas the proposed architecture uses camera images to recognize the air pollution—which is an inherently much more difficult problem.
Journal Article
Synergistic effects of synoptic weather patterns and topography on air quality: a case of the Sichuan Basin of China
2019
Heavy air pollution is strongly influenced by weather conditions and is thus sensitive to climate change. Especially, for the areas with complex topography such as the Sichuan Basin (SB), one of the most polluted areas of China, the synergistic effects of synoptic weather patterns and topography on air quality are unclear and warrant investigation. This study examined the typical synoptic patterns of SB in winter days of 2013–2017 and revealed their synergistic effects with topography on air quality. Three categories of synoptic patterns including dry low-trough, high-pressure, and wet low-vortex patterns accompanying heavy, medium, and slight air pollution, respectively, were identified. In particular, the dry low-trough patterns occur most frequently, accounting for around 62% of the total days. In the case of this pattern, westerly wind prevails over the SB and the aloft atmosphere is warmer than the Tibetan Plateau (TP) at the same height, which induces the cold air over TP moving eastward to the SB. Under the synergistic effects of the cold air eastward movement and TP, a strong descending motion (known as foehn) is observed on the leeward slope of the towering TP. This foehn warming causes a stable layer above the planetary boundary layer (PBL), which suppresses secondary circulation and PBL. These features restrict atmospheric pollutant dispersion, resulting in poor air quality. In contrast, for the high-pressure and wet low-vortex patterns, cold air masses from the north invade southward and cover the northwest SB. This invasion remarkably decreases the atmospheric stability of the lower troposphere, deepens the PBL, and enhances the height of secondary circulation, thereby facilitating air pollutant dispersion. Moreover, the wet low-vortex pattern is accompanied by frequent precipitation events (with 80% rainy days), further bringing down air pollution levels. These findings provide an insight for improving air pollution forecast in the complex terrain areas under global warming.
Journal Article