Abstract—Air pollution is a considerable health danger to the environment. The objective of this study was to assess the characteristics of air quality and predict PM10 concentrations using boosted regression trees (BRTs). The maximum daily PM10 concentration data from 2002 to 2016 were obtained from the air quality monitoring station in Kuching, Sarawak. Eighty percent of the monitoring records were used for the training and twenty percent for the validation of the models. The best iteration of the BRT model was performed by optimizing the prediction performance, while the BRT algorithm model was constructed from multiple regression models. The two main parameters that were used were the learning rate (lr) and tree complexity (tc), which were fixed at 0.01 and 5, respectively. Meanwhile, the number of trees (nt) was determined by using an independent test set (test), a 5-fold cross validation (CV) and out-of-bag (OOB) estimation. The algorithm model for the BRT produced by using the CV was the best guide to be used compared with the OOB to test the predicted PM10 concentration. The performance indicators showed that the model was adequate for the next day’s prediction (PA=0.638, R2=0.427, IA=0.749, NAE=0.267, and RMSE=28.455).
Index Terms—Accuracy measures, air Pollution, boosted regression trees, PM10, regression.
The authors are with the Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, 13500 Permatang Pauh, Malaysia (e-mail: email@example.com, firstname.lastname@example.org, email@example.com, firstname.lastname@example.org, email@example.com).
Cite: Wan Nur Shaziayani, Ahmad Zia Ul-Saufie, Syarifah Adilah Mohamed Yusoff, Hasfazilah Ahmat, and Zuraira Libasin, "Evaluation of Boosted Regression Tree for the Prediction of the Maximum 24-Hour Concentration of Particulate Matter," International Journal of Environmental Science and Development vol. 12, no. 4, pp. 126-130, 2021.Copyright © 2021 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).