Forecast Of Air Pollution In An Industrial City Of Eastern India

IJEP 41(11): 1245-1251 : Vol. 41 Issue. 11 (November 2021)

Tripta Sinha1*, Kunal Sinha2, Abhinav Sahay1, Aman Kumar1 and Sandeep Nath Sahdeo3

1. Amity University, Patna – 801503, Bihar, India
2. GLA University, Mathura – 281 406, U.P., India
3. BIT Lalpur, Ranchi – 834 001, Jharkhand, India


In modern era, data science has emerged as an efficient tool for generating forecasting models. With the help of it, prediction has become easier than ever before. Making a prediction is necessary to build strategies accordingly. Therefore, it is now being implemented in every field including environmental studies. Assessment of the long-term concentration trends of air pollutants can be one of its fields of application. The present study can be viewed as a small step towards this. The study deals with generating a time-series model to forecast the future level of concentration of air pollutants in an industrial city Jamshedpur of eastern part of India which is a prominent site for mining and industries. Four monitoring stations were selected to monitor the air quality of the entire city during the year 2007-2017. The selection of the study period is based on the availability of data. Auto regressive integrated moving average and the SAS coding were employed in the prediction. Results showed that the model performed more than satisfactory in predicting the concentration levels of various pollutants. The finding of this study will help to bring awareness about the situation of pollution and the trend in the area which are at the least priority for the government bodies and will also help in its rational management. The findings also depict the efficiency of data science and the requirement for further research in this area. The method utilised is not area specific and can be applied to other regions.


Data science, forecasting, Python, ARIMA, air pollution


  1. Pope, R. and J. Wu. 2014. Characterizing air pollution patterns on multiple time scales in urban areas : A landscape ecological approach. Urban Ecosyst., 17:855-874.
  2. Paschalidou, A.K., et al. 2011. Forecasting hourley PM10concentration in Cyprus through artificial neural networks and multiple regression models: Implications to local environmental management. Env. Sci. Poll. Res., 18:316-327.
  3. Pandey, B., M. Agrawal and S. Singh. 2014. Assessment of air pollution around coal mining area : Emphasizing on spatial distributions, seasonal variations and heavy metals, using cluster and principal component analysis. Atmos. Poll. Res., 5(1): 79-81
  4. Corani, G. 2005. Air quality prediction in Milan : Feed-forward neural network, pruned neural networks and lazy learning. Ecol. Model., 185:513-529.
  5. Srivastava, C., S. Singh and A. P. Singh. 2018. Estimation of air pollution in Delhi using machine learning techniques. International Conference on Computing power and communication technologies (GUCON). Proceedings, pp 304-309.
  6. Ghorani-Azam, A., B. Riahi-Zanjani and M. Balali-Mood. 2016. Effect of air pollution on human health and practical measures for prevention in Iran. J. Res. Medical Sci., 21:65.
  7. See, S.W. and R. Balasubramanian. 2008. Chemical characteristics of fine particles emitted from different gas cooking methods. Atmos. Env., 42:8852-8862.
  8. Shah, A.S.V., et al. 2013. Global association of air pollution and heart failure : A systematic review and meta-analysis. The Lancet., 382:1039-1048.
  9. Mukherjee, A. and M. Agrawal. 2018. A global perspective of fine particulate matter pollution and its health effects. In Reviews of environmental contamination and toxicology (vol 244). Ed F.A. Gunther and P. de Voogt. Springer. pp 5-51.
  10. Donkelaar, A.V., et al. 2010. Global estimates of ambient fine particulate matter concentrations from satellite-based aerosol optical depth. Development and application. Env. Health Perspect., 118:847-855.
  11. Lee, B.J., B. Kim and K. Lee. 2014. Air pollution exposure and cardovascular diseases. Toxicol. Res., 30(2):71-75.
  12. Karambelas, A., et al. 2018. Urban versus rural health impacts attributable to PM2.5and O3in northern India. Env. Res. Letters. 13(6).
  13. Meenakshi, P. and M.K. Saseetharan. 2004. Urban air pollution forecasting with respect to SPM using time series neural networks modelling approach-A case study in Coimbatore city. J. Env. Sci. Eng., 46:92-101.
  14. Donoho, D. 2015. 50 years of data science. Turkey Centennial workshop. Princeton, NJ.
  15. Conway, D. 2010. Data science venn diagram. Available at : http://drewconway. com/zia/2013/26/the-data-science-venn-diagram.
  16. Tierney, B. 2012. Data science is multidisciplinary. Available at :
  17. Matter, U. 2013. Data science is business/computational social science in academia? Available at : http://gtven the data. computational. html.
  18. Ayankoya, K., A. Calitz and J. Greyling. 2014. Intrinsic relations between data science, big data, business analytics and datafication. Southern Africian Institute for Computer Scientist and Information Technologist Annual Conference. Proceedings, pp 192-198.
  19. Van der Aalst, W.M.P. 2014. Data scientist : The engineer of the future. Proceedings of the I-ESA. Enterprise Interoperability VI. Springer-Verlag, Berlin. pp 13-28.
  20. Ambika, G.N., et al. 2019. Air quality index prediction using linear regression. Int. J. Recent Tech. Eng., 8(2).
  21. Brian, S., et al. 2018. Forecasting air quality time series using deep learning. J. Air Waste Manage. Assoc., 68(8):866-886.
  22. Dubey, B., A.K. Pal and G. Singh. 2012. Trace metal composition of airborne particulate matter in the coal mining and non-mining areas of Dhanbad region, Jharkhand, India. Atmos. Poll. Res., 3(2):238-246.
  23. Eeftens, M., et al. 2012. Development of landuse regression models for PM2.5, PM2.5absorbance, PM10and PM coarse in 20 European study areas : Results of the ESCAPE project. Env. Sci. Tech., 46:11195-11205.
  24. Kanakiya, R.S., S.K. Singh and U. Shah. 2015. GIS application for spactial and temporal analysis of the air pollutants in urban area. Int. J. Adv. Remote Sens. GIS. 4:1120-1129.
  25. Kottur, S.V. and S.S. Mantha. 2015. An integrated model using artifical neural network (ANN) and kriging for forecasting air pollutants using meteorological data. Int. J. Adv. Res. Comput. Commun. Eng., 4:146-152.
  26. Mapoma, et al. 2014. Air quality assessment of carbon monoxide, nitrogen dioxide and sulphur dioxide levels in Blantyre, Malawi : A statisical approach to a stationary environmental monitoring station. African J. Env. Sci. Tech., 8(6):330-343.
  27. Maynard, D., et al. 2007. Mortality risk associated with short-term exposure to traffic particle and sulphates. Env. Health Persp., 115:751-755.
  28. Gordon, S., et al. 2019. Data science of natural environment : A research roadmap. Front. Env. Sci. DOI:10.3389/fenvs.2019.00121.
  29. Banerjee, S., et al. 2016. Seasonal variation in heavy metal contaminations in water and sediments of Jamshedpur stretch of Subarnarekha river, India. Env. Earth Sci., 75(3):265.
  30. Blair, G. S., et al. 2019. Data science of the natural environment: A research roadmap. Frontiers Env. Sci., 7:121.