Time Series Analysis of Beijing Air Pollution
<Chunqing Xu, cx495, cx495>
Air pollution has always been a hot issue in Beijing and China. It is harmful for residents and even though the government has attempted to solve this severe problem with different methods, it still occurs. In this project, factors causing air pollution in Beijing would be detected from time series analysis. Finding what are the main factors not only reveal why it is so hard to reduce air pollution in China, but also help the government make better policies towards air pollution.
Beijing PM2.5 hourly data (2015)
Resource: U.S. Department of State Air Quality Monitoring Program
This dataset provided by U.S. Department of State Air Quality Monitoring Program contains hourly PM2.5 value of Beijing, providing more than 8000 monitoring data in one year, which is beneficial for time series analysis. Useless columns would be dropped and the time information would be transferred into formal datetime data type for analysis.
Time series analysis containing rolling means and rolling standard deviations would be applied to this dataset. Hourly PM2.5 value will be explored in time. The amount of time meeting the air quality standard would be calculated. What's more, event detection may be used if there is a obvious sudden change in the time series plot.
1. Materials from PUI class
2. Time series analysis with pandas
1. A conclusion – what are the main factors of Beijing air pollution?
2. Visualizations – time series plots.
3. Suggestions for related government sectors and organizations to help reduce air pollution in Beijing.