Key notes:
- Preprocessed the data in Python:
  - Outliers were capped at the maximum value allowed for each variable.
  - Missing values were replaced with the median for numerical variables and the mode for categorical variables.
- Loaded the preprocessed data into Tableau Desktop.
- Changed the default aggregation of measures from sum to average, because each measure is itself an average over a period of time.
- Chose 12 of the 26 megacities, spread across the country, to get an overall picture of the AQI in India.
- Built bar charts for every pollutant in those 12 megacities, comparing the pre-COVID and post-COVID periods.
- Found a decrease in AQI and in pollutant levels after COVID hit, which is good for the environment.
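
The Python preprocessing steps noted above (capping outliers, then imputing missing values) could look roughly like the pandas sketch below. It is an illustration, not the code used for this project: the function name `preprocess`, the column names `City` and `PM2.5`, and the cap of 500 are all assumptions, since the notes do not state the actual variables or limits.

```python
import pandas as pd


def preprocess(df: pd.DataFrame, caps: dict) -> pd.DataFrame:
    """Cap outliers, then fill missing values (hypothetical helper).

    `caps` maps a numeric column name to the maximum value allowed for it.
    The real limits are not given in the notes, so callers supply their own.
    """
    out = df.copy()  # leave the caller's frame untouched

    # Cap each listed column at its allowed maximum.
    for col, cap in caps.items():
        out[col] = out[col].clip(upper=cap)

    # Impute: median for numeric columns, mode for everything else.
    for col in out.columns:
        if not out[col].isna().any():
            continue
        if pd.api.types.is_numeric_dtype(out[col]):
            out[col] = out[col].fillna(out[col].median())
        else:
            modes = out[col].mode()
            if not modes.empty:  # skip columns that are entirely missing
                out[col] = out[col].fillna(modes.iloc[0])
    return out


# Tiny made-up sample, for illustration only (not project data).
sample = pd.DataFrame(
    {
        "City": ["Delhi", "Delhi", None, "Mumbai"],
        "PM2.5": [100.0, 900.0, None, 50.0],
    }
)
clean = preprocess(sample, caps={"PM2.5": 500.0})
print(clean)
```

In this sketch the capping runs before imputation, so the median is computed on already-capped values. Whether the original workflow used that order is not stated.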