Analysing Glassdoor Reviews: Impact of Job Satisfaction on Employee Retention and Firm Performance in the IT Industry
The goal here is to create machine learning models to help with the titular objective. We want to determine the impact of:
- Job satisfaction on
- Employee retention and
- Firm performance
- within the IT industry only.
Our data sources are Glassdoor, Compustat for retrieving measures of firm financial performance, and a Ticker dataset to match company names with their tickers, ensuring that financial data can be retrieved for the companies reviewed on Glassdoor.
To achieve our goal, we'll follow these steps:
- Scrape Glassdoor reviews
- Clean Glassdoor data
- Retrieve tickers of the top public IT firms, effectively restricting the scope to only companies within the IT Industry, and only those with publicly available financial data
- Merge datasets
- Clean merged data
- Exploratory data analysis
- Extract employee retention data from Glassdoor review data
- Retrieve financial performance data
- Merge into final dataset and prepare for linear regression
- Visualise dataset relationships
- Create ML models with Job Satisfaction predicting Retention
- Create ML models with Job Satisfaction predicting Market Cap
- Create ML models with Job Satisfaction predicting Return on Assets