Add taxonomy file to repo and extra information to the README (#83)

nestauk · May 5, 2022 · f56774c · f56774c
1 parent eeb2109
commit f56774c
Show file tree

Hide file tree

Showing 2 changed files with 16 additions and 12 deletions.
diff --git a/README.md b/README.md
@@ -1,15 +1,23 @@
-# skills-taxonomy-v2
+# A New Approach to Building a Skills Taxonomy
+
+The full technical report and blog article for this project will be released soon (link to follow).
 
 ## Introduction
 
-The UK labour market has been especially tumultuous in light of COVID-19 and Brexit. Employers [now face both long-running and circumstantial skills challenges](https://www.edge.co.uk/documents/167/04.05.21_Skills_shortages_bulletin_summary.pdf) due to such disruptive events. As we observe the impact of current events on the skills landscape, an informed labour market is more important than ever. Therefore, in partnership with the [Economic Statistics Centre of Excellence (ESCoE)](https://www.escoe.ac.uk/), we are releasing an updated skills taxonomy that is more open, more up-to-date and methodologically refined.
+There is no official and fully open skills taxonomy in the UK. There is a really important need for such a taxonomy that would enable consistent conceptualisation of workforce skills, together with consistent terminology and language around skills used by educators, careers advisers, policy makers and employers. The lack of a consistent language has multiple consequences such as creating confusion over the skills required for particular roles or the training needs of employees. At the same time, the effects of COVID-19 and Brexit have triggered rapid changes in skill demands as well as new skill shortages. This shifting landscape has only increased the need for an open and up-to-date skills taxonomy for the UK which could help to provide better quality and up to date information, in turn to better inform policy.
+
+Therefore, in partnership with the [Economic Statistics Centre of Excellence (ESCoE)](https://www.escoe.ac.uk/), we are releasing an updated skills taxonomy that is more open, more up-to-date and methodologically refined.
 
-This repo contains the source code for this project. To read the full technical report, click [here](https://docs.google.com/document/d/1ZHE6Z6evxyUsSiojdNatSa_yMDJ8_UlB1K4YD1AhGG8/edit#heading=h.r6ck9fjaexcl).
+This repo contains the source code for this project.
 
 An overview of the methodology, coloured by the three main steps to the pipeline, can be visualised below:
 
 ![](./outputs/reports/figures/Jan%202022/methodology_overview_pipeline.jpg)
 
+### The taxonomy file
+
+The taxonomy file is given [here](./outputs/taxonomy_data/2022.01.21_hierarchy_structure_named.json). To view this JSON file in a friendly format, you should download it and open it using Firefox. Alternatively, you could also use an online tool such as [JSON formatter](https://jsonformatter.org/json-viewer).
+
 ### Pipeline steps
 
 More details of the steps included in this project, and running instructions, can be found in their respective READMEs:
@@ -28,17 +36,12 @@ This repository also contains various pieces of analysis of the taxonomy. These
 #### Examples of the hierarchy
 <img src="./outputs/reports/figures/Jan 2022/taxonomy_example.jpg" width="700">
 
-## Setup
 
-- Meet the data science cookiecutter [requirements](http://nestauk.github.io/ds-cookiecutter), in brief:
-  - Install: `git-crypt`
-  - Have a Nesta AWS account configured with `awscli`
-- Run `make install` to configure the development environment:
-  - Setup the conda environment
-  - Configure pre-commit
-  - Configure metaflow to use AWS
+## Running the code
+
+This repository has been made public in the interest of openness, and hopefully that some of the scripts and functions it contains may be useful for others. However, the TextKernel dataset of job adverts is not available for use anymore (either by Nesta staff or the general public). Because of this, the pipeline can no longer be run from start to finish.
 
-## Running locally
+### Conda environment
 
 When you are running scripts from this repo for the first time you need to create the environment by running `make conda-create` to create the conda environment. Then everytime after this you can activate it using `conda activate skills-taxonomy-v2`. If you update the requirements then run `make conda-update`.
 

diff --git a/outputs/taxonomy_data/2022.01.21_hierarchy_structure_named.json b/outputs/taxonomy_data/2022.01.21_hierarchy_structure_named.json