Commit

Rerunning all the analysis notebooks for the report (#81)
* Some edits to the naming of level b and c groups in build taxonomy, add config for new analysis

* Changes to locations_to_nuts to allow for new data structure

* Allow for tfidf params in naming of levels

* Rerunning all analysis and applications notebooks

* Replace report images with updated ones

* Add new images to main readme

* Add image of 2022 skill clusters

* Move get duplicate sent ids to pipeline and reformat output to be more memory efficient

* Save out duplicates only

* Darken green in examples jpg

* Update analysis notebooks to include the additional data from the duplicated sentences

* Add combination of duplicated sentences dictionary and save, and describe script in the readme

* Delete some lines that shouldn't be there from cluster embeddings
lizgzil authored Feb 9, 2022
1 parent 0a56439 commit 7059e46
Showing 26 changed files with 1,913 additions and 1,685 deletions.
6 changes: 3 additions & 3 deletions README.md
@@ -8,7 +8,7 @@ This repo contains the source code for this project. To read the full technical

An overview of the methodology, coloured by the three main steps to the pipeline, can be visualised below:

-![](./outputs/reports/figures/methodology_overview_pipeline.jpg)
+![](./outputs/reports/figures/Jan%202022/methodology_overview_pipeline.jpg)

### Pipeline steps

@@ -23,10 +23,10 @@ More details of the steps included in this project, and running instructions, ca

This repository also contains various pieces of analysis of the taxonomy. These are discussed in the main analysis [README file](skills_taxonomy_v2/analysis/README.md).

-<img src="./outputs/reports/figures/hierarchy_numbers.jpg" width="700">
+<img src="./outputs/reports/figures/Jan 2022/hierarchy_numbers.jpg" width="700">

#### Examples of the hierarchy
-<img src="./outputs/reports/figures/taxonomy_example.jpg" width="700">
+<img src="./outputs/reports/figures/Jan 2022/taxonomy_example.jpg" width="700">

## Setup

Binary file removed outputs/reports/figures/hierarchy_overview.jpg
Binary file not shown.
Binary file removed outputs/reports/figures/taxonomy_example.jpg
Binary file not shown.