HackZurich2017

Repository housing a Chrome extension demo developed at HackZurich2017. The extension wraps a Python script that analyzes the text of a news article to determine its fringiness (a measure of non-mainstreamness). It does so by comparing the entities extracted from the article by PermID against the entities in a real-time stream of articles fetched via the Thomson Reuters® API.
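To make the idea of comparing entity sets concrete, here is a minimal sketch in plain Python. It is not the code in fringiness.py: the function names (`jaccard`, `toy_fringiness`), the use of Jaccard similarity, and the entity sets are all assumptions made for illustration. The real script works on entities returned by PermID and may score them differently.

```python
def jaccard(a, b):
    """Jaccard similarity of two entity sets (1.0 when both are empty)."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        return 1.0
    return len(a & b) / len(union)


def toy_fringiness(article_entities, stream_entities):
    """Return 1 - mean Jaccard similarity to the reference articles.

    0.0 means the article's entity set is identical to that of every
    reference article; 1.0 means it shares no entity with any of them.
    """
    if not stream_entities:
        raise ValueError("need at least one reference article")
    sims = [jaccard(article_entities, ref) for ref in stream_entities]
    return 1.0 - sum(sims) / len(sims)


# Hypothetical entity sets, made up for illustration.
stream = [
    {"European Central Bank", "Mario Draghi", "Eurozone"},
    {"European Central Bank", "Eurozone", "Inflation"},
]
mainstream_article = {"European Central Bank", "Eurozone"}
fringe_article = {"Atlantis", "Flat Earth Society"}

print(toy_fringiness(mainstream_article, stream))  # low score
print(toy_fringiness(fringe_article, stream))      # high score
```

An article that shares most of its entities with the reference stream scores close to 0, while one that shares none scores 1.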

Example Usage

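from bokeh.embed import file_html
from bokeh.resources import CDN

# Analyze a few paragraphs of news text.
text = <few-paragraphs-of-news-text>
res = fastrun(text)

# NOTE: res_times is not defined in this snippet; it presumably
# refers to the result computed above.
x, y, f = fringiness(res_to_matrix(res_times)[0])
plot = embedding_plot_bokeh(x, y, f, res)

# Render the plot as a standalone HTML page and save it.
html = file_html(plot, CDN, title="my plot")
with open("file.html", "w") as file:
    file.write(html)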

See also the Jupyter notebook, Hackzurich.ipynb.

Team Members:

Nikola Nikolov
Daniel Keller
Stan Kerstjens
Martin Holub

Download the word vectors from https://drive.google.com/file/d/0B7XkCwpI5KDYNlNUTTlSS21pQmM/edit and update the path to them in document_similarity.py.

You need the gensim and NLTK libraries: pip install gensim nltk
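As a rough illustration of how word vectors can be used to compare documents, the sketch below averages the vectors of each document's words and takes the cosine similarity of the two averages. This is an assumption about the general technique, not a description of document_similarity.py, which may use a different measure. The toy 3-dimensional vectors and the names `mean_vector`, `cosine`, and `toy_document_similarity` are hypothetical; in practice the downloaded vectors would take their place.

```python
import math

# Toy 3-dimensional "word vectors", made up for illustration only.
TOY_VECTORS = {
    "bank": [0.9, 0.1, 0.0],
    "rates": [0.8, 0.2, 0.1],
    "inflation": [0.7, 0.3, 0.0],
    "aliens": [0.0, 0.1, 0.9],
    "ufo": [0.1, 0.0, 0.8],
}


def mean_vector(tokens, vectors):
    """Average the vectors of the tokens that have one; None if none do."""
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        return None
    dim = len(known[0])
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def cosine(u, v):
    """Cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def toy_document_similarity(doc_a, doc_b, vectors=TOY_VECTORS):
    """Cosine similarity between the mean word vectors of two documents."""
    vec_a = mean_vector(doc_a.lower().split(), vectors)
    vec_b = mean_vector(doc_b.lower().split(), vectors)
    if vec_a is None or vec_b is None:
        raise ValueError("a document has no known words")
    return cosine(vec_a, vec_b)


finance = "bank rates inflation"
more_finance = "inflation bank"
fringe = "aliens ufo"

print(toy_document_similarity(finance, more_finance))  # close to 1
print(toy_document_similarity(finance, fringe))        # much lower
```

Two documents on the same topic end up with nearly parallel mean vectors, while an unrelated document points in a different direction and scores much lower.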

