Skip to content
/ taggsy Public

Find the most commonly occurring words among documents

License

Notifications You must be signed in to change notification settings

mkarots/taggsy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

README

Taggsy is a python library that will pick the most common occuring words between a number documents.

Installation:

# Clone repo and make virtualenv
git clone [email protected]:michael-karotsieris/taggsy.git
cd taggsy; python -mvenv venv; source venv/bin/activate;

# Install dependencies and run tests
make init;make test;

# Install python package

pip install . 

Example usage:

from taggsy import Core
core = Core()
text1 = 'An example document'
text2 = 'Another document'

core.add_documents([text1, text2])
print(core.most_common())

>>> {'document': 
    {'count': 2,
     'docs': ['fbyzuj', 'ktbuwx'],
     'sentences': ['An example document', 'Another document']}}

Or run as python module

python -mtaggsy doc1.txt doc2.txt 

About

Find the most commonly occurring words among documents

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published