Change the repository type filter
All
Repositories list
26 repositories
stormcrawler-docker
PublicResources for running StormCrawler with Docker servicesdigitalpebble.github.io
Publiccrawlurlfrontier
Publictika
Public- StormCrawler topology to evaluate the performance of different backends and configurations
ansible-storm
Publicnutch
Publicurlfrontier-client
PublicURLFrontier client written in Rust (mostly as a way of learning Rust)TextClassification
PublicA Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and can be used as a front end to various ML algorithms. libSVM and liblinear are currently embedded.stormcrawlerfight
Publicbehemoth
Public archivecrawler-commons
Publicsc-warc
Publictescobank
Public archivebehemoth-commoncrawl
Public archivetika-cc
PublicNutchFight
Publicbehemoth-elasticsearch
Public archivebehemoth-textclassification
Public archiveTextClassificationPlugin
Public archivengrams-api
Public archive