Skip to content
This repository has been archived by the owner on Feb 16, 2022. It is now read-only.

Latest commit

 

History

History
33 lines (19 loc) · 657 Bytes

README.rst

File metadata and controls

33 lines (19 loc) · 657 Bytes

Wextracto: Web Data Extraction

Build Status

Wextracto is a toolkit for command-line web data extraction.

Installation

$ pip install wextracto

Kicking the Tyres

$ echo -e "[wex]\nsitemaps=wex.sitemaps:urls_from_sitemaps" > entry_points.txt
$ wex "http://www.ebay.com/robots.txt"

Documentation

The documentation can be found here:

http://wextracto.readthedocs.org/en/latest/index.html