README.md

Project Title

Web Scraper

Introduction

This README provides essential information for setting up and running the Web Scraper application. Before you begin, make sure you have the necessary prerequisites in place.

Prerequisites

Before you can run the Web Scraper application, you need to ensure you have the following prerequisites installed on your system:

Python (3.10 or higher)
pip (Python package manager)

To install the required Python packages, navigate to the project directory and run the following command:

pip install -r requirements.txt

Additionally, Web Scraper relies on the wkhtmltopdf tool for generating PDFs. If you are using Ubuntu, you can install it using the following command:

sudo apt-get install wkhtmltopdf

Usage

Once you have met all the prerequisites, you can run the Web Scraper application with the following command:

python scraper.py <base url>

Replace <base url> with the URL you want to use as the starting point for your scraping task.

Examples

Here are a few example commands for running the Web Scraper application:

python scraper.py https://example.com
python scraper.py https://anotherwebsite.com

Name	Name	Last commit message	Last commit date
Latest commit Siva-Venigalla Merge pull request Siva-Venigalla#2 from piedpiper36/vask Oct 19, 2023 719f3d2 · Oct 19, 2023 History 8 Commits
src	src	bs.py: Added Exception Handling and Persistence Support.	Oct 19, 2023
README.md	README.md	First cut of Web Scraper App	Oct 17, 2023
requirements.txt	requirements.txt	First cut of Web Scraper App	Oct 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README.md

Project Title

Introduction

Prerequisites

Usage

Examples

About

Releases

Packages

Languages

JMkrish/web_scraping

Folders and files

Latest commit

History

Repository files navigation

README.md

Project Title

Introduction

Prerequisites

Usage

Examples

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages