Skip to content

JMkrish/web_scraping

This branch is 12 commits behind Siva-Venigalla/web_scraping:main.

Repository files navigation

README.md

Project Title

Web Scraper

Introduction

This README provides essential information for setting up and running the Web Scraper application. Before you begin, make sure you have the necessary prerequisites in place.

Prerequisites

Before you can run the Web Scraper application, you need to ensure you have the following prerequisites installed on your system:

  • Python (3.10 or higher)
  • pip (Python package manager)

To install the required Python packages, navigate to the project directory and run the following command:

pip install -r requirements.txt

Additionally, Web Scraper relies on the wkhtmltopdf tool for generating PDFs. If you are using Ubuntu, you can install it using the following command:

sudo apt-get install wkhtmltopdf

Usage

Once you have met all the prerequisites, you can run the Web Scraper application with the following command:

python scraper.py <base url>

Replace <base url> with the URL you want to use as the starting point for your scraping task.

Examples

Here are a few example commands for running the Web Scraper application:

python scraper.py https://example.com
python scraper.py https://anotherwebsite.com

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%