A modular CLI for fast, resilient scraping of product data from e-commerce stores. Built for extensibility and polite scraping.
| Field | Details |
|---|---|
| Version | |
| License | MIT |
| Language | Python |
- Multi-Region Support: Scrapes sites in the SL and JP regions.
- Modular Architecture: Keeps core logic separate from site-specific scrapers.
- Auto-Update: Automatically checks for updates against the GitHub repository on startup.
- Resilient Scraping: Automatic retries handle transient errors with polite delays.
- Brand Extraction: Guards against messy or incomplete upstream data.
- Interactive CLI: Guides you through dependency checks, region selection, and scraper choice.
- Excel Export: Exports results to clean `.xlsx` files for analysis.
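The resilient-scraping behaviour described above can be sketched as a generic retry wrapper. This is a minimal illustration, not the project's actual implementation; the function name, attempt count, and delay values are assumptions:

```python
import random
import time

def fetch_with_retries(fetch, url, max_attempts=3, base_delay=1.0):
    """Call fetch(url), retrying transient failures with exponential
    backoff plus jitter so the target site is not hammered.

    `fetch` is any callable that raises on failure, e.g. a thin
    wrapper around requests.get that calls raise_for_status().
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return fetch(url)
        except Exception:
            if attempt == max_attempts:
                raise  # out of attempts: surface the error
            # Polite delay: 1x, 2x, 4x... the base, plus random jitter.
            time.sleep(base_delay * (2 ** (attempt - 1)) + random.uniform(0, base_delay))
```

Wrapping each page fetch in a helper like this keeps the per-site scrapers free of retry bookkeeping.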
Install Python 3.8+.
Clone the repo and install dependencies:
```bash
git clone https://github.com/Optane002/Web_Scraper.git
cd Web_Scraper
pip install -r requirements.txt  # requests, pandas, openpyxl, urllib3, etc.
```

Then run the CLI:

```bash
python web_scraper.py
```
The CLI presents a shell-style header, performs dependency checks, then prompts for region and target site.
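The startup update check compares the local `version.txt` against the copy in the GitHub repository. A minimal sketch of the comparison step (the helper names are illustrative; fetching the remote file is omitted):

```python
def _parse(version):
    """Turn a dotted version string like '1.2.0' into a comparable tuple."""
    return tuple(int(part) for part in version.strip().split("."))

def is_newer(remote, local):
    """True if the remote version (e.g. version.txt fetched from
    GitHub) is newer than the locally recorded one."""
    return _parse(remote) > _parse(local)
```

Tuple comparison handles multi-digit components correctly (`1.10.0` sorts after `1.9.0`), which naive string comparison would get wrong.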
```
price-scraper-cli/
├── config/
│   └── sites.py          # Maps countries/sites to their scrapers and config.
├── scrapers/
│   ├── __init__.py       # Package initializer.
│   ├── country1/         # Country1 scrapers
│   └── country2/         # Country2 scrapers
├── web_scraper.py        # Main CLI entry point and flow controller.
├── requirements.txt      # Python dependencies.
├── version.txt           # Current version tracking.
└── README.md
```
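Since `config/sites.py` maps countries and sites to their scrapers, a registry like the following is one plausible shape. This is a hypothetical sketch: the site name, keys, and values are assumptions, not the project's actual schema:

```python
# Hypothetical shape of SUPPORTED_SITES in config/sites.py.
SUPPORTED_SITES = {
    "SL": {
        "example_store": {                     # illustrative site name
            "base_url": "https://example-store.lk",
            "category_ids": [101, 102],
            "export_file": "example_store.xlsx",
            "scraper": "scrapers.country1.example_store",  # dotted import path
        },
    },
    "JP": {},  # JP sites would be registered the same way
}
```

Keeping this mapping in one module lets the CLI build its region and site menus without importing every scraper up front.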
- Create a scraper: add `scrapers/<country>/<site>.py` with a function that accepts a config dict and returns a list of product dicts.
- Wire it up:
  - Import your scraper inside `scrapers/<country>/__init__.py`.
  - Extend `SUPPORTED_SITES` in `config/sites.py` with the new entry (base URL, category IDs, export filename, etc.).
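The contract above — a function that takes a config dict and returns a list of product dicts — can be sketched as follows. The function name `scrape`, the config keys, and the record fields are illustrative assumptions, not the project's fixed API:

```python
# Hypothetical skeleton for scrapers/<country>/<site>.py.

def scrape(config):
    """Accept a site config dict, return a list of product dicts."""
    products = []
    # Real code would fetch and parse pages from config["base_url"]
    # here; this stub returns a single hard-coded example record.
    products.append({
        "name": "Example Product",
        "price": 19.99,
        "brand": "ExampleBrand",   # brand extraction guards go here
        "url": config.get("base_url", "") + "/product/1",
    })
    return products
```

Returning plain dicts keeps the core export step generic: any scraper's output can be handed straight to the Excel writer.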
This project is licensed under the MIT License. See LICENSE for details.