Selenium grid scraper

Instructions to run:

Install the dependencies with pip install -r requirements.txt.
Start the Selenium Grid container with docker compose up.
Run the scraper with python book-scrapper.py.

How it works

This script scrapes book data from https://books.toscrape.com. It uses Selenium Grid to open concurrent browser sessions so that multiple pages are scraped in parallel, then stores the results in a CSV file.

Note: The website is open for scraping and does not require Selenium. The script uses Selenium anyway because it was written as a training exercise.
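
The concurrent-session approach described above can be sketched roughly as follows. This is not the code in book-scrapper.py, only a minimal illustration under stated assumptions: the hub address (http://localhost:4444), the output filename (books.csv), the number of pages, and the helper names page_urls, scrape_page, write_csv and main are all hypothetical. The CSS selectors are the ones the site's listing pages use at the time of writing and may change.

```python
import csv
from concurrent.futures import ThreadPoolExecutor

# Assumed values: adjust to match your docker compose setup.
GRID_URL = "http://localhost:4444"
BASE_URL = "https://books.toscrape.com/catalogue/page-{}.html"


def page_urls(first, last):
    """Build the listing-page URLs for pages first..last (inclusive)."""
    return [BASE_URL.format(n) for n in range(first, last + 1)]


def scrape_page(url):
    """Open one remote browser session on the grid and scrape one page."""
    # Imported here so the pure helpers can be used without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.common.by import By

    options = webdriver.ChromeOptions()
    driver = webdriver.Remote(command_executor=GRID_URL, options=options)
    try:
        driver.get(url)
        rows = []
        for card in driver.find_elements(By.CSS_SELECTOR, "article.product_pod"):
            link = card.find_element(By.CSS_SELECTOR, "h3 a")
            price = card.find_element(By.CSS_SELECTOR, "p.price_color")
            rows.append({"title": link.get_attribute("title"), "price": price.text})
        return rows
    finally:
        # Always release the grid slot, even if scraping fails.
        driver.quit()


def write_csv(rows, path):
    """Write the scraped rows to a CSV file with a header line."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["title", "price"])
        writer.writeheader()
        writer.writerows(rows)


def main():
    urls = page_urls(1, 3)
    # One thread per page; each thread holds its own grid session.
    with ThreadPoolExecutor(max_workers=3) as pool:
        pages = list(pool.map(scrape_page, urls))
    write_csv([row for page in pages for row in page], "books.csv")
```

With the grid running, calling main() should scrape the first three listing pages in parallel and write the combined rows to books.csv. The number of sessions that actually run at once is limited by how many browser slots the grid exposes.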
