Skip to content

Latest commit

 

History

History
17 lines (11 loc) · 691 Bytes

README.md

File metadata and controls

17 lines (11 loc) · 691 Bytes

Selenium grid scraper

Instructions to run:

  1. Install the dependencies with the pip install -r requirements.txt command.
  2. Create the selenium grid container with the docker compose up command.
  3. Run the book-scrapper.py file.

How it works?

This script's purpose is to scrape book data from https://books.toscrape.com using Selenium Grid to create concurrent sessions for scraping data from multiple pages and store it in a CSV file.

Note: The website being scraped is open for scraping, and Selenium is not necessary. However, it's worth noting that this script was created as a training exercise, hence the use of Selenium despite it not being necessary.