Web_Scrap_e-commerce

It's a simple app which is design to extract the number of products in each category from an e-commerce website using BeautifulSoup(bs4) and Selenium. The website that has been scrapped https://www.harveynorman.com.au/.

Prerequisites

BeautifulSoup4
Selenium and Selenium-Webdriver
Requests
Pandas

Installation

Clone the repo

git clone https:://github.com/udit1999/Web_Scrap_e-commerce.git

Install Python packages

pip install -r requirements.txt

Download the selenium webdriver(according to your system) and save it on webdriver folder.(this is only for firefox)

Usage

To get all the product links

  python scrapper.py

To get number of product in each category

  python app.py

The data will be stored in csv file in Data Folder.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Data		Data
pages		pages
parsers		parsers
product_locators		product_locators
webdriver		webdriver
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirement.txt		requirement.txt
scrapper.py		scrapper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web_Scrap_e-commerce

Prerequisites

Installation

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Web_Scrap_e-commerce

Prerequisites

Installation

Usage

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages