It's a simple app which is design to extract the number of products in each category from an e-commerce website using BeautifulSoup(bs4) and Selenium. The website that has been scrapped https://www.harveynorman.com.au/.
- BeautifulSoup4
- Selenium and Selenium-Webdriver
- Requests
- Pandas
- Clone the repo
git clone https:://github.com/udit1999/Web_Scrap_e-commerce.git- Install Python packages
pip install -r requirements.txt- Download the selenium webdriver(according to your system) and save it on webdriver folder.(this is only for firefox)
- To get all the product links
python scrapper.py- To get number of product in each category
python app.pyThe data will be stored in csv file in Data Folder.