📚 Book Market Intelligence Web Scraper

👨‍💻 Author

Shaik Abdul Sameer

Python Developer | Data Analytics Enthusiast | AI/ML Aspirant


📖 Project Overview

This project demonstrates web scraping using Python, BeautifulSoup, and Pandas. The application automatically extracts book information from multiple pages of an online bookstore and stores the collected data in CSV format for further analysis.
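The scraping flow described above can be sketched roughly as follows. This is a minimal illustration, not the contents of `scraper.py`: the README does not name the target site, so the CSS selectors assume markup modelled on the books.toscrape.com practice site, and the names `parse_books`, `scrape`, and `RATING_WORDS` are hypothetical.

```python
import requests
from bs4 import BeautifulSoup

# Assumption: ratings are encoded as a word in the element's class list,
# e.g. <p class="star-rating Three">, as on books.toscrape.com.
RATING_WORDS = {"One": 1, "Two": 2, "Three": 3, "Four": 4, "Five": 5}


def parse_books(html):
    """Extract title, price, and rating from one listing page (assumed markup)."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for card in soup.select("article.product_pod"):
        title = card.h3.a["title"]
        price = card.select_one("p.price_color").get_text(strip=True)
        rating_classes = card.select_one("p.star-rating")["class"]
        # Pick the first class name that is a known rating word, else None.
        rating = next(
            (RATING_WORDS[name] for name in rating_classes if name in RATING_WORDS),
            None,
        )
        rows.append({"Title": title, "Price": price, "Rating": rating})
    return rows


def scrape(url_template, pages):
    """Fetch pages 1..pages and collect the parsed rows from each."""
    rows = []
    for page in range(1, pages + 1):
        response = requests.get(url_template.format(page), timeout=10)
        response.raise_for_status()  # stop early on HTTP errors
        rows.extend(parse_books(response.text))
    return rows
```

Keeping the parsing step separate from the network call means `parse_books` can be checked against a saved HTML snippet without contacting the site. `url_template` is expected to contain a `{}` placeholder for the page number.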


🎯 Objectives

  • Collect book data from a website
  • Extract book titles, prices, and ratings
  • Store structured data in CSV format
  • Build a dataset for data analysis and visualization

🛠 Technologies Used

  • Python
  • Requests
  • BeautifulSoup
  • Pandas

✨ Features

✅ Multi-page web scraping

✅ Automated data collection

✅ Book title extraction

✅ Price extraction

✅ Rating extraction

✅ CSV dataset generation
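As a rough sketch of the CSV generation step, the collected records could be written out with Pandas along these lines. The helper name `save_books` and the sample record are hypothetical; the column names follow the dataset description in this README.

```python
import io

import pandas as pd

COLUMNS = ["Title", "Price", "Rating"]


def save_books(rows, path_or_buffer):
    """Write a list of record dicts to CSV with a fixed column order."""
    frame = pd.DataFrame(rows, columns=COLUMNS)
    frame.to_csv(path_or_buffer, index=False)  # index=False omits the row index
    return frame


# Usage with an in-memory buffer; a file name such as
# "books_market_data.csv" works the same way.
buffer = io.StringIO()
save_books([{"Title": "Sample Book", "Price": "£51.77", "Rating": 3}], buffer)
```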


📊 Dataset Information

The generated dataset contains:

| Column | Description     |
|--------|-----------------|
| Title  | Book Title      |
| Price  | Book Price      |
| Rating | Customer Rating |
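For analysis, the dataset could be loaded and tidied as in the sketch below. It assumes that `Price` is stored as text with a currency symbol (for example `£51.77`), which this README does not confirm; `clean_books` is a hypothetical helper name.

```python
import pandas as pd


def clean_books(frame):
    """Return a copy with Price converted from text such as '£51.77' to float."""
    cleaned = frame.copy()
    cleaned["Price"] = (
        cleaned["Price"]
        .astype(str)
        .str.replace(r"[^0-9.]", "", regex=True)  # drop currency symbols
        .astype(float)
    )
    return cleaned


# Small in-memory sample; with the real file this would be
# pd.read_csv("books_market_data.csv").
sample = pd.DataFrame(
    {"Title": ["Sample Book"], "Price": ["£51.77"], "Rating": [3]}
)
cleaned = clean_books(sample)
```

With numeric prices, summaries such as `cleaned.groupby("Rating")["Price"].mean()` become straightforward.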

📂 Project Structure

CodeAlpha_BookMarketIntelligence
│
├── scraper.py
├── books_market_data.csv
├── requirements.txt
└── README.md

🚀 How to Run

  1. Clone the repository
     git clone https://github.com/AbdulSameer2005/CodeAlpha_BookMarketIntelligence.git
  2. Install dependencies
     pip install -r requirements.txt
  3. Run the scraper
     python scraper.py

📈 Results

  • Successfully scraped 100 book records
  • Generated a structured CSV dataset
  • Demonstrated practical web scraping and data collection skills

🎓 Learning Outcomes

  • HTML Parsing
  • Web Scraping
  • Data Collection
  • Data Cleaning
  • CSV Export
  • Python Automation

⭐ Internship Task

CodeAlpha Data Analytics Internship – Task 1 (Web Scraping)


Connect With Me

📬 Contact

Shaik Abdul Sameer

📧 Email: abdulsameershaik013@gmail.com

🔗 LinkedIn: https://www.linkedin.com/in/shaik-abdul-sameer-31a5712a5

💻 GitHub: https://github.com/AbdulSameer2005
