Python Developer | Data Analytics Enthusiast | AI/ML Aspirant
This project demonstrates web scraping using Python, BeautifulSoup, and Pandas. The application automatically extracts book information from multiple pages of an online bookstore and stores the collected data in CSV format for further analysis.
- Collect book data from a website
- Extract book titles, prices, and ratings
- Store structured data in CSV format
- Build a dataset for data analysis and visualization
- Python
- Requests
- BeautifulSoup
- Pandas
✅ Multi-page web scraping
✅ Automated data collection
✅ Book title extraction
✅ Price extraction
✅ Rating extraction
✅ CSV dataset generation
The generated dataset contains:
| Column | Description |
|---|---|
| Title | Book Title |
| Price | Book Price |
| Rating | Customer Rating |
CodeAlpha_BookMarketIntelligence
│
├── scraper.py
├── books_market_data.csv
├── requirements.txt
└── README.md
- Clone the repository
git clone https://github.com/AbdulSameer2005/CodeAlpha_BookMarketIntelligence.git- Install dependencies
pip install -r requirements.txt- Run the scraper
python scraper.py- Successfully scraped 100 book records
- Generated a structured CSV dataset
- Demonstrated practical web scraping and data collection skills
- HTML Parsing
- Web Scraping
- Data Collection
- Data Cleaning
- CSV Export
- Python Automation
CodeAlpha Data Analytics Internship – Task 1 (Web Scraping)
📧 Email: abdulsameershaik013@gmail.com
🔗 LinkedIn: https://www.linkedin.com/in/shaik-abdul-sameer-31a5712a5
💻 GitHub: https://github.com/AbdulSameer2005