Academic Awards Scraper (This is all just a test placeholder)

This repository contains Python scripts for scraping data from websites of highly prestigious and prestigious academic awards. The goal is to collect information about award recipients, categories, and other relevant details.

Features

Scrapes data from multiple academic award websites
Extracts information such as award names, recipients, categories, and years
Stores data in a structured format (CSV, JSON, etc.)
Handles website navigation and pagination
Includes error handling and logging

Requirements

Python 3.8+
BeautifulSoup4
Requests
Pandas
Selenium (for dynamic content)

Installation

Clone the repository:

git clone https://github.com/yourusername/academic-awards-scraper.git
cd academic-awards-scraper

Create and activate a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`

Install the required packages:
```
pip install -r requirements.txt
```

Usage

Update the config.json file with the URLs of the award websites you want to scrape and other configuration details.
Run the scraper:
```
python scraper.py
```
The scraped data will be saved in the output directory in the specified format.

Configuration

The config.json file should contain the following fields:

urls: List of award website URLs to scrape
output_format: Format to save the scraped data (e.g., CSV, JSON)
log_level: Logging level (e.g., DEBUG, INFO, WARNING)

Example config.json:

{
    "urls": [
        "https://example.com/award1",
        "https://example.com/award2"
    ],
    "output_format": "csv",
    "log_level": "INFO"
}

Contributing

Contributions are welcome! Please fork the repository and submit a pull request with your changes.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgements

BeautifulSoup4
Requests
Pandas
Selenium

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Awards Scrape Code.ipynb		Awards Scrape Code.ipynb
LICENSE		LICENSE
Python.gitignore		Python.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Academic Awards Scraper (This is all just a test placeholder)

Features

Requirements

Installation

Usage

Configuration

Contributing

License

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Academic Awards Scraper (This is all just a test placeholder)

Features

Requirements

Installation

Usage

Configuration

Contributing

License

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages