Skip to content

GarlicDeveloper/Ollama-WebScrapper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

DESCRIPTION

Web Scraping & AI Integration Tool

Overview

This project is a Python-based tool designed for efficient web content extraction and interaction with AI models. It fetches webpage data, processes HTML elements, and integrates with locally hosted AI models for advanced content analysis.

Features

  • Automated Web Scraping: Uses BeautifulSoup and requests to extract text and links from websites.
  • AI-Powered Analysis: Integrates with OpenAI's API and locally hosted AI models like llama3.2.
  • Structured Data Extraction: Implements a Website class for efficient content parsing.
  • Error Handling & API Key Management: Ensures secure API usage with environment-based configuration.

Tech Stack

  • Python
  • BeautifulSoup
  • Requests
  • OpenAI API
  • Local AI Models (Llama3.2)

Usage

  1. Clone the repository:

    git clone https://github.com/GarlicDeveloper/Ollama-WebScrapper.git
    
  2. Install dependencies:

    https://ollama.com/
    
  3. Just Install Ollama in your local machine and Run Below Line in powershell

    ollama run llama3.2
    
  4. Run the script:

    Just Shift+Enter all the lines in Jupyter lab
    

Contribution

Feel free to contribute by submitting issues or pull requests!


🚀

About

A Python-based web scraping and API integration tool that fetches, processes, and interacts with online content using BeautifulSoup, requests, and OpenAI's API. Features a structured Website class for extracting page data efficiently. Ideal for automating webpage analysis and AI-powered interactions.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors