A simple web scraper that extracts images, links, and videos from any webpage and displays them in a clean, responsive UI. Built using Spring Boot (JSoup) for scraping and HTML, CSS, and JavaScript for the frontend. Lightweight, easy to run, and well suited to learning or testing web scraping.

vivekbargude/Web_Scrapper_Using_Java_Jsoup

🕸️ Web Scraper UI + Spring Boot Backend

A lightweight web-scraping tool that extracts images, links, and videos from any public webpage. The project contains:

  • Spring Boot backend for server-side scraping
  • HTML/CSS/JS frontend that displays the scraped images, links, and videos in the browser
  • Simple, minimal UI

🚀 Features

🔍 Scrapes:

  • All images (<img src="">)
  • All links (<a href="">)
  • All videos (<video> or embedded sources)

🖥️ Frontend Features:

  • Clean UI
  • Section-wise display
  • Automatic rendering using .map()
  • Fully responsive
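
As a rough illustration of rendering with .map(), the sketch below turns each array of scraped URLs into an HTML string. The function names (escapeHtml, renderImages, renderLinks, renderVideos) and the element id in the usage note are hypothetical and not taken from the project's index.html; escaping is added here on the assumption that scraped URLs are untrusted input.

```javascript
// Hypothetical helper: escape text before placing it into HTML attributes or content.
function escapeHtml(value) {
  return String(value)
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&#39;");
}

// Each renderer maps an array of URLs to markup and joins the pieces into one string.
function renderImages(images) {
  return images
    .map((src) => `<img src="${escapeHtml(src)}" alt="" loading="lazy">`)
    .join("");
}

function renderLinks(links) {
  return links
    .map((href) => {
      const safe = escapeHtml(href);
      return `<li><a href="${safe}" target="_blank" rel="noopener">${safe}</a></li>`;
    })
    .join("");
}

function renderVideos(videos) {
  return videos
    .map((src) => `<video src="${escapeHtml(src)}" controls></video>`)
    .join("");
}
```

In a page, the result could be assigned to a container, for example document.getElementById("images").innerHTML = renderImages(data.images), where "images" is an assumed element id.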

⚙️ Backend:

  • Spring Boot REST API

  • /api/scrape?url=... endpoint

  • Returns JSON:

    {
      "images": [],
      "links": [],
      "videos": []
    }
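
A client may want to guard against a response that is missing one of these three keys. The following is a minimal sketch of such a check; normalizeScrapeResult is a hypothetical helper, not part of the project, and it assumes each key should hold an array of URL strings as in the shape above.

```javascript
// Hypothetical helper: coerce any payload into { images, links, videos },
// where each field is an array of strings (possibly empty).
function normalizeScrapeResult(payload) {
  const toList = (value) =>
    Array.isArray(value) ? value.filter((item) => typeof item === "string") : [];
  const source = payload !== null && typeof payload === "object" ? payload : {};
  return {
    images: toList(source.images),
    links: toList(source.links),
    videos: toList(source.videos),
  };
}
```

With this in place, rendering code can iterate over all three fields without checking for undefined first.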

📂 Project Structure

web-scraper/
│
├── backend/              # Spring Boot App
│   ├── controller/
│   ├── service/
│   └── ScraperApplication.java
│
├── frontend/
│   └── index.html        # Your UI
│
└── README.md

🛠️ Setup & Run

1️⃣ Clone the Repository

git clone https://github.com/vivekbargude/Web_Scrapper_Using_Java_Jsoup.git
cd Web_Scrapper_Using_Java_Jsoup

🏗️ Backend Setup (Spring Boot)

Install dependencies

mvn install

Run Spring Boot server

mvn spring-boot:run

This starts the backend on:

http://localhost:8080

🌐 Frontend Setup

Open index.html (in the frontend folder) in any browser, or serve it with an editor extension such as Live Server.


🖼️ Screenshots

1️⃣ Images Section Output

Shows all scraped images as thumbnails in a grid layout.


2️⃣ Links Section Output

Shows clickable links extracted from the page.


3️⃣ Videos Section Output

Shows playable video elements or extracted video URLs.


📡 API Example

Request:

GET http://localhost:8080/api/scrape?url=https://example.com

Response:

{
  "images": ["https://example.com/a.png"],
  "links": ["https://example.com/about"],
  "videos": []
}
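
The request above can also be issued from JavaScript. The sketch below is one way to do it, assuming the backend runs at http://localhost:8080 as described in the setup steps; API_BASE, buildScrapeUrl, and scrape are hypothetical names. It percent-encodes the target address, which matters once the page URL carries its own query string.

```javascript
// Assumed backend address, taken from the setup section.
const API_BASE = "http://localhost:8080";

// Build the endpoint URL; URLSearchParams percent-encodes the target page URL.
function buildScrapeUrl(pageUrl, base = API_BASE) {
  const endpoint = new URL("/api/scrape", base);
  endpoint.searchParams.set("url", pageUrl);
  return endpoint.toString();
}

// Call the endpoint and parse the JSON body; reject on a non-2xx status.
async function scrape(pageUrl) {
  const response = await fetch(buildScrapeUrl(pageUrl));
  if (!response.ok) {
    throw new Error(`Scrape request failed with status ${response.status}`);
  }
  return response.json();
}
```

Calling scrape("https://example.com") returns a promise for the response object. Whether a browser allows the call from a page opened directly from disk depends on the backend's CORS configuration, which is not described here.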
