document-scanner/README.md at master · Mayan-S/document-scanner

Description:

This C++ application is a document scanner and OCR (Optical Character Recognition) tool built using OpenCV and Tesseract OCR. It allows users to select a document image, automatically detects and extracts the document from the image, applies perspective correction, and then performs OCR to extract the text content. The scanned image and the recognized text are saved in the results folder for future use.

Program In Use:

User Input: The user is prompted to select a document image (1–8).

Image Loading: The selected image is loaded from the Resources/ folder.
Preprocessing: The image is converted to grayscale, blurred, and edges are detected.
Contour Detection: The largest 4-point contour is found and reordered.
Warping & Cropping: The document is warped to a flat, top-down view and cropped.

BEFORE:

AFTER:

OCR: Tesseract extracts text from the scanned image.
Saving Results: The scanned image is saved as Results/output.jpg, and the text is saved as Results/output.txt.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls