Skip to content

webfoolery/python-pdf-image-extractor

Repository files navigation

PDF Image Extractor

It's a short Python script to parse PDF files & export the images found inside in their original formats.

Setup

  1. Clone the repo
  2. Create the virtual environment - python3 -m venv env
  3. Start the virtual environment, ie: . env/Scripts/activate (or whatever your system needs to do so)
  4. pip install -r ./requirements.txt
  5. End the virtual environment, ie: deactivate

Usage

  1. Place a PDF file in the same directory as the script
    • Linux: Run_pdf-image.py
    • Windows: As above or run _get-pdf-image.bat
  2. Check in the same directory to find the image/s. Image file names are based upon the original file name with page number, image number & file type appended

CHANGELOG

2022-06-23

  • Adds command line arguments for --single or --multiple (default) images
  • Adds extra .bat file for running single image task

About

Extracts images from PDF files in original format

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published