Semantic Image Search for Lightroom Classic

Find photos by describing them in plain English.

Instead of scrolling through thousands of images or relying on keywords you remembered to add, just type what you're looking for:

"red Porsche at sunset"
"person laughing at camera"
"foggy mountain landscape"
"crowd cheering at race track"

The system understands the visual content of your images, not just their filenames or metadata.

What This Does

This tool adds a search dialog to Lightroom Classic. You type a description, and it finds matching photos from your library. Results appear as a collection you can browse, rate, or export like any other collection.

It works by analyzing each image and creating a mathematical "fingerprint" of its visual content. When you search, it compares your description against all those fingerprints to find the best matches. This happens in under a second, even with hundreds of thousands of images.

Requirements

Your Computer

Mac with Apple Silicon (M1, M2, M3, or M4) - strongly recommended
At least 24GB of RAM if you want to use Lightroom while searching
Enough disk space for the search index (roughly 1GB per 100,000 images)

Windows and Linux with NVIDIA graphics cards also work, but this guide focuses on Mac.

Your Photo Library

Images must be accessible on a mounted drive (internal, external, or NAS)
Supports JPEG, TIFF, PNG, and most RAW formats (NEF, DNG, CR2, CR3, ARW, RAF, ORF, etc.)
The system reads your actual image files, not Lightroom's previews
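
Before committing to a long indexing run, it can help to know roughly how many files are involved. The following is a minimal sketch, not part of this project: the extension list simply mirrors the formats named above (the indexer's real list may differ), and count_images is a hypothetical helper name.

```python
from collections import Counter
from pathlib import Path

# Assumption: mirrors the formats listed above; the indexer's own list may differ.
IMAGE_EXTENSIONS = {
    ".jpg", ".jpeg", ".tif", ".tiff", ".png",
    ".nef", ".dng", ".cr2", ".cr3", ".arw", ".raf", ".orf",
}


def count_images(root):
    """Return a Counter mapping lowercase extension to file count under root."""
    counts = Counter()
    for path in Path(root).rglob("*"):
        suffix = path.suffix.lower()
        # Check the cheap suffix test first, then confirm it is a regular file.
        if suffix in IMAGE_EXTENSIONS and path.is_file():
            counts[suffix] += 1
    return counts
```

Calling count_images("/Volumes/archive2/images/") (or whatever your archive path is) returns per-extension counts, and sum(counts.values()) gives the total to plug into the time estimates later in this guide.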

How It Works (The Simple Version)

Three pieces work together:

The Index - A database containing the visual "fingerprint" of each image
The Search Server - A background process that handles search requests
The Lightroom Plugin - Adds the search dialog to Lightroom

You build the index once (this takes a while). After that, searching is nearly instant.

Setup Guide

This requires some work in Terminal. Don't worry - you'll copy and paste most commands, and I'll explain what each one does.

Step 1: Install the Required Tools

Open Terminal (find it in Applications > Utilities, or search for it in Spotlight).

First, install Homebrew if you don't have it. This is a tool that makes installing other software easier:

/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"

Follow any instructions it gives you. You may need to enter your password.

Now install the tools we need:

brew install python@3.11 dcraw qdrant

This installs:

Python - A programming language that runs the search system
dcraw - A tool for reading RAW camera files
Qdrant - The database that stores image fingerprints

Step 2: Download This Project

Decide where you want to keep this project. Your home folder is fine:

cd ~
git clone https://github.com/blwfish/LrC-tools.git
cd LrC-tools

(If you received this as a folder instead of from GitHub, just navigate to that folder in Terminal using cd /path/to/folder.)

Step 3: Set Up Python

Create an isolated Python environment for this project, using the Python 3.11 that Homebrew installed in Step 1:

python3.11 -m venv venv
source venv/bin/activate

Your terminal prompt should now show (venv) at the beginning.

Install the required Python packages:

pip install torch torchvision open-clip-torch qdrant-client flask pillow tqdm

This downloads about 2GB of files. It may take a few minutes.

Step 4: Build the Image Index

This is the slow part. The system needs to analyze every image in your library and create its fingerprint. For 100,000 images, expect 8-10 hours. For 500,000 images, expect 2-3 days.

You only do this once. After that, you can add new images incrementally.
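
To get a rough number for your own library, you can scale the figures quoted in this guide. This is only a back-of-the-envelope sketch: it assumes time and disk use grow linearly from the "8-10 hours" and "1GB per 100,000 images" figures, and estimate_indexing is a hypothetical helper, not part of the project.

```python
def estimate_indexing(image_count, hours_per_100k=(8, 10), gb_per_100k=1.0):
    """Scale this guide's rough per-100,000-image figures to image_count."""
    scale = image_count / 100_000
    low_hours = hours_per_100k[0] * scale
    high_hours = hours_per_100k[1] * scale
    return low_hours, high_hours, gb_per_100k * scale


low, high, disk_gb = estimate_indexing(250_000)
print(f"{low:.0f}-{high:.0f} hours, about {disk_gb:.1f} GB of index")
# prints "20-25 hours, about 2.5 GB of index"
```

Linear scaling gives 40-50 hours for 500,000 images, a little under the 2-3 days quoted above, so treat the result as a lower bound for large libraries, slow drives, or RAW-heavy archives.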

Start the fingerprint database:

Open a new Terminal window and run:

qdrant

Leave this window open. Qdrant needs to stay running while you build the index.

Configure and run the indexer:

Back in your first Terminal window, edit the indexing script to point to your images:

nano embed_full_archive.py

Find this line near the top:

ARCHIVE_ROOT = "/Volumes/archive2/images/"

Change the path to wherever your images are stored. Press Ctrl+O to save, then Ctrl+X to exit.

Now start the indexing:

source venv/bin/activate
nohup python embed_full_archive.py >> embed_full.log 2>&1 &

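This runs in the background, so you can close this Terminal window and it will keep going. It will not survive a restart or shutdown, though: if the computer restarts, you will need to start Qdrant and the indexer again. Keep the Qdrant window open for the whole run.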

Check progress:

tail -f embed_full.log

Press Ctrl+C to stop watching the log.

Step 5: Set Up Automatic Startup

You want the search server and database to start automatically when you log in.

For Qdrant, create a startup file:

nano ~/Library/LaunchAgents/com.qdrant.plist

Paste this content:

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
<plist version="1.0">
<dict>
    <key>Label</key>
    <string>com.qdrant</string>
    <key>ProgramArguments</key>
    <array>
        <string>/opt/homebrew/bin/qdrant</string>
    </array>
    <key>RunAtLoad</key>
    <true/>
    <key>KeepAlive</key>
    <true/>
</dict>
</plist>

Save and exit (Ctrl+O, Ctrl+X).

For the search server, the project includes a startup file. Copy it and adjust the paths:

cp com.blw.imagesearch.plist ~/Library/LaunchAgents/
nano ~/Library/LaunchAgents/com.blw.imagesearch.plist

Update any paths to match where you installed the project.
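
If you are unsure which paths matter, the sketch below generates an equivalent LaunchAgent with Python's plistlib so you can compare it against the bundled file. Everything specific here is an assumption: it guesses that the server is started by running search_server.py with the project's venv Python (as in the Troubleshooting section), and build_search_agent is a hypothetical helper. The bundled com.blw.imagesearch.plist may use different keys or arguments.

```python
import plistlib
from pathlib import Path


def build_search_agent(project_dir):
    """Return launchd plist bytes for a hypothetical search-server agent."""
    project = Path(project_dir).expanduser()
    agent = {
        "Label": "com.blw.imagesearch",
        # Assumption: the venv interpreter runs search_server.py directly.
        "ProgramArguments": [
            str(project / "venv" / "bin" / "python"),
            str(project / "search_server.py"),
        ],
        "WorkingDirectory": str(project),
        "RunAtLoad": True,   # start at login
        "KeepAlive": True,   # restart if the process exits
    }
    return plistlib.dumps(agent)


print(build_search_agent("~/LrC-tools").decode("utf-8"))
```

launchd does not expand ~ inside a plist, which is why the sketch resolves the project folder to an absolute path before writing it out.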

Enable both services:

launchctl load ~/Library/LaunchAgents/com.qdrant.plist
launchctl load ~/Library/LaunchAgents/com.blw.imagesearch.plist

Step 6: Install the Lightroom Plugin

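From the project folder, create a link from Lightroom's plugin folder to this project. The first command creates the plugin folder if it doesn't already exist:

mkdir -p ~/Library/Application\ Support/Adobe/Lightroom/Modules
ln -s "$(pwd)/SemanticSearch.lrplugin" \
    ~/Library/Application\ Support/Adobe/Lightroom/Modules/SemanticSearch.lrplugin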

Restart Lightroom Classic.

Using the Search

Open Lightroom Classic and go to the Library module
From the menu: Library > Plug-in Extras > Semantic Search...
Type what you're looking for
Click Search
Results appear in a collection under 0_Semantic_Searches

Search Tips

Be specific: "yellow Corvette on race track" works better than "car"
Describe the scene: "crowd watching fireworks at night" finds those moments
Include context: "person standing on mountain summit with clouds below"
Try variations: If "sunset" doesn't find what you want, try "orange sky" or "golden hour"

Understanding Results

Each result has a similarity score between 0 and 1. In practice:

0.30+ = Strong match, very likely what you're looking for
0.25-0.30 = Good match, worth reviewing
0.20-0.25 = Possible match, might be relevant
Below 0.20 = Weak match, probably not what you want

The default threshold is 0.20. You can adjust this in the search dialog.
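
The bands above can be written down as a small function. This is just an illustration of how the cutoffs relate to the threshold; describe_score is a hypothetical name, and the plugin itself may label results differently or not at all.

```python
def describe_score(score, threshold=0.20):
    """Map a similarity score to the rough bands described above."""
    if score < threshold:
        return "below threshold"   # filtered out of the results
    if score >= 0.30:
        return "strong match"
    if score >= 0.25:
        return "good match"
    if score >= 0.20:
        return "possible match"
    return "weak match"            # only reachable if threshold is set below 0.20


for s in (0.34, 0.27, 0.21, 0.12):
    print(s, describe_score(s))
```

Lowering the threshold below 0.20 lets weak matches through, which can be useful for unusual queries at the cost of more noise in the collection.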

Keeping Your Index Updated

After importing new photos, run the update script to add them to the index:

cd ~/LrC-tools  # or wherever you installed this
source venv/bin/activate
python update_index.py

The updater is smart:

Only scans directories that have changed since the last run
Detects moved files (no re-indexing needed, just updates the path)
Skips files already in the index

For a typical import of a few hundred to a few thousand photos, this takes just a few minutes.
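
For the curious, move detection by content hash can be sketched as follows. This is an illustration of the idea only, not the code in update_index.py: it assumes a full-file SHA-256 and a simple dict as the catalog, and the names content_hash and classify are hypothetical. It also leaves out the directory-timestamp shortcut.

```python
import hashlib
from pathlib import Path


def content_hash(path, chunk_size=1 << 20):
    """SHA-256 of the file's bytes, read in chunks to keep memory use flat."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def classify(paths_on_disk, catalog):
    """Split files into new, moved, and unchanged.

    catalog is a hypothetical dict of content hash -> last known path.
    """
    known_paths = set(catalog.values())
    new, moved, unchanged = [], [], []
    for path in map(str, paths_on_disk):
        if path in known_paths:
            unchanged.append(path)      # already indexed, no hashing needed
            continue
        digest = content_hash(path)
        old_path = catalog.get(digest)
        if old_path is not None and not Path(old_path).exists():
            moved.append((old_path, path))   # same bytes, new location
        else:
            new.append(path)            # unseen content (or a duplicate copy)
    return new, moved, unchanged
```

A moved file only needs its stored path updated, which is why a reorganized folder does not trigger hours of re-embedding.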

Update Options

# Normal update - index new/moved files
python update_index.py

# Only scan a specific year
python update_index.py --year 2025

# Also remove entries for deleted files
python update_index.py --cleanup

# See what would happen without making changes
python update_index.py --dry-run

# Force a complete scan (ignore directory timestamps)
python update_index.py --full-scan

# Verbose output for debugging
python update_index.py --verbose

First-Time Migration

If you're upgrading from the old checkpoint-based system, run the migration first:

python migrate_to_catalog.py

This creates a SQLite catalog at ~/.local/share/photo_tools/catalog.db that tracks:

Which files have been indexed
Content hashes for move detection
Directory timestamps for fast scanning

The migration reads your existing Qdrant data and computes hashes for all indexed files. It takes about 4-5 hours for 500k files, because it is limited by disk I/O.
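
To make the three things the catalog tracks more concrete, here is one way such a SQLite catalog could be laid out. The table and column names are invented for illustration and are not the schema migrate_to_catalog.py actually creates; the sketch uses an in-memory database so it does not touch your real catalog.db.

```python
import sqlite3

# Hypothetical schema: one row per indexed file, one row per scanned directory.
SCHEMA = """
CREATE TABLE IF NOT EXISTS files (
    path         TEXT PRIMARY KEY,
    content_hash TEXT NOT NULL,
    indexed_at   REAL NOT NULL
);
CREATE INDEX IF NOT EXISTS idx_files_hash ON files (content_hash);
CREATE TABLE IF NOT EXISTS directories (
    path  TEXT PRIMARY KEY,
    mtime REAL NOT NULL
);
"""


def open_catalog(db_path=":memory:"):
    """Open (or create) a catalog database and make sure the tables exist."""
    conn = sqlite3.connect(db_path)
    conn.executescript(SCHEMA)
    return conn


def directory_changed(conn, path, current_mtime):
    """True if the directory is unseen or was modified after the recorded scan."""
    row = conn.execute(
        "SELECT mtime FROM directories WHERE path = ?", (path,)
    ).fetchone()
    return row is None or current_mtime > row[0]
```

The index on content_hash is what would make "have I seen these bytes before?" a fast lookup, and the directories table is what would let an update skip folders that have not changed.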

Troubleshooting

"Failed to connect to search server"

The search server isn't running. Try:

# Check if it's running
curl http://localhost:5555/health

# If not, start it manually
cd ~/LrC-tools
source venv/bin/activate
python search_server.py
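
If you prefer to check both background services at once, a small Python script can do the same job as the curl command. This is a sketch under assumptions: the search server URL comes from the check above, while the Qdrant URL assumes Qdrant's default REST port (6333) and will be wrong if you changed it. is_up is a hypothetical helper name.

```python
import http.client
import urllib.error
import urllib.request


def is_up(url, timeout=2.0):
    """Return True if url answers with HTTP 200 within timeout seconds."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Connection refused, timeout, or a non-200 reply all count as "down".
        return False


SERVICES = {
    "search server": "http://localhost:5555/health",  # same URL as the curl check
    "Qdrant": "http://localhost:6333/",               # assumption: default port
}

for name, url in SERVICES.items():
    print(f"{name}: {'up' if is_up(url) else 'not responding'}")
```

If Qdrant is up but the search server is not, start the server manually as shown above; if neither responds, check that both LaunchAgents were loaded in Step 5.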

"No matching images found"

Your images might not be indexed yet (check if indexing is complete)
The images might be stored in a different location than what was indexed
Try a more general search term

Search is slow

The first search after starting the server takes 5-10 seconds while the AI model loads into memory. Subsequent searches should be under a second.

Lightroom doesn't show the plugin

Make sure you restarted Lightroom after installing
Check File > Plug-in Manager - the plugin should appear there
Verify the symbolic link exists: ls -la ~/Library/Application\ Support/Adobe/Lightroom/Modules/

Technical Details (For the Curious)

This system uses CLIP (Contrastive Language-Image Pre-training), an AI model developed by OpenAI that understands both images and text. It was trained on hundreds of millions of image-caption pairs from the internet.

When you index an image, CLIP converts it into a list of 768 numbers (a "vector") that captures its visual essence. When you search, CLIP converts your text query into the same kind of vector. Finding matches is just finding which image vectors are closest to your query vector.

The vector database (Qdrant) is optimized for exactly this kind of "find the nearest neighbors" search, which is why it can search hundreds of thousands of images in milliseconds.
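
The "closest vectors" idea can be shown with a toy example. The sketch below assumes cosine similarity as the distance measure (a common choice for CLIP embeddings, though this guide does not say which metric the index uses) and uses made-up 3-number vectors in place of real 768-number ones. It compares the query against every image in turn, whereas Qdrant uses an approximate nearest-neighbor index (HNSW) to avoid scanning everything.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_matches(query_vector, image_vectors, limit=3, threshold=0.20):
    """Return up to limit (score, name) pairs at or above threshold, best first."""
    scored = [
        (cosine_similarity(query_vector, vector), name)
        for name, vector in image_vectors.items()
    ]
    kept = [item for item in scored if item[0] >= threshold]
    return sorted(kept, reverse=True)[:limit]


# Toy vectors standing in for real embeddings; the values are invented.
images = {
    "sunset.jpg": [0.9, 0.1, 0.0],
    "forest.jpg": [0.0, 0.2, 0.9],
    "beach.jpg": [0.7, 0.6, 0.1],
}
query = [1.0, 0.2, 0.0]

for score, name in top_matches(query, images):
    print(f"{score:.2f}  {name}")
```

Here sunset.jpg ranks first, beach.jpg second, and forest.jpg falls below the threshold. The scores in this toy example are much higher than the 0.20-0.30 range you will see from real searches, because the vectors were hand-picked to be similar.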

Credits

OpenCLIP - Open source CLIP implementation
Qdrant - Vector database
dcraw - RAW file decoder by Dave Coffin
