Near-perfect photo-ID of the Hula painted frog with zero-shot deep local-feature matching

This repository implements a non-invasive photo-identification system for individual Hula painted frogs (an endangered species), using deep learning-based local and global feature matching. The system achieves 98% accuracy in closed-set individual frog identification using zero-shot deep local-feature matching (ALIKED+LightGlue).

Paper: arXiv:2601.08798
Key finding: Zero-shot local feature matching significantly outperforms fine-tuned global embedding models for individual frog re-identification.

System Overview

Dataset: 1,232 ventral images from 191 endangered Hula painted frogs
Best approach: Two-stage pipeline (MiewID global + ALIKED+LightGlue local)
Key models evaluated: MiewID, MegaDescriptor, ALIKED+LightGlue, SIFT+LightGlue
Application: Practical, non-invasive field monitoring for conservation

Quick Start

1. Setup Environment

# Create conda environments (if not already done)
conda env create -f requirements.txt -n glue

# Activate environment for preprocessing and evaluation
conda activate glue

2. Download Data

Dataset: Zenodo: 20026776
1,232 ventral photographs from 191 Hula painted frogs (labeled/unlabeled splits)

# Create symlink to Zenodo data directory
ln -s /path/to/zenodo/zenodo_data data

3. Run Preprocessing

Preprocess images using MegaDetector (bbox cropping) and SAM (masking):

conda activate glue
./preprocessing.sh

Output CSVs:

data/labeled_bbox.csv — paths to bbox-cropped images
data/labeled_mask.csv — paths to SAM-masked images

4. Reproduce Paper Results

Run all evaluations (takes ~2–4 hours on GPU):

conda activate glue
./run_experiments.sh

Output: evaluation_results.md with all model metrics, figures saved to results/

File Structure

LatoniaReIDpaper/
├── README.md                          # This file
├── preprocessing.sh                   # Bbox cropping + masking pipeline
├── run_experiments.sh                 # Full evaluation pipeline
├── crop.py                            # MegaDetector bbox cropping (CSV-based)
├── masking.py                         # SAM masking (CSV-based)
├── train_set.csv                      # Training split (1,000 labeled images)
├── validation_set.csv                 # Validation split (232 labeled images)
│
├── evaluate.py                        # Single-model evaluation
├── evaluate_twostage.py               # Two-stage evaluation
├── compare_performance.py             # Generate result tables/plots
├── openset.py                         # Open-set analysis (PR curves)
│
├── batch_prediction/
    ├── batch_predict.py               # Generate predictions for unlabeled data
    ├── batch_predictions_stats.py     # Analyze expert review statistics
    └── batch_prediction_app.py        # Gradio UI for expert review

Note: Data directory (data/) and preprocessed results are created locally via preprocessing.sh and are not committed. Download the dataset from Zenodo and set up as a symlink.

Usage

Single-Model Evaluation

# Global models (use bbox-cropped images)
python evaluate.py MegaDescriptor-L-224 cosine --val_csv data/labeled_bbox.csv --device cuda

# Local models (use SAM-masked images)
python evaluate.py aliked lightglue --val_csv data/labeled_mask.csv --device cuda

# With checkpoint
python evaluate.py miewid-msv3 cosine \
  --checkpoint checkpoints/miewid-msv3-latonia-1232.pt \
  --val_csv validation_set.csv --device cuda

Two-Stage Evaluation

Combine a global model (stage 1) with local matching (stage 2):

python evaluate_twostage.py miewid-msv3 aliked \
  --stage1_csv validation_set.csv \
  --stage2_csv data/labeled_mask.csv \
  --checkpoint1 checkpoints/miewid-msv3-latonia-1232.pt \
  --device cuda --top_k 100

Batch Prediction (Unlabeled Data)

Generate predictions for unlabeled images and review results:

# Generate predictions
python batch_prediction/batch_predict.py \
  --unlabeled_csv data/unlabeled_mask.csv \
  --labeled_csv data/labeled_mask.csv \
  --output batch_predictions.json

# Analyze expert review statistics
python batch_prediction/batch_predictions_stats.py --input batch_predictions.json

# Review predictions interactively with Gradio UI
python batch_prediction/batch_prediction_app.py
# Open http://localhost:7860 in browser

Expected Results

Closed-set evaluation on 191 frogs (1,232 labeled images):

Model	Method	Top-1 Accuracy
ALIKED+LightGlue	Zero-shot local feature matching	99.8% ✓
Two Stage	Two-stage pipeline	99.0% ✓
MiewID-msv3 (finetuned)	Global embedding (trained)	62.1%
MiewID-msv3 (zero-shot)	Global embedding (zero-shot)	10.5%

Training

To finetune MiewID on Hula painted frog data:

conda activate Latonia  # Different environment for training

python train.py \
  --train_csv train_set.csv \
  --val_csv validation_set.csv \
  --checkpoint path/to/miewid-msv3 \
  --margin 0.4 --scale 30 \
  --batch_size 24 --epochs 100

Note: Global embedding models (like MiewID) are less effective than zero-shot local matching for this task. The paper demonstrates that ALIKED+LightGlue achieves 99% accuracy without any frog-specific training, making finetuning unnecessary.

Citation

If you use this work, please cite the paper:

@article{Yesharim2026,
  title={Near-perfect photo-ID of the Hula painted frog with zero-shot deep local-feature matching},
  author={Maayan Yesharim and R. G. Bina Perl and Uri Roll and Sarig Gafny and Eli Geffen and Yoav Ram},
  journal={arXiv preprint arXiv:2601.08798},
  year={2026}
}

arXiv: https://arxiv.org/abs/2601.08798

License

This project is licensed under the MIT License — see the LICENSE file for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Near-perfect photo-ID of the Hula painted frog with zero-shot deep local-feature matching

System Overview

Quick Start

1. Setup Environment

2. Download Data

3. Run Preprocessing

4. Reproduce Paper Results

File Structure

Usage

Single-Model Evaluation

Two-Stage Evaluation

Batch Prediction (Unlabeled Data)

Expected Results

Training

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 183 Commits
batch_prediction		batch_prediction
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
compare_performance.py		compare_performance.py
config.py		config.py
crop.py		crop.py
datasets.py		datasets.py
evaluate.py		evaluate.py
evaluate_twostage.py		evaluate_twostage.py
evaluation_results.md		evaluation_results.md
image_transform.py		image_transform.py
masking.py		masking.py
metrics.py		metrics.py
models.py		models.py
openset.py		openset.py
preprocessing.sh		preprocessing.sh
requirements.txt		requirements.txt
run_experiments.sh		run_experiments.sh
similarities.py		similarities.py
split.py		split.py
test_glue_pair.py		test_glue_pair.py
train.py		train.py
train_set.csv		train_set.csv
utils.py		utils.py
validation_set.csv		validation_set.csv

Folders and files

Latest commit

History

Repository files navigation

Near-perfect photo-ID of the Hula painted frog with zero-shot deep local-feature matching

System Overview

Quick Start

1. Setup Environment

2. Download Data

3. Run Preprocessing

4. Reproduce Paper Results

File Structure

Usage

Single-Model Evaluation

Two-Stage Evaluation

Batch Prediction (Unlabeled Data)

Expected Results

Training

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages