
Controlled 3D geometry probing (CLIP & DINOv2)

This repo supports frozen CLIP / DINOv2 features + linear probes, plus two mesh pipelines:

  1. Experiment 1 controlled pipeline: ShapeNetCore/Objaverse meshes → Blender texture controls → RGB, depth, normal, and mask buffers.
  2. Smoke fixtures: synthetic primitives for tiny local pipeline checks.

An optional Blender path (blender/render_dataset.py) remains for high-quality chirality renders using a JSON manifest (no ShapeNet dependency).

Layout

  • src/datasets/ — ModelNet40 index, synthetic cache, rendered Dataset wrappers
  • src/rendering/ — multi-view pyrender backend (RGB / depth / normals)
  • blender/ — headless Blender chirality pipeline + manifest
  • configs/ — YAML / Hydra
  • data/shapenet_hf/ — optional local ShapeNetCore snapshots/extractions
  • data/objaverse/ and data/objaverse_cache/ — optional local Objaverse assets/cache
  • data/synthetic_primitives/ — generated .obj primitives + catalog.json

Setup (Python)

cd "/path/to/CV final project"
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
# PyTorch: pick the wheel that matches your machine from https://pytorch.org
export PYTHONPATH="$PWD"
export CV_PROJECT_ROOT="$PWD"

pyrender / PyOpenGL: requirements.txt installs pyrender from GitHub rather than the PyPI 0.1.45 release so that pip can resolve PyOpenGL>=3.1.7 (the PyPI release pins PyOpenGL 3.1.0, which breaks on Python 3.7+). git must be on your PATH for that requirement line.

If you cannot use git+https, install PyPI pyrender without pulling its OpenGL pin, then upgrade OpenGL:

pip install "pyrender==0.1.45" --no-deps
pip install "PyOpenGL>=3.1.7,<3.2"
pip install trimesh networkx scipy pyglet "Pillow>=10" imageio freetype-py six

Headless rendering: on Linux servers you may need export PYOPENGL_PLATFORM=osmesa (with OSMesa installed) or PYOPENGL_PLATFORM=egl; on macOS the default backend usually works for offscreen pyrender.
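
If you drive pyrender from Python yourself, select the platform before pyrender is imported. A minimal offscreen sketch, assuming an OSMesa (or EGL) build is available on the headless machine:

import os
os.environ.setdefault("PYOPENGL_PLATFORM", "osmesa")  # or "egl"; must be set before importing pyrender

import numpy as np
import trimesh
import pyrender

# Tiny scene: a unit sphere viewed from 3 m along +Z.
scene = pyrender.Scene()
scene.add(pyrender.Mesh.from_trimesh(trimesh.creation.icosphere()))
camera_pose = np.array([
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 3.0],
    [0.0, 0.0, 0.0, 1.0],
])
scene.add(pyrender.PerspectiveCamera(yfov=np.pi / 3.0), pose=camera_pose)

renderer = pyrender.OffscreenRenderer(viewport_width=224, viewport_height=224)
color, depth = renderer.render(scene)   # color: uint8 HxWx3, depth: float32 HxW
renderer.delete()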

1) Legacy ModelNet40 download

ModelNet40 is kept only for legacy validation utilities. It is not the default Experiment 1 source because .off meshes do not provide photorealistic texture controls.

Automatic (may fail if the server blocks automated downloads; if so, follow the manual steps the script prints):

python scripts/download_modelnet40.py

Manual fallback:

  1. Open https://modelnet.cs.princeton.edu/ and download ModelNet40.zip.
  2. Unzip so you have data/modelnet40/train/<category>/*.off and data/modelnet40/test/<category>/*.off.
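
A quick sanity check of the extracted tree (illustrative only; it just counts categories and .off meshes under the layout above):

from pathlib import Path

root = Path("data/modelnet40")
for split in ("train", "test"):
    categories = sorted(p for p in (root / split).iterdir() if p.is_dir())
    n_meshes = sum(len(list(c.glob("*.off"))) for c in categories)
    print(f"{split}: {len(categories)} categories, {n_meshes} .off meshes")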

Optional Blender manifest from the same tree:

python scripts/prepare_modelnet_manifest.py \
  --modelnet-root data/modelnet40 \
  --output data/metadata/modelnet40_manifest.json

2) Synthetic primitives (default experiment)

python scripts/setup_synthetic_primitives.py

Creates data/synthetic_primitives/train/*.obj, val/*.obj, and catalog.json.
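
To confirm the generation succeeded, you can count the outputs. A minimal check that does not assume any particular catalog.json schema beyond it being valid JSON:

import json
from pathlib import Path

root = Path("data/synthetic_primitives")
catalog = json.loads((root / "catalog.json").read_text())
print("catalog entries:", len(catalog))  # top-level length; the exact schema is defined by the setup script
for split in ("train", "val"):
    print(split, "meshes:", len(list((root / split).glob("*.obj"))))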

2b) ShapeNetCore from Hugging Face

After your Hugging Face account has access to the gated ShapeNet/ShapeNetCore repository, log in locally and preprocess a small category subset first:

huggingface-cli login
python scripts/preprocess_assets.py \
  --config configs/exp1_mvp.yaml \
  --shapenet-hf-repo-id ShapeNet/ShapeNetCore \
  --shapenet-hf-local-dir data/shapenet_hf/ShapeNetCore \
  --shapenet-category chair \
  --shapenet-category table \
  --max-objects 50 \
  --overwrite

ShapeNet/ShapeNetCore is stored as per-category synset ZIPs. When --shapenet-category is set, preprocessing downloads only the requested category ZIPs, extracts them into data/shapenet_hf/extracted/, and normalizes the meshes. It writes the standard Experiment 1 asset manifests under data/exp1/manifests/ and normalized GLBs under data/exp1/normalized_assets/, so downstream render planning can keep using:

python scripts/create_render_plan.py \
  --config configs/exp1_mvp.yaml \
  --asset-manifest data/exp1/manifests/assets_normalized.jsonl
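
Before planning, you can peek at what preprocessing wrote. A minimal sketch that only assumes the manifest is JSONL (one JSON record per line); the exact field names are defined by scripts/preprocess_assets.py:

import json
from pathlib import Path

manifest = Path("data/exp1/manifests/assets_normalized.jsonl")
records = [json.loads(line) for line in manifest.read_text().splitlines() if line.strip()]
print(f"{len(records)} normalized assets")
if records:
    print("record fields:", sorted(records[0].keys()))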

Once you have access to the GLB mirror, use the same command with --shapenet-hf-repo-id ShapeNet/shapenetcore-glb and a separate local dir such as data/shapenet_hf/shapenetcore-glb. Use --shapenet-no-download with --shapenet-hf-local-dir to scan an existing snapshot without contacting Hugging Face.

2c) Objaverse Bounded Download

Objaverse access is optional and bounded by config defaults in configs/exp1/paths.yaml (datasets.objaverse.max_objects and datasets.objaverse.max_download_gb). The downloader writes a normal asset manifest that scripts/preprocess_assets.py can consume alongside ShapeNet:

python scripts/download_objaverse_assets.py \
  --config configs/exp1_mvp.yaml \
  --category chair \
  --category table \
  --max-objects 50 \
  --max-download-gb 5
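
To check the bounds actually in effect, read them back from the config. A minimal sketch assuming the keys sit under datasets.objaverse exactly as named above (requires PyYAML):

import yaml

with open("configs/exp1/paths.yaml") as f:
    cfg = yaml.safe_load(f)

bounds = cfg["datasets"]["objaverse"]
print("max_objects:", bounds.get("max_objects"))
print("max_download_gb:", bounds.get("max_download_gb"))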

Then prepare a render plan from configured ShapeNet/Objaverse sources without running Blender:

make exp1-mvp-plan

2d) Bounded Depth/Normal Run

Use configs/exp1_bounded.yaml as the next non-toy scale-up config. It keeps only the core geometry probes:

  • surface_normal_aggregate
  • relative_depth_regions

Before rendering, estimate the render count and storage:

make exp1-bounded-estimate
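
For intuition about what the estimate covers, here is an illustrative back-of-envelope version; the numbers are placeholders, not the script's output:

# Hypothetical counts; substitute the values from your render plan.
n_objects = 50            # assets after preprocessing
views_per_object = 8      # camera views per asset
buffers = 4               # RGB, depth, normal, mask
bytes_per_buffer = 224 * 224 * 4   # one float32 224x224 buffer, as an upper bound

n_renders = n_objects * views_per_object
total_gb = n_renders * buffers * bytes_per_buffer / 1e9
print(f"{n_renders} renders, ~{total_gb:.2f} GB uncompressed")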

Prepare assets and render chunks without launching Blender:

make exp1-bounded-plan

Then render and finish the pipeline:

BLENDER_BIN="/Applications/Blender.app/Contents/MacOS/Blender" \
  bash data/exp1_bounded/manifests/render_chunks/run_blender_chunks.sh

HF_HOME=data/hf_cache PYTHONPATH=. python scripts/run_exp1_pipeline.py \
  --config configs/exp1_bounded.yaml \
  --stages post_render ml

2e) Experiment 1 smoke pipeline

The safe smoke command prepares a tiny synthetic asset catalog, writes an Experiment 1 render plan, exports JSONL render chunks, and creates a Blender shell script. Stages are idempotent; rerunning the command skips outputs that already exist unless --force is passed.

make exp1-smoke

Render the prepared chunks with Blender:

bash data/exp1/manifests/render_chunks/run_blender_chunks.sh

After Blender finishes, build QC, labels, features, probes, aggregated results, and figures:

make exp1-smoke-post

Equivalent direct runner commands:

PYTHONPATH=. python scripts/run_exp1_pipeline.py \
  --config configs/exp1_smoke.yaml \
  --stages smoke_prepare

PYTHONPATH=. python scripts/run_exp1_pipeline.py \
  --config configs/exp1_smoke.yaml \
  --stages post_render ml

Standalone reporting commands:

PYTHONPATH=. python scripts/aggregate_exp1_results.py --config configs/exp1_smoke.yaml
PYTHONPATH=. python scripts/make_exp1_figures.py --config configs/exp1_smoke.yaml

3) Rendered sample schema (ModelNet + synthetic + future corpora)

Each rendered training sample is a flat dict (one view per row when flatten_views=True):

{
    "mesh_path": str,
    "category": str,
    "split": str,
    "view_id": int,
    "object_id": str,
    "dataset": str,  # "modelnet40" | "synthetic_primitives" | ...
    "rgb": np.uint8[H, W, 3],
    "depth": np.float32[H, W],   # linear depth in meters (inf = background)
    "normal": np.float32[H, W, 3],  # camera-space normals in [-1, 1]
}

PyTorch datasets

  • RenderedSyntheticPrimitiveDataset — default controlled renders.
  • RenderedModelNetDataset — ModelNet40 validation / training renders.
  • RenderedMeshDataset — generic wrapper over any list of records with the same keys (e.g. future ShapeNet rows with mesh_path + category + split).

Example:

from src.rendering.mesh_renderer import RenderConfig
from src.datasets import RenderedSyntheticPrimitiveDataset

ds = RenderedSyntheticPrimitiveDataset(
    split="train",
    render_cfg=RenderConfig(image_size=(224, 224), n_views=8, seed=0),
)
sample = ds[0]  # keys: mesh_path, category, split, view_id, rgb, depth, normal, ...
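
Downstream code can derive masks and summary statistics directly from the schema above. A minimal sketch using the documented conventions (inf depth marks background; normals are camera-space in [-1, 1]):

import numpy as np

depth, normal = sample["depth"], sample["normal"]

foreground = np.isfinite(depth)  # inf marks background pixels
mean_depth = depth[foreground].mean() if foreground.any() else float("nan")
print(f"{foreground.mean():.1%} foreground pixels, mean depth {mean_depth:.3f} m")
print("foreground normal range:", normal[foreground].min(), normal[foreground].max())  # documented as [-1, 1]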

PyTorch3D: not required. The active backend is trimesh + pyrender (src/rendering/mesh_renderer.py). Optional PyTorch3D hook lives in src/rendering/pytorch3d_backend.py (stub for you to implement if you install pytorch3d).

Blender (optional) chirality pipeline

  1. Install Blender 3.6+.
  2. Install PyYAML into Blender’s Python (see previous README sections).
  3. Point paths.mesh_manifest in configs/render_config.yaml at data/metadata/modelnet40_manifest.json.
  4. Run ./scripts/render_chirality_dataset.sh.

Feature extraction & probes (unchanged)

./scripts/extract_clip_features.sh
./scripts/extract_dino_features.sh
./scripts/train_clip_probe.sh
./scripts/train_dino_probe.sh

Hydra

python -m src.training.train_probe --config-name=clip_probe probe.epochs=5

Licenses

  • ModelNet (Princeton) — follow their terms for redistribution and citation.
  • CLIP, DINOv2, OpenCLIP — respect each model license in publications.
