ComfyUI-Pixal3D

ComfyUI custom node for Pixal3D — Tencent's SIGGRAPH 2026 single-image to PBR-textured-3D pipeline — on Windows with RTX 30/40/50 GPUs. Runs standalone in ComfyUI's main Python (no extra worker env needed); optionally piggybacks on ComfyUI-TRELLIS2's pixi env if you already have it.

One image → textured PBR mesh in ~3-5 min on an RTX 5090.

_{Four base-color views of the generated PBR mesh.}

_{Same mesh, untextured clay shading — to read the geometry quality.}

⚠️ License you inherit

Pixal3D is licensed by Tencent for academic / non-commercial use only, and explicitly NOT for use within the European Union. By installing this plugin you agree to those terms. See NOTICE.md.

Will it work on my machine?

Short version: if you're on ComfyUI Desktop with an RTX 30/40/50 (Python 3.12 + Torch 2.8 + CUDA 12.8 — the Desktop defaults), this will work standalone.

	Requirement
OS	Windows 10 / 11 (x64) — bundled natten wheel is Windows-only
GPU	NVIDIA RTX 30 / 40 / 50 with ≥ 16 GB VRAM (24 GB+ recommended; the bundled workflows ship with `1024_cascade` + 16/16/16 steps, see the Note node in each for low-VRAM tweaks)
Disk	~50 GB free (24 GB Pixal3D + 4 GB other models + workspace)
CPU	Any modern x86_64 — Intel and AMD both work, no special requirements
Python	3.12 only (ComfyUI Desktop's `.venv` is 3.12 — bundled wheel is cp312)
PyTorch	2.8.x + CUDA 12.8 (Desktop's `.venv` matches by default — wheel is built against torch 2.8.0+cu128)
ComfyUI	Desktop (or portable). ComfyUI-TRELLIS2 is optional — if installed, the installer drops deps into TRELLIS2's pixi env; otherwise it installs into ComfyUI's `.venv`.

Will the wheel work for me?

The bundled wheels/natten-0.21.0+winsm89ptx-...-win_amd64.whl is locked to Windows + Python 3.12 + PyTorch 2.8 + CUDA 12.8 + NVIDIA GPU. If your setup matches all of the above (which the standard ComfyUI-TRELLIS2 pixi env does), the wheel just works.

If any one of those doesn't match (you're on Linux / Python 3.11 / PyTorch 2.7 / etc.), you need to install natten yourself for your env first, then run install.py — it will auto-detect your natten and skip the bundled wheel. Two options:

Linux: pip install natten==0.21.0 -f https://whl.natten.org (official prebuilt wheels for many cu/torch combos), then run install.py.
Windows with a non-default Python/PyTorch/GPU: build from source per docs/BUILD_NATTEN.md, then run install.py. The installer probes natten.HAS_LIBNATTEN + a real na2d call on cuda — if your wheel works, it's kept and the bundled one is skipped.

AMD / Intel GPUs are not supported — upstream Pixal3D requires CUDA.

Install

The fastest path is ComfyUI Manager — search for ComfyUI-Pixal3D, click Install, restart. The plugin's install.py runs automatically and installs all deps into ComfyUI's .venv (no extra worker env required).

For a manual clone:

# 1. Open the custom_nodes folder
cd $HOME\Documents\ComfyUI\custom_nodes

# 2. Clone this repo
git clone https://github.com/dreamrec/ComfyUI-Pixal3D.git

# 3. Run the installer with ComfyUI's Python
cd ComfyUI-Pixal3D
& "$HOME\Documents\ComfyUI\.venv\Scripts\python.exe" install.py

# 4. Restart ComfyUI Desktop

What install.py does, in ~30 seconds:

Picks a target Python: if ComfyUI-TRELLIS2 (pozzettiandrea's fork) is installed alongside, drops deps into its pixi worker env; otherwise installs into the calling Python (your ComfyUI .venv).
Clones TencentARC/Pixal3D at a pinned commit into _pixal3d_src/.
Installs MoGe + utils3d + pyrender + PyOpenGL into the target env.
Installs the bundled natten wheel from wheels/.
Patches Pixal3D's BiRefNet for the Windows inference_mode interaction.
Sanity-checks all imports.

On first queue, ComfyUI will download ~26 GB of model weights from HuggingFace (one-time, cached): Pixal3D weights (24 GB) + DINOv3 (1.2 GB) + MoGe-2 (1.3 GB) + BiRefNet (0.44 GB). Cold-start with download takes ~30 min on a fast connection; subsequent runs use the cache.

Use

After install + restart, three nodes appear in the Add Node menu under Pixal3D:

The only one you need is Pixal3D: Image to Mesh. Drop in an image, queue, get a GLB.

Two workflows are bundled in `workflows/`

File	Use when
`pixal3d_image_to_mesh.json`	Default. Internal BiRefNet does background removal automatically.
`pixal3d_image_to_mesh_with_external_rembg.json`	You want a better matte than BiRefNet (RMBG-2.0, SAM, manual). Connects a mask into the node, which skips internal background removal.

Load either via Workflow → Browse. Drop your image into the LoadImage node, hit Queue.

GLBs are auto-saved to ComfyUI/output/pixal3d_<timestamp>_<seed>.glb with PNG-textures (open in Blender / Three.js / any standard viewer).

Want an OBJ too? Set the save_obj widget to True (the bundled demos already do this in v0.1.10+) and you'll get pixal3d_<timestamp>_<seed>.obj + .mtl + base-color PNG written alongside the GLB. OBJ is base-color-only (the format has no standard metallic/roughness slots) — use it for DCC tools that prefer it; keep the GLB as the canonical PBR artifact.

Full parameter reference for all three nodes lives in docs/NODES.md.

Troubleshooting

Error	Fix
`Repository Not Found for url: https://huggingface.co/ckpts/...`	You're on v0.1.6. Update to ≥ v0.1.7 — the 404 was a misleading wrapper around an `mmgp` complex-dtype `KeyError`, fixed in v0.1.7.
`Input type (torch.cuda.FloatTensor) and weight type (torch.FloatTensor) should be the same`	You're on v0.1.7 and toggled `low_vram` between queue runs. Update to ≥ v0.1.8 — cache-hit device-resync fix.
`Inference tensors do not track version counter` mid-run	You're on an older version of this plugin. `git pull` and re-run `install.py`. (Fixed by wrapping `run_pixal3d` in `torch.inference_mode(False)`.)
Thin black lines on the textured mesh	Set the `background_color` widget on the node to `gray` (the default). If you're on an old saved workflow it may still have `black` — re-create the node from the menu.
`No module named 'pyrender'` / `'moge'` or PyOpenGL ctypes error	Target env missing render deps. Re-run `install.py` with the SAME Python that ComfyUI uses (Desktop: `.venv\Scripts\python.exe`).
`OSError: We couldn't connect to 'https://hf-mirror.com'`	Your environment has `HF_ENDPOINT` set to the Chinese mirror. The plugin overrides this internally; if you still hit it, restart ComfyUI after install.
`OutOfMemoryError` / `Allocation on device`	Drop `max_num_tokens` to 32768 or 24576, optionally enable `low_vram` on the node.
Blender refuses to open the GLB (`STB cannot decode image data`)	You have an old GLB from before this fix. Re-run; new GLBs use PNG textures.
Workflow JSON rejected with widget-index errors	You saved the workflow from an old plugin version. Delete the `Pixal3DImageToMesh` node and add a fresh one from the menu.

Memory + performance

Two runtime modes, two ceilings. v0.1.7+ runs Pixal3D in-process inside ComfyUI Desktop's .venv (standalone mode), which means comfy-aimdo is loaded and reserves a 16 GB cudaMallocAsync cast buffer on top of Pixal3D's own weights + activations. Numbers below are for standalone mode; the legacy TRELLIS2 worker-env path is ~16 GB lighter at every setting.

Setting	VRAM peak (standalone)	Time	Quality
`1536_cascade` + steps 16/16/16 + 49k tokens + 300k decim + 4096 tex + low_vram=true (bundled workflow defaults, v0.1.10+)	~32 GB	~65-70 s warm	Best quality — verified on RTX 5090
`1024_cascade` + steps 16/16/16 + 32k tokens + 300k decim + 4096 tex + low_vram=true	~17 GB	~3-5 min warm	High-fidelity safe — works on 16-24 GB cards
Same `1024_cascade` row + low_vram=false (32 GB+ cards)	~28-30 GB	~50-65 s warm	Fastest 1024 run, RTX 5090-class only
`1024_cascade` + steps 12/12/12 + 32k tokens + 200k decim + 2048 tex + low_vram=true	~14 GB	~3 min warm	Balanced for 16 GB cards
`1024_cascade` + steps 8/8/8 + 16k tokens + low_vram=true	~10 GB	~1.5-2 min warm	Preview / tight cards

The 1536_cascade unlock (v0.1.10+)

1536_cascade was previously documented as "OOMs on ≤34 GB cards" because peak VRAM with low_vram=false overshoots the 5090's 34 GB ceiling. That's only true in eager-placement mode. With low_vram=true flipped on, upstream Pixal3D moves the 4× DinoV3 image_cond_model extractors to CPU between stages — which frees enough headroom for the shape-1536 NAF attention. Verified on RTX 5090:

3 back-to-back successful runs in 1536_cascade + low_vram=true + 49k tokens + 4096 tex, each completing in 64-71 seconds warm.
Peak allocated ~32 GB, peak reserved ~34 GB — fits with very little headroom but does not crash.

If you have a 24 GB card and want to try 1536, drop max_num_tokens to 32768 first; if it OOMs, fall back to 1024_cascade.

Other non-obvious VRAM facts

keep_warm widget (v0.1.4+): the Pixal3D pipeline is ~14 GB resident once loaded; keep_warm=True (default) leaves it in VRAM so the next call is ~3 min, keep_warm=False auto-frees it at the end of the run (next call pays the ~7-10 min cold-load again).
Cold-load tax: the first run after a ComfyUI restart spends 1-3 min loading Pixal3D weights into RAM and another 30-60 s transferring to GPU. Subsequent runs hit the cached singleton.
The standalone-mode comfy-aimdo tax: in-process mode loads mmgp and pre-allocates a 16 GB cast buffer for fp/bf casts. This is why 16/16/16 + 65k tokens + low_vram=false (the old v0.1.3 defaults that ran ~14 GB peak in the worker env) now OOMs at ~30 GB on a 5090. v0.1.9+ demos default to low_vram=true which makes both 1024_cascade and (in v0.1.10+) 1536_cascade fit comfortably.
VRAM fragmentation after many warm runs: we've measured 1024_cascade + low_vram=false + 49k tokens succeeding 10 times in a row then OOMing on the 11th. The allocator gradually loses contiguous space; the next OOM-free run is whatever the allocator can defragment. If you hit this, queue Pixal3D: Free Pipeline between batches or restart ComfyUI.

Upstream roadmap

An experimental fork at visualbruno/ComfyUI-Trellis2#pixal3d is iterating on further VRAM-fit improvements (not yet upstreamed to pozzettiandrea/ComfyUI-TRELLIS2):

use_tiled_decoder widget — tiles the high-res DinoV3 inference so peak VRAM drops further. With v0.1.10+ we already fit 1536_cascade on 34 GB cards via low_vram=true; tiled decoder would unlock it on 24 GB cards too.
pipeline_type expanded to ["512", "1024", "1024_cascade", "1536_cascade"] — adds lighter modes for 12-16 GB cards.
Per-stage memory load/unload — interleaved offload between sampler stages, slimming peak VRAM further.
Standard natten-0.21.6 wheel bundled — our custom 60 MB natten-0.21.0+winsm89ptx becomes redundant.

When that branch merges, this plugin will adopt the new knobs in a follow-up release.

Credits + license

Wrapper code (this repo): MIT, dreamrec 2026.

The actual research / model work belongs to:

Pixal3D — Tencent ARC + Tsinghua, SIGGRAPH 2026. Tencent license — academic only, no EU use.
TRELLIS.2 + ComfyUI-TRELLIS2 — Microsoft Research + pozzettiandrea (MIT).
NATTEN — SHI-Labs (MIT).
NAF — valeoai (Apache 2.0).
MoGe — Microsoft Research (MIT).
BiRefNet — ZhengPeng7 (MIT).

Full third-party license breakdown in NOTICE.md.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.github/workflows		.github/workflows
docs		docs
examples		examples
nodes		nodes
patches		patches
wheels		wheels
workflows		workflows
.gitignore		.gitignore
LICENSE		LICENSE
NOTICE.md		NOTICE.md
README.md		README.md
__init__.py		__init__.py
install.py		install.py
prestartup_script.py		prestartup_script.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ComfyUI-Pixal3D

⚠️ License you inherit

Will it work on my machine?

Will the wheel work for me?

Install

Use

Two workflows are bundled in `workflows/`

Troubleshooting

Memory + performance

The 1536_cascade unlock (v0.1.10+)

Other non-obvious VRAM facts

Upstream roadmap

Credits + license

About

Uh oh!

Releases 11

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ComfyUI-Pixal3D

⚠️ License you inherit

Will it work on my machine?

Will the wheel work for me?

Install

Use

Two workflows are bundled in workflows/

Troubleshooting

Memory + performance

The 1536_cascade unlock (v0.1.10+)

Other non-obvious VRAM facts

Upstream roadmap

Credits + license

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 11

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Two workflows are bundled in `workflows/`

Packages