layout	fancy_home
permalink	/publications-visual/
title	Research Atlas

Curated Research Map

Systems, Agents, and Low-Level Vision

Three streams drive my current research. AIGC systems push structured creativity, vision-language agents reason over complex scenes, and low-level restoration builds the reliable backbone beneath both. The gallery below highlights representative works in each space.

AIGC Vision-Language Agents Low-Level Restoration

Top-tier venues
ICLR / CVPR / ICCV / AAAI

Research axes
AIGC · VLM · Low-Level

∞

Creativity
Task-driven design

AIGC Systems

Structure-Aware Poster Generation

Unified workflows for controllable poster generation, combining task distillation, reward modeling, and data curation.

Flagship Venues ICLR'26 · arXiv'26

Poster Design · arXiv 2026

PosterOmni — Generalized Artistic Poster Creation via Task Distillation and Unified Reward Feedback

A generalist system that distills local editing, global layout, and reward feedback into a single controllable workflow with public demos and datasets.

Paper Code Project

ICLR 2026 · Unified Diffusion

PosterCraft — Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Accepted by ICLR'26, PosterCraft couples layout planning and stylized diffusion to translate prompts directly into production-ready posters with consistent typography.

Paper Code Project

Evaluation · arXiv 2025

An Empirical Study of GPT-4o Image Generation Capabilities

A 360° evaluation of GPT-4o image generation, benchmarking fidelity, controllability, and safety to guide industrial adoption.

Paper Project

Vision-Language & Agents

Reasoning-Centric Restoration Agents

MLLMs and autonomous agents coordinate perception, planning, and feedback loops for adverse-weather driving and iterative data engines.

Flagship Venues CVPR'25

CVPR 2025 · Multi-Model Feedback

SnowMaster — Comprehensive Real-world Image Desnowing via MLLM with Multi-Model Feedback Optimization

Combines reasoning from multiple language-vision experts with visual priors to schedule desnowing operations that adapt to scene structure.

Paper Project

CVPR 2025 · Autonomous Driving

JarvisIR — Elevating Autonomous Driving Perception with Intelligent Image Restoration

An intelligent restoration agent that reasons via MLLM dialogs, calling specialized tools to deweather automotive perception stacks.

Paper Project

CVPR 2025 · Data Engines

Detect Any Mirrors — Boosting Learning Reliability with an Iterative Data Engine

Uses agentic feedback and large-scale pseudo-labeling to tackle mirror detection, providing a blueprint for general VLM data refinement.

Paper Project

Low-Level Vision

Physics-aware Restoration Pipelines

Pairing generative priors with physical constraints produces reliable de-weathering, low-light enhancement, and deraining solutions.

Flagship Venues ICCV'25 · AAAI'25

ICCV 2025 · Controllable Haze

GenHaze — One-step Controllable Haze Generation for Real-World Dehazing

Provides a generative counterpart to dehazing by synthesizing paired data with explicit physical controls, enabling better restoration agents.

Paper

AAAI 2025 · Prompted Dehazing

PromptHaze — Prompting Real-world Dehazing via Depth Anything

Aligns promptable depth priors with restoration networks, achieving plug-and-play robustness across weather distributions.

Paper

arXiv 2024 · Low-Light Diffusion

AGLLDiff — Guiding Diffusion Models for Training-Free Low-Light Enhancement

Introduces adaptive guidance so diffusion models can enhance low-light scenes without paired training data.

Paper Project

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Systems, Agents, and Low-Level Vision

Structure-Aware Poster Generation

PosterOmni — Generalized Artistic Poster Creation via Task Distillation and Unified Reward Feedback

PosterCraft — Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

An Empirical Study of GPT-4o Image Generation Capabilities

Reasoning-Centric Restoration Agents

SnowMaster — Comprehensive Real-world Image Desnowing via MLLM with Multi-Model Feedback Optimization

JarvisIR — Elevating Autonomous Driving Perception with Intelligent Image Restoration

Detect Any Mirrors — Boosting Learning Reliability with an Iterative Data Engine

Physics-aware Restoration Pipelines

GenHaze — One-step Controllable Haze Generation for Real-World Dehazing

PromptHaze — Prompting Real-world Dehazing via Depth Anything

AGLLDiff — Guiding Diffusion Models for Training-Free Low-Light Enhancement

FilesExpand file tree

publications_visual.md

Latest commit

History

publications_visual.md

File metadata and controls

Systems, Agents, and Low-Level Vision

Structure-Aware Poster Generation

PosterOmni — Generalized Artistic Poster Creation via Task Distillation and Unified Reward Feedback

PosterCraft — Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

An Empirical Study of GPT-4o Image Generation Capabilities

Reasoning-Centric Restoration Agents

SnowMaster — Comprehensive Real-world Image Desnowing via MLLM with Multi-Model Feedback Optimization

JarvisIR — Elevating Autonomous Driving Perception with Intelligent Image Restoration

Detect Any Mirrors — Boosting Learning Reliability with an Iterative Data Engine

Physics-aware Restoration Pipelines

GenHaze — One-step Controllable Haze Generation for Real-World Dehazing

PromptHaze — Prompting Real-world Dehazing via Depth Anything

AGLLDiff — Guiding Diffusion Models for Training-Free Low-Light Enhancement