Skip to content

D-Robotics-AI-Lab/RSGen

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RSGen: Enhancing Layout-Driven Remote Sensing Image Generation with Diverse Edge Guidance

Xianbao Hou1, 2, ∗, Yonghao He2, ∗, ‡, Zeyd Boukhers3, John See4, Hu Su5, Wei Sui2, †, Cong Yang1, †

1 Soochow University, 2 D-Robotics, 3 Fraunhofer Institute for Applied Information Technology, 4 School of Mathematical and Computing Sciences, 5 Institute of Automation, Chinese Academy of Sciences

Equal Contribution, Corresponding Author, Project Lead

Paper on Arxiv

Abstract

Diffusion models have significantly mitigated the impact of annotated data scarcity in remote sensing (RS). Although recent approaches have successfully harnessed these models to enable diverse and controllable Layout-to-Image (L2I) synthesis, they still suffer from limited fine-grained control and fail to strictly adhere to bounding box constraints. To address these limitations, we propose RSGen, a plug-and-play framework that leverages diverse edge guidance to enhance layout-driven RS image generation. Specifically, RSGen employs a progressive enhancement strategy: 1) it first enriches the diversity of edge maps composited from retrieved training instances via Image-to-Image generation; and 2) subsequently utilizes these diverse edge maps as conditioning for existing L2I models to enforce pixel-level control within bounding boxes, ensuring the generated instances strictly adhere to the layout.

Overview

RSGen consists of two key components: the Edge2Edge module, designed for generating diverse edge maps, and the L2I FGControl module, which incorporates edge guidance to ensure accurate layout alignment. Together, these components address the challenges of limited diversity and spatial misalignment in remote sensing image generation.

Getting Started

step 1. Refer to install.md to install the environment.

step 2. Refer to datasets.md to prepare DIOR-RSVG, DOTA, HRSC2016 datasets.

step 3. Refer to train.md for training.

step 4. Refer to eval.md for evaluation.

TODOs

  • Release the paper on arXiv.
  • Release the FGControl code. (FICGen)
  • Release the complete code.
  • Release generated images by RSGen.

Contact

If you have any questions about this paper or code, feel free to email me at xbhou2024@stu.suda.edu.cn.

Acknowledgements

Our work is based on stable CC-Diff, MIGC, FICGen, We appreciate their excellent contributions for Layout-to-Image generation.

Citation

@article{hou2026rsgen,
  title={RSGen: Enhancing Layout-Driven Remote Sensing Image Generation with Diverse Edge Guidance},
  author={Hou, Xianbao and He, Yonghao and Boukhers, Zeyd and See, John and Su, Hu and Sui, Wei and Yang, Cong},
  journal={arXiv preprint arXiv:2603.15484},
  year={2026}
}

About

RSGen: Enhancing Layout-Driven Remote Sensing Image Generation with Diverse Edge Guidance

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages