RSEdit: Text-Guided Image Editing for Remote Sensing
Abstract
In this paper, we explore text-guided image editing in the remote sensing domain using generative modeling. We propose , a collection of models from U-Net to DiT with various configurations. Specifically, we present the first comprehensive study of conditioning strategies for building image editing models from off-the-shelf text-to-image ones. Our experiments show that achieves the best instruction-faithful edits while preserving geospatial structure. We release the code at https://github.com/Bili-Sakura/RSEdit-Preview and checkpoints at https://huggingface.co/collections/BiliSakura/rsedit.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.