GenDR

: Lightning Generative Detail Restorator

ByteDance

Abstract

Recent research applying text-to-image (T2I) diffusion models to real-world super-resolution (SR) has achieved remarkable success. However, fundamental misalignments between T2I and SR targets result in a dilemma between inference speed and detail fidelity. Specifically, T2I tasks prioritize multi-step inversion to synthesize coherent outputs aligned with textual prompts and shrink the latent space to reduce generating complexity. Contrariwise, SR tasks preserve most information from low-resolution input while solely restoring high-frequency details, thus necessitating sufficient latent space and fewer inference steps.

To bridge the gap, we present a one-step diffusion model for generative detail restoration, GenDR, distilled from a tailored diffusion model with larger latent space. In detail, we train a new SD2.1-VAE16 (0.9B) via representation alignment to expand latent space without enlarging the model size. Regarding step-distillation, we propose consistent score identity distillation (CiD) that incorporates SR task-specific loss into score distillation to leverage more SR priors and align the training target. Furthermore, we extend CiD with adversarial learning and representation alignment (CiDA) to enhance perceptual quality and accelerate training. We also polish the pipeline to achieve a more efficient inference. Experimental results demonstrate that GenDR achieves state-of-the-art performance in both quantitative metrics and visual fidelity.

Motivation

Divergent task objectives between T2I and SR make dilemma of slow inference speeds and detail fidelity. T2I task (generation > reconstruction) bridges the huge gap between initial distribution (noise) to target, thus preferring multi-steps (better refinement) and narrow latent space (less difficulty) to make results reasonable. SR task (reconstruction > generation) restores only details from adjacent distribution (LQ), needing fewer steps and high-dimensional space.

Motivation

BibTeX

@article{wang2025gendr,
  title={GenDR: Lightning Generative Detail Restorator},
  author={Wang, Yan and Zhao, Shijie and Chen, Kai and Zhang, Kexin and Li, Junlin and Zhang, Li},
  journal={arXiv preprint arXiv:2503.06790},
  year={2025}
}