Fusion of Transformer and diffusion features for real-world image super-resolution

ملخص البحث

In image super-resolution (SR), the main aim is to improve the quantitative results, such as PSNR and perceptual quality of the image. Some models can improve one or both of them such as the Transformer model and diffusion models. In this paper, we take the merit of both the Transformer and diffusion models to improve the real-world image SR model. So, we designed a fusion strategy for the outcome images of both the Transformer and diffusion. This fusion represents the integration of multiple sources of information or features, creating a more cohesive and comprehensive representation. A new model called the double diffusion image super-resolution (DDiffSR) model is proposed based on fusing both the Transformer and the diffusion model. The transformer model is used to extract a compact image SR prior. Meanwhile, the diffusion model is based on the residual denoise diffusion model to generate high-resolution images. The new DDiffSR model achieved state-of-the-art traditional and real-world image SR results. Also, the model achieved more appealing visual results. For example, our model enhanced the PSNR by 1.85 dB for the Set14 dataset.

مؤلف البحث

Garas Gendy & Nabil Sabor

تاريخ البحث

سبت, 14 يونيو 2025 - 12:00

مجلة البحث

Signal, Image and Video Processing

مؤلف البحث

Nabil Sabor Nafea Kolta

الناشر

Springer

عدد البحث

19, No. 734

سنة البحث

2025

كلية الهندسة

Fusion of Transformer and diffusion features for real-world image super-resolution

آخر الأبحاث

جامعة
اسيوط

روابط هامة

عنواننا

Fusion of Transformer and diffusion features for real-world image super-resolution

آخر الأبحاث

جامعة اسيوط

روابط هامة

عنواننا

جامعة
اسيوط