Image super resolution using deep learning techniques

Başlık:

Personal Author:

El Ballouti, Salah Eddine, author.

Yayın Bilgileri:

[s.l. : s.n.], 2024.

Fiziksel Tanımlama:

xi, 55 leaves : illustrations, tables ; 30 cm + 1 CD-ROM.

Genel Not:

Date of approval: 02.09.2024

Abstract:

Image SR using Deep Learning Techniques has become a critical area of research, with significant progress in improving image quality and detail. This thesis examines and contrasts eight advanced deep learning-based SR methods: CARN, EDSR, ESPCN, RCAN, RDN, SRCNN, SRGAN, and VDSR, using the DIV2K dataset. The evaluation covers multiple aspects to offer a thorough understanding of each method's effectiveness, efficiency, and structure. Performance measurements such as PSNR and SSIM are utilized for evaluating the fidelity of super-resolved images. Computational efficiency is evaluated based on inference time and memory requirements. Training time is analyzed, taking into account the speed of convergence for training on the DIV2K dataset. Model complexity is examined, exploring architectural details such as network depth, and the integration of specialized elements like residual blocks and attention mechanisms. Additionally, the thesis explains in a clear and detailed manner the trade-offs between performance and complexity, discussing whether more complex architectures deliver significantly better results compared to simpler models and whether the computational cost justifies the improvements. Finally, a qualitative comparison is conducted to emphasize the strengths and weaknesses of each technique. Through this comprehensive analysis, this thesis offers insights into the field of deep learning-based image SR, assisting researchers and practitioners in choosing the most appropriate method for various applications.

Derin Öğrenme Teknikleri kullanarak Görüntü Süper Çözünürlüğü (SR), görüntü kalitesini ve detayını geliştirmede önemli bir araştırma alanı haline gelmiştir. Bu tez, CARN, EDSR, ESPCN, RCAN, RDN, SRCNN, SRGAN ve VDSR olmak üzere sekiz gelişmiş derin öğrenme tabanlı SR yöntemini, DIV2K veri kümesini kullanarak incelemekte ve karşılaştırmaktadır. Değerlendirme, her yöntemin etkinliğini, verimliliğini ve yapısını tam anlamıyla anlamak için birden fazla yönü kapsamaktadır. Süper çözünürlüklü görüntülerin kalitesini değerlendirmek için tepe Sinyal-Gürültü Oranı (PSNR) ve Yapısal Benzerlik İndeksi (SSIM) gibi performans ölçümleri kullanılmaktadır. Hesaplama verimliliği, çıkarım süresi ve bellek gereksinimleri temel alınarak değerlendirilmiştir. Eğitim süresi, DIV2K veri kümesinde eğitim için yakınsamanın hızı göz önünde bulundurularak analiz edilmiştir. Model karmaşıklığı incelenmiş, ağ derinliği gibi mimari detaylar ve kalıntı blokları ve dikkat mekanizmaları gibi özel unsurların entegrasyonu gibi konular araştırılmıştır. Ek olarak bu tezde, performans ve karmaşıklık arasındaki dengeyi net ve detaylı bir şekilde ortaya koyduk; daha karmaşık mimarilerin daha basit modellere kıyasla önemli ölçüde daha iyi sonuçlar verip vermediğini ve hesaplama maliyetinin geliştirmeleri haklı çıkarıp çıkarmadığını inceledik. Son olarak, her tekniğin güçlü ve zayıf yönlerini vurgulamak için nitel bir karşılaştırma yaptık. Bu kapsamlı analiz aracılığıyla, bu tez, araştırmacıların ve uygulayıcıların çeşitli uygulamalar için en uygun yöntemi seçmelerine yardımcı olarak, derin öğrenme tabanlı görüntü süper çözünürlüğü alanında içgörüler sunmaktadır.

Subject Term:

Dissertations, Academic.

Added Author:

Ek Kurum Yazarı:

M.S. in Computer Engineering. Thesis.

Added Uniform Title:

Thesis (Master) -- Işık University: Graduate School for Science and Engineering.