주요 메뉴 바로가기 보조 메뉴 바로가기 본문 바로가기

콘텐츠 본문

논문 국내 국내전문학술지(KCI급) Enhancing Image Compression with Foveal Vision: A Multi-Focus FPSNR Assessment and Attention-Based Neural Network

  • 학술지 구분 국내전문학술지(KCI급)
  • 게재년월 2024-12
  • 저자명 Andri Agustav Wirabudi, Haechul Choi
  • 발행처명 한국방송미디어공학회
  • 발행국가 국내
  • 논문언어 외국어
  • 전체저자수 2

논문 초록 (Abstract)

In the field of image and video compression, the objective is to achieve a balance between compression efficiency and the quality of reconstructed images. 

The commonly used quality assessment method in this field is the Peak Signal-to-Noise Ratio (PSNR), which, however, has a limitation in that it only considers the differences in pixel values. 

To address this, our research introduces the Foveal Peak Signal-to-Noise Ratio (F_PSNR), a visual perception-based approach that reflects human foveal vision. 

Specifically, we propose a multi-focus F_PSNR assessment method that incorporates the visual characteristics of humans for images containing multiple objects of interest.

Additionally, we suggest a model that integrates an attention mechanism focusing on the quality of objects of interest into the existing neural network-based compression method to enhance perception-based quality. 

Experimental results using the KODAK dataset demonstrate that applying the attention mechanism to existing methods can enhance the human-perceptual compression efficiency of neural networks.