Visual Artifact Detection and Correction for Digital Images Via Deep Neural Networks (Real-ESRGAN)
Keywords:
Digital Image Processing, Image Quality Enhancement, Image Denoising, Deep Learning, Deep Neural Networks, Pre-Trained Models,, Real-ESRGAN, Visual Analysis, Descriptive Quantitative AnalysisAbstract
This work pursues to the improvement of digital images quality and the correction of visual artifacts, in particular noise, by means of deep neural networks, through an application-oriented investigation based on pre-trained models which are ready to be used in the execution on a resource-limited platform. The study is inspired by a typical problem in processing digital images, on one hand traditional image enhancement algorithms do not work well on real world low-quality images because they cannot well trade off between over smoothing noise and preserving visual details and image structural information, on the other hand, high quality ground truth images corresponding to low quality real images are not available. The experimental part of this work was performed on a real noisy digital image from an open-source platform. The image was processed using a specialized processing pipeline developed in Python. The PyTorch library was used to run the DL model and some dedicated libraries are also included such as Real-ESRGAN as the default model to enhance the image quality, BasicSR as the generic framework to manage the process workflow, and the installation of GFPGAN and FaceXLib to facilitate optional facial restoration. Prior to all other testing were done on the CPU, these experiments were conducted using only the Central Processing Unit(DCPU) and not any high end Graphical Processing Units(GPUs) The processing approach was to use the Real-ESRGAN model only in the inference stage without any training process or modification on the model architecture. The discussion of results was grounded on qualitative visual inspection aided with a descriptive quantitative analysis of result indicators that can be obtained from the image itself before and after processing, such as the image size, the total number of pixels, and the spatial upscaling factor. The result indicates a 16 times enlargement in pixel number after processing (under the upscaling factor of ×4), as well as an intuitive improvement on clearness of details and the visual noise decreasing, which means the enhancement on perceptual quality of the image. Results show that exploiting pre-trained DNN models is a realistic and time-efficient strategy for improving quality of noisy images of the real world, even if computational power is scarce at the time of acquisition of images. In addition, the work demonstrates the feasibility of leveraging open source software in the domain of digital imaging and thus paves the way for potential future investigation incorporating larger datasets or employing standardized quantitative evaluation metrics when appropriate evaluation conditions are obtainable.
References
Bush, T., & Glover, D. (2021). School leadership models: What do we know? School Leadership & Management, 41(1–2), 1–16. https://doi.org/10.1080/13632434.2020.1745605
Day, C., Gu, Q., & Sammons, P. (2020). The impact of leadership on student and teacher outcomes. Educational Administration Quarterly, 56(3), 406–440. https://doi.org/10.1177/0013161X20907250
Elad, M., Kawar, B., & Vaksman, G. (2023). Image denoising: The deep learning revolution and beyond. SIAM Journal on Imaging Sciences, 16(3), 1594–1654. https://doi.org/10.1137/23M1545859
Field, A. (2018). Discovering statistics using IBM SPSS statistics (5th ed.). Sage Publications.
Gonzalez, R. C., & Woods, R. E. (2018). Digital image processing (4th ed.). Pearson.
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT Press.
Hallinger, P., Liu, S., & Piyaman, P. (2021). Does principal leadership matter? Journal of Educational Administration, 59(2), 133–152. https://doi.org/10.1108/JEA-08-2020-0176
Harris, A., & Jones, M. (2022). Leading professional learning communities. School Leadership & Management, 42(4), 303–320. https://doi.org/10.1080/13632434.2021.1966836
Jain, A. K. (2015). Fundamentals of digital image processing. Prentice Hall.
Jain, V., & Seung, H. S. (2008). Natural image denoising with convolutional networks. In Advances in Neural Information Processing Systems (NeurIPS) (Vol. 21, pp. 769–776).
Leithwood, K., Harris, A., & Hopkins, D. (2020). Seven strong claims about successful school leadership. School Leadership & Management, 40(1), 5–22. https://doi.org/10.1080/13632434.2019.1596077
Liu, Y., Bellibaş, M. Ş., & Gümüş, S. (2021). The effect of instructional leadership on teacher professional learning. Educational Management Administration & Leadership, 49(4), 1–20. https://doi.org/10.1177/1741143220932586
Ng, S. W., Nguyen, T. D., & Hallinger, P. (2022). Leadership effects on teacher job performance. Teaching and Teacher Education, 108, 103512. https://doi.org/10.1016/j.tate.2021.103512
Northouse, P. G. (2022). Leadership: Theory and practice (9th ed.). Sage Publications.
Ohayon, G., Adrai, T., Vaksman, G., Elad, M., & Milanfar, P. (2021). High perceptual quality image denoising with a posterior sampling CGAN. arXiv:2103.04192.
Oplatka, I. (2021). Teacher performance and principal leadership. International Journal of Educational Management, 35(6), 1151–1165. https://doi.org/10.1108/IJEM-11-2020-0523
Robinson, V. M. J. (2020). Reducing the gap between leadership theory and practice. Educational Management Administration & Leadership, 48(4), 635–650. https://doi.org/10.1177/1741143219836664
Sebastian, J., Allensworth, E., & Huang, H. (2021). Principal leadership and teacher effectiveness. American Educational Research Journal, 58(4), 1–36. https://doi.org/10.3102/0002831220985339
Shava, G. N., & Ndebele, C. (2021). School leadership and teacher motivation. Journal of Educational Research and Practice, 11(1), 45–59.
Sun, J., & Leithwood, K. (2020). Leadership effects on teachers’ work and commitment. Educational Administration Quarterly, 56(3), 1–38. https://doi.org/10.1177/0013161X20921699
Tian, C., Xu, Y., Fei, L., & Yan, K. (2018). Deep learning for image denoising: A survey. arXiv:1810.05052.
Vincent, P., Larochelle, H., Bengio, Y., & Manzagol, P. A. (2010). Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. Journal of Machine Learning Research, 11, 3371–3408.
Wang, X., Xie, L., Dong, C., & Change Loy, C. (2021). Real-ESRGAN: Training real-world blind super-resolution with pure synthetic data. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV).
Wang, X., Yu, K., Wu, S., Gu, J., Liu, Y., Dong, C., Qiao, Y., & Change Loy, C. (2018). ESRGAN: Enhanced super-resolution generative adversarial networks. In ECCV Workshops.
Wang, Z., Bovik, A. C., Sheikh, H. R., & Simoncelli, E. P. (2004). Image quality assessment: From error visibility to structural similarity. IEEE Transactions on Image Processing, 13(4), 600–612. https://doi.org/10.1109/TIP.2003.819861
Wiyono, B. B., & Burhanuddin, B. (2022). Principal leadership and teacher performance in Islamic schools. Journal of Educational Leadership and Policy Studies, 6(2), 1–14.
Yasir, M., Imran, R., Irshad, M. K., Mohamad, N. A., & Khan, M. M. (2021). Leadership styles and employee performance. Leadership & Organization Development Journal, 42(3), 1–18. https://doi.org/10.1108/LODJ-02-2020-0071
Zhang, K., Ren, W., Luo, W., Lai, W.-S., Stenger, B., Yang, M.-H., & Li, H. (2022). Deep image deblurring: A survey. International Journal of Computer Vision, 130, 2103–2130. https://doi.org/10.1007/s11263-022-01633-5
Zhang, K., Zuo, W., Chen, Y., Meng, D., & Zhang, L. (2017). Beyond a Gaussian denoiser: Residual learning of deep CNN for image denoising. IEEE Transactions on Image Processing, 26(7), 3142–3155. https://doi.org/10.1109/TIP.2017.2662206




