Advancing neural aesthetic assessment of artistic images based on bundle features integration

Yan, Simin; Xu, Shuchang; Lei, Aiping; Zhang, Sanyuan

doi:10.1007/s00371-024-03732-5

Advancing neural aesthetic assessment of artistic images based on bundle features integration

Research
Published: 10 December 2024

Volume 41, pages 5447–5459 (2025)
Cite this article

The Visual Computer Aims and scope Submit manuscript

Simin Yan¹,
Shuchang Xu²,
Aiping Lei³ &
…
Sanyuan Zhang¹

332 Accesses
Explore all metrics

Abstract

The aesthetic assessment of images is a popular research topic due to its practical applications in various fields such as image recommendation, image ranking, and image search. Currently, most research on image aesthetic assessment relies on large-scale photography datasets, such as AVA and AADB, primarily composed of photos taken by users in real-world scenarios. Few studies specifically focus on the automatic aesthetic assessment of artistic images. Artistic images are more complex, diverse, and abstract compared to photographic images. In this paper, we propose a convolutional neural network model to automatically generate aesthetic scores for input artistic images. Unlike previous research, this study explores artistic theories and introduces the analysis of aesthetic features in artistic images from three dimensions: color, brightness, and contour. These features are integrated to generate an overall aesthetic score. We utilize our own large-scale dataset of artistic images for aesthetic assessment, consisting of over 7,000 artistic images, each accompanied by corresponding average aesthetic scores assigned by users. We compare our model with state-of-the-art image aesthetic assessment models, demonstrating the effectiveness of our approach. Code is available at: https://github.com/ysmyan/aiaa

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+

from $39.99 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Image Aesthetics Assessment Using Fully Convolutional Neural Networks

New Approach for the Aesthetic Improvement of Images Through the Combination of Convolutional Neural Networks and Evolutionary Algorithms

IDEA: A new dataset for image aesthetic scoring

Article 18 August 2018

Data availability

The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.

References

Murray, N., Marchesotti, L., Perronnin, F.: Ava: a large-scale database for aesthetic visual analysis. In: 2012 IEEE conference on computer vision and pattern recognition (IEEE, 2012). https://doi.org/10.1109/cvpr.2012.6247954
Kong, S., Shen, X., Lin, Z., Mech, R., Fowlkes, C.: Photo aesthetics ranking network with attributes and content adaptation, 662–679 (Springer International Publishing, 2016)
Tang, X., Luo, W., Wang, X.: Content-based photo quality assessment. IEEE Trans. Multimed. 15, 1930–1943 (2013). https://doi.org/10.1109/tmm.2013.2269899
Article Google Scholar
Yi, R., Tian, H., Gu, Z., Lai, Y.-K., Rosin, P.L.: Towards artistic image aesthetics assessment: a large-scale dataset and a new method. In: 2023 IEEE/CVF conference on computer vision and pattern recognition (CVPR), https://doi.org/10.1109/cvpr52729.2023.02144 (IEEE, 2023)
Lisa, C.: Colour theory: understanding and working with colour (RMIT Open Press, 2023)
Stephen, P.E.: Photons to Phenomenology (The MIT Press, 1999)
Mamassian, P.: Ambiguities and conventions in the perception of visual art. Vision. Res. 48, 2143–2153 (2008). https://doi.org/10.1016/j.visres.2008.06.010
Article Google Scholar
Cheng, Z., Yang, Q., Sheng, B.: Deep colorization. In: 2015 IEEE international conference on computer vision (ICCV) 415–423.https://doi.org/10.1109/ICCV.2015.55 (2015)
Jin, Y., Sheng, B., Li, P., Chen, C.L.P.: Broad colorization. IEEE Trans. Neural Netw. Learn. Syst. 32, 2330–2343 (2021). https://doi.org/10.1109/TNNLS.2020.3004634
Article Google Scholar
Sheng, B., Li, P., Jin, Y., Tan, P., Lee, T.-Y.: Intrinsic image decomposition with step and drift shading separation. IEEE Trans. Visual Comput. Graphics 26, 1332–1346 (2020). https://doi.org/10.1109/TVCG.2018.2869326
Article Google Scholar
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: 2018 IEEE/CVF conference on computer vision and pattern recognition. https://doi.org/10.1109/cvpr.2018.00745 (IEEE, 2018)
Rombach, R., Blattmann, A., Lorenz, D., Esser, P., Ommer, B.: High-resolution image synthesis with latent diffusion models (2021). arXiv:2112.10752
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR) https://doi.org/10.1109/cvpr.2016.90 (IEEE, 2016)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Pereira, F., Burges, C., Bottou, L., Weinberger, K. (eds.) Advances in neural information processing systems, vol. 25 (Curran Associates, Inc., 2012)
Lin, X., et al.: Eapt: Efficient attention pyramid transformer for image processing. IEEE Trans. Multimedia 25, 50–61 (2023). https://doi.org/10.1109/TMM.2021.3120873
Article Google Scholar
Huang, S., et al.: Transmrsr: transformer-based self-distilled generative prior for brain mri super-resolution. Vis. Comput. 39, 3647–3659 (2023). https://doi.org/10.1007/s00371-023-02938-3
Article Google Scholar
Zhou, Y., et al.: Fsad-net: Feedback spatial attention dehazing network. IEEE Trans. Neural Netw. Learn. Syst. 34, 7719–7733 (2023). https://doi.org/10.1109/TNNLS.2022.3146004
Article Google Scholar
Vaswani, A., et al.: Attention is all you need. In: Guyon, I. et al. (eds.) Advances in neural information processing systems, vol. 30 (Curran Associates, Inc., 2017)
Ren, J., Shen, X., Lin, Z., Mech, R., Foran, D.J.: Personalized image aesthetics. In: 2017 IEEE international conference on computer vision (ICCV). https://doi.org/10.1109/iccv.2017.76 (IEEE, 2017)
Yang, Y., et al.: Personalized image aesthetics assessment with rich attributes. In: 2022 IEEE/CVF conference on computer vision and pattern recognition (CVPR). https://doi.org/10.1109/cvpr52688.2022.01924 (IEEE, 2022)
He, S., Zhang, Y., Xie, R., Jiang, D., Ming, A.: Rethinking image aesthetics assessment: models, datasets and benchmarks. In: Raedt, L. D. (ed.) Proceedings of the thirty-first international joint conference on artificial intelligence, IJCAI-22, 942–948, https://doi.org/10.24963/ijcai.2022/132 (International Joint Conferences on Artificial Intelligence Organization, 2022). Main Track
Amirshahi, S.A., Hayn-Leichsenring, G.U., Denzler, J., Redies, C.: Jenaesthetics subjective dataset: analyzing paintings by subjective scores. In: Agapito, L., Bronstein, M. M. & Rother, C. (eds.) Computer Vision - ECCV 2014 Workshops 3–19. (Springer International Publishing, Cham, 2015)
Talebi, H., Milanfar, P.: Nima: Neural image assessment. IEEE Trans. Image Process. 27, 3998–4011 (2018). https://doi.org/10.1109/tip.2018.2831899
Article MathSciNet Google Scholar
Levina, E., Bickel, P.: The earth mover’s distance is the mallows distance: some insights from statistics. In: Proceedings eighth ieee international conference on computer vision. ICCV 2001 ICCV-01. https://doi.org/10.1109/iccv.2001.937632 (IEEE Comput. Soc, 2001)
Hosu, V., Goldlucke, B., Saupe, D.: Effective aesthetics prediction with multi-level spatially pooled features. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR) (2019)
Ma, S., Liu, J., Chen, C.W.: A-lamp: adaptive layout-aware multi-patch deep convolutional neural network for photo aesthetic assessment. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR) https://doi.org/10.1109/cvpr.2017.84 (IEEE, 2017)
Wang, L., Wang, X., Yamasaki, T.: Image aesthetics prediction using multiple patches preserving the original aspect ratio of contents. Multimed. Tools Appl. 82, 2783–2804 (2022). https://doi.org/10.1007/s11042-022-13333-w
Article Google Scholar
She, D., Lai, Y.-K., Yi, G., Xu, K.: Hierarchical layout-aware graph convolutional network for unified aesthetics assessment. In: 2021 IEEE/CVF conference on computer vision and pattern recognition (CVPR). https://doi.org/10.1109/cvpr46437.2021.00837 (IEEE, 2021)
Lv, P., et al.: User-guided personalized image aesthetic assessment based on deep reinforcement learning. IEEE Trans. Multimedia 25, 736–749 (2023). https://doi.org/10.1109/TMM.2021.3130752
Article Google Scholar
Zhu, H., Zhou, Y., Li, L., Li, Y., Guo, Y.: Learning personalized image aesthetics from subjective and objective attributes. IEEE Trans. Multimedia 25, 179–190 (2023). https://doi.org/10.1109/TMM.2021.3123468
Article Google Scholar
Zhu, H., et al.: Personalized image aesthetics assessment with attribute-guided fine-grained feature representation. In: Proceedings of the 31st ACM International Conference on Multimedia, MM ’23, https://doi.org/10.1145/3581783.3611942 (ACM, 2023)
He, S., Ming, A., Zheng, S., Zhong, H., Ma, H.: Eat: An enhancer for aesthetics-oriented transformers. In: Proceedings of the 31st ACM international conference on multimedia MM ’23. https://doi.org/10.1145/3581783.3611881 (ACM, 2023)
Niu, Y., Chen, S., Song, B., Chen, Z., Liu, W.: Comment-guided semantics-aware image aesthetics assessment. IEEE Trans. Circuits Syst. Video Technol. 33, 1487–1492 (2023). https://doi.org/10.1109/TCSVT.2022.3201510
Article Google Scholar
Nie, X., et al.: Bmi-net: a brain-inspired multimodal interaction network for image aesthetic assessment. In: Proceedings of the 31st ACM International Conference on Multimedia MM ’23. https://doi.org/10.1145/3581783.3611996 (ACM, 2023)
Sheng, X., et al.: Aesclip: multi-attribute contrastive learning for image aesthetics assessment. In: Proceedings of the 31st ACM international conference on multimedia, MM ’23. https://doi.org/10.1145/3581783.3611969 (ACM, 2023)
Amirshahi, S.A., Denzler, J.: Judging aesthetic quality in paintings based on artistic inspired color features. In: 2017 international conference on digital image computing: techniques and applications (DICTA), https://doi.org/10.1109/dicta.2017.8227452 (IEEE, 2017)
Guo, X., Kurita, T., Asano, C.M., Asano, A.: Visual complexity assessment of painting images. In: 2013 IEEE international conference on image processing. https://doi.org/10.1109/icip.2013.6738080 (IEEE, 2013)
Li, C., Chen, T.: Aesthetic visual quality assessment of paintings. IEEE J. Sel. Top. Sign. Proc. 3, 236–252 (2009). https://doi.org/10.1109/jstsp.2009.2015077
Article Google Scholar
Zhang, J., Miao, Y., Zhang, J., Yu, J.: Inkthetics: a comprehensive computational model for aesthetic evaluation of chinese ink paintings. IEEE Access 8, 225857–225871 (2020). https://doi.org/10.1109/access.2020.3044573
Article Google Scholar
Chang, H., Fried, O., Liu, Y., DiVerdi, S., Finkelstein, A.: Palette-based photo recoloring. ACM Tran. Gr. 34, 1–11 (2015). https://doi.org/10.1145/2766978
Article Google Scholar
Glorot, X., Bordes, A., Bengio, Y.: Deep sparse rectifier neural networks. In: Proceedings of the fourteenth international conference on artificial intelligence and statistics, 315–323 (JMLR Workshop and Conference Proceedings, 2011)
Dubey, S.R., et al.: diffgrad: An optimization method for convolutional neural networks. IEEE Trans. Neural Netw. Learn. Syst. 31, 4500–4511 (2020). https://doi.org/10.1109/tnnls.2019.2955777
Article MathSciNet Google Scholar
Selvaraju, R.R., et al.: Grad-cam: visual explanations from deep networks via gradient-based localization. In: 2017 IEEE international conference on computer vision (ICCV). https://doi.org/10.1109/iccv.2017.74 (IEEE, 2017)
He, S., Ming, A., Zheng, S., Zhong, H., Ma, H.: Eat: An enhancer for aesthetics-oriented transformers. In: Proceedings of the 31st ACM international conference on multimedia, MM ’23. https://doi.org/10.1145/3581783.3611881 (ACM, 2023)
He, S., et al.: Thinking image color aesthetics assessment: Models, datasets and benchmarks. In: 2023 IEEE/CVF international conference on computer vision (ICCV). https://doi.org/10.1109/iccv51070.2023.01996 (IEEE, 2023)
Zhu, H., et al.: Personalized image aesthetics assessment via meta-learning with bilevel gradient optimization. IEEE Trans. Cybern. 52, 1798–1811 (2022). https://doi.org/10.1109/tcyb.2020.2984670
Article Google Scholar

Download references

Author information

Authors and Affiliations

College of Computer Science and Technology, Zhejiang University, Hangzhou, China
Simin Yan & Sanyuan Zhang
College of Information Science and Technology, Hangzhou Normal University, Hangzhou, China
Shuchang Xu
Beijing Dailybread Co., Ltd., Beijing, China
Aiping Lei

Authors

Simin Yan
View author publications
Search author on:PubMed Google Scholar
Shuchang Xu
View author publications
Search author on:PubMed Google Scholar
Aiping Lei
View author publications
Search author on:PubMed Google Scholar
Sanyuan Zhang
View author publications
Search author on:PubMed Google Scholar

Contributions

SY contributed to writing (original draft), coding and conducting experiments. SX contributed to algorithm idea, writing (review and editing) and experiments design. AL contributed to conducting experiments. SZ contributed to supervision and project administration. All authors reviewed the manuscript.

Corresponding author

Correspondence to Shuchang Xu.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Yan, S., Xu, S., Lei, A. et al. Advancing neural aesthetic assessment of artistic images based on bundle features integration. Vis Comput 41, 5447–5459 (2025). https://doi.org/10.1007/s00371-024-03732-5

Download citation

Accepted: 15 November 2024
Published: 10 December 2024
Version of record: 10 December 2024
Issue date: June 2025
DOI: https://doi.org/10.1007/s00371-024-03732-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+

from $39.99 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Advancing neural aesthetic assessment of artistic images based on bundle features integration

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Image Aesthetics Assessment Using Fully Convolutional Neural Networks

New Approach for the Aesthetic Improvement of Images Through the Combination of Convolutional Neural Networks and Evolutionary Algorithms

IDEA: A new dataset for image aesthetic scoring

Explore related subjects

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now