Abstract
The aesthetic assessment of images is a popular research topic due to its practical applications in various fields such as image recommendation, image ranking, and image search. Currently, most research on image aesthetic assessment relies on large-scale photography datasets, such as AVA and AADB, primarily composed of photos taken by users in real-world scenarios. Few studies specifically focus on the automatic aesthetic assessment of artistic images. Artistic images are more complex, diverse, and abstract compared to photographic images. In this paper, we propose a convolutional neural network model to automatically generate aesthetic scores for input artistic images. Unlike previous research, this study explores artistic theories and introduces the analysis of aesthetic features in artistic images from three dimensions: color, brightness, and contour. These features are integrated to generate an overall aesthetic score. We utilize our own large-scale dataset of artistic images for aesthetic assessment, consisting of over 7,000 artistic images, each accompanied by corresponding average aesthetic scores assigned by users. We compare our model with state-of-the-art image aesthetic assessment models, demonstrating the effectiveness of our approach. Code is available at: https://github.com/ysmyan/aiaa















Similar content being viewed by others
Data availability
The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.
References
Murray, N., Marchesotti, L., Perronnin, F.: Ava: a large-scale database for aesthetic visual analysis. In: 2012 IEEE conference on computer vision and pattern recognition (IEEE, 2012). https://doi.org/10.1109/cvpr.2012.6247954
Kong, S., Shen, X., Lin, Z., Mech, R., Fowlkes, C.: Photo aesthetics ranking network with attributes and content adaptation, 662–679 (Springer International Publishing, 2016)
Tang, X., Luo, W., Wang, X.: Content-based photo quality assessment. IEEE Trans. Multimed. 15, 1930–1943 (2013). https://doi.org/10.1109/tmm.2013.2269899
Yi, R., Tian, H., Gu, Z., Lai, Y.-K., Rosin, P.L.: Towards artistic image aesthetics assessment: a large-scale dataset and a new method. In: 2023 IEEE/CVF conference on computer vision and pattern recognition (CVPR), https://doi.org/10.1109/cvpr52729.2023.02144 (IEEE, 2023)
Lisa, C.: Colour theory: understanding and working with colour (RMIT Open Press, 2023)
Stephen, P.E.: Photons to Phenomenology (The MIT Press, 1999)
Mamassian, P.: Ambiguities and conventions in the perception of visual art. Vision. Res. 48, 2143–2153 (2008). https://doi.org/10.1016/j.visres.2008.06.010
Cheng, Z., Yang, Q., Sheng, B.: Deep colorization. In: 2015 IEEE international conference on computer vision (ICCV) 415–423.https://doi.org/10.1109/ICCV.2015.55 (2015)
Jin, Y., Sheng, B., Li, P., Chen, C.L.P.: Broad colorization. IEEE Trans. Neural Netw. Learn. Syst. 32, 2330–2343 (2021). https://doi.org/10.1109/TNNLS.2020.3004634
Sheng, B., Li, P., Jin, Y., Tan, P., Lee, T.-Y.: Intrinsic image decomposition with step and drift shading separation. IEEE Trans. Visual Comput. Graphics 26, 1332–1346 (2020). https://doi.org/10.1109/TVCG.2018.2869326
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: 2018 IEEE/CVF conference on computer vision and pattern recognition. https://doi.org/10.1109/cvpr.2018.00745 (IEEE, 2018)
Rombach, R., Blattmann, A., Lorenz, D., Esser, P., Ommer, B.: High-resolution image synthesis with latent diffusion models (2021). arXiv:2112.10752
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE conference on computer vision and pattern recognition (CVPR) https://doi.org/10.1109/cvpr.2016.90 (IEEE, 2016)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Pereira, F., Burges, C., Bottou, L., Weinberger, K. (eds.) Advances in neural information processing systems, vol. 25 (Curran Associates, Inc., 2012)
Lin, X., et al.: Eapt: Efficient attention pyramid transformer for image processing. IEEE Trans. Multimedia 25, 50–61 (2023). https://doi.org/10.1109/TMM.2021.3120873
Huang, S., et al.: Transmrsr: transformer-based self-distilled generative prior for brain mri super-resolution. Vis. Comput. 39, 3647–3659 (2023). https://doi.org/10.1007/s00371-023-02938-3
Zhou, Y., et al.: Fsad-net: Feedback spatial attention dehazing network. IEEE Trans. Neural Netw. Learn. Syst. 34, 7719–7733 (2023). https://doi.org/10.1109/TNNLS.2022.3146004
Vaswani, A., et al.: Attention is all you need. In: Guyon, I. et al. (eds.) Advances in neural information processing systems, vol. 30 (Curran Associates, Inc., 2017)
Ren, J., Shen, X., Lin, Z., Mech, R., Foran, D.J.: Personalized image aesthetics. In: 2017 IEEE international conference on computer vision (ICCV). https://doi.org/10.1109/iccv.2017.76 (IEEE, 2017)
Yang, Y., et al.: Personalized image aesthetics assessment with rich attributes. In: 2022 IEEE/CVF conference on computer vision and pattern recognition (CVPR). https://doi.org/10.1109/cvpr52688.2022.01924 (IEEE, 2022)
He, S., Zhang, Y., Xie, R., Jiang, D., Ming, A.: Rethinking image aesthetics assessment: models, datasets and benchmarks. In: Raedt, L. D. (ed.) Proceedings of the thirty-first international joint conference on artificial intelligence, IJCAI-22, 942–948, https://doi.org/10.24963/ijcai.2022/132 (International Joint Conferences on Artificial Intelligence Organization, 2022). Main Track
Amirshahi, S.A., Hayn-Leichsenring, G.U., Denzler, J., Redies, C.: Jenaesthetics subjective dataset: analyzing paintings by subjective scores. In: Agapito, L., Bronstein, M. M. & Rother, C. (eds.) Computer Vision - ECCV 2014 Workshops 3–19. (Springer International Publishing, Cham, 2015)
Talebi, H., Milanfar, P.: Nima: Neural image assessment. IEEE Trans. Image Process. 27, 3998–4011 (2018). https://doi.org/10.1109/tip.2018.2831899
Levina, E., Bickel, P.: The earth mover’s distance is the mallows distance: some insights from statistics. In: Proceedings eighth ieee international conference on computer vision. ICCV 2001 ICCV-01. https://doi.org/10.1109/iccv.2001.937632 (IEEE Comput. Soc, 2001)
Hosu, V., Goldlucke, B., Saupe, D.: Effective aesthetics prediction with multi-level spatially pooled features. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR) (2019)
Ma, S., Liu, J., Chen, C.W.: A-lamp: adaptive layout-aware multi-patch deep convolutional neural network for photo aesthetic assessment. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR) https://doi.org/10.1109/cvpr.2017.84 (IEEE, 2017)
Wang, L., Wang, X., Yamasaki, T.: Image aesthetics prediction using multiple patches preserving the original aspect ratio of contents. Multimed. Tools Appl. 82, 2783–2804 (2022). https://doi.org/10.1007/s11042-022-13333-w
She, D., Lai, Y.-K., Yi, G., Xu, K.: Hierarchical layout-aware graph convolutional network for unified aesthetics assessment. In: 2021 IEEE/CVF conference on computer vision and pattern recognition (CVPR). https://doi.org/10.1109/cvpr46437.2021.00837 (IEEE, 2021)
Lv, P., et al.: User-guided personalized image aesthetic assessment based on deep reinforcement learning. IEEE Trans. Multimedia 25, 736–749 (2023). https://doi.org/10.1109/TMM.2021.3130752
Zhu, H., Zhou, Y., Li, L., Li, Y., Guo, Y.: Learning personalized image aesthetics from subjective and objective attributes. IEEE Trans. Multimedia 25, 179–190 (2023). https://doi.org/10.1109/TMM.2021.3123468
Zhu, H., et al.: Personalized image aesthetics assessment with attribute-guided fine-grained feature representation. In: Proceedings of the 31st ACM International Conference on Multimedia, MM ’23, https://doi.org/10.1145/3581783.3611942 (ACM, 2023)
He, S., Ming, A., Zheng, S., Zhong, H., Ma, H.: Eat: An enhancer for aesthetics-oriented transformers. In: Proceedings of the 31st ACM international conference on multimedia MM ’23. https://doi.org/10.1145/3581783.3611881 (ACM, 2023)
Niu, Y., Chen, S., Song, B., Chen, Z., Liu, W.: Comment-guided semantics-aware image aesthetics assessment. IEEE Trans. Circuits Syst. Video Technol. 33, 1487–1492 (2023). https://doi.org/10.1109/TCSVT.2022.3201510
Nie, X., et al.: Bmi-net: a brain-inspired multimodal interaction network for image aesthetic assessment. In: Proceedings of the 31st ACM International Conference on Multimedia MM ’23. https://doi.org/10.1145/3581783.3611996 (ACM, 2023)
Sheng, X., et al.: Aesclip: multi-attribute contrastive learning for image aesthetics assessment. In: Proceedings of the 31st ACM international conference on multimedia, MM ’23. https://doi.org/10.1145/3581783.3611969 (ACM, 2023)
Amirshahi, S.A., Denzler, J.: Judging aesthetic quality in paintings based on artistic inspired color features. In: 2017 international conference on digital image computing: techniques and applications (DICTA), https://doi.org/10.1109/dicta.2017.8227452 (IEEE, 2017)
Guo, X., Kurita, T., Asano, C.M., Asano, A.: Visual complexity assessment of painting images. In: 2013 IEEE international conference on image processing. https://doi.org/10.1109/icip.2013.6738080 (IEEE, 2013)
Li, C., Chen, T.: Aesthetic visual quality assessment of paintings. IEEE J. Sel. Top. Sign. Proc. 3, 236–252 (2009). https://doi.org/10.1109/jstsp.2009.2015077
Zhang, J., Miao, Y., Zhang, J., Yu, J.: Inkthetics: a comprehensive computational model for aesthetic evaluation of chinese ink paintings. IEEE Access 8, 225857–225871 (2020). https://doi.org/10.1109/access.2020.3044573
Chang, H., Fried, O., Liu, Y., DiVerdi, S., Finkelstein, A.: Palette-based photo recoloring. ACM Tran. Gr. 34, 1–11 (2015). https://doi.org/10.1145/2766978
Glorot, X., Bordes, A., Bengio, Y.: Deep sparse rectifier neural networks. In: Proceedings of the fourteenth international conference on artificial intelligence and statistics, 315–323 (JMLR Workshop and Conference Proceedings, 2011)
Dubey, S.R., et al.: diffgrad: An optimization method for convolutional neural networks. IEEE Trans. Neural Netw. Learn. Syst. 31, 4500–4511 (2020). https://doi.org/10.1109/tnnls.2019.2955777
Selvaraju, R.R., et al.: Grad-cam: visual explanations from deep networks via gradient-based localization. In: 2017 IEEE international conference on computer vision (ICCV). https://doi.org/10.1109/iccv.2017.74 (IEEE, 2017)
He, S., Ming, A., Zheng, S., Zhong, H., Ma, H.: Eat: An enhancer for aesthetics-oriented transformers. In: Proceedings of the 31st ACM international conference on multimedia, MM ’23. https://doi.org/10.1145/3581783.3611881 (ACM, 2023)
He, S., et al.: Thinking image color aesthetics assessment: Models, datasets and benchmarks. In: 2023 IEEE/CVF international conference on computer vision (ICCV). https://doi.org/10.1109/iccv51070.2023.01996 (IEEE, 2023)
Zhu, H., et al.: Personalized image aesthetics assessment via meta-learning with bilevel gradient optimization. IEEE Trans. Cybern. 52, 1798–1811 (2022). https://doi.org/10.1109/tcyb.2020.2984670
Author information
Authors and Affiliations
Contributions
SY contributed to writing (original draft), coding and conducting experiments. SX contributed to algorithm idea, writing (review and editing) and experiments design. AL contributed to conducting experiments. SZ contributed to supervision and project administration. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yan, S., Xu, S., Lei, A. et al. Advancing neural aesthetic assessment of artistic images based on bundle features integration. Vis Comput 41, 5447–5459 (2025). https://doi.org/10.1007/s00371-024-03732-5
Accepted:
Published:
Version of record:
Issue date:
DOI: https://doi.org/10.1007/s00371-024-03732-5
