close
Skip to main content
Log in

Image aesthetic assessment with weighted multi-region aggregation based on information theory

  • Original Article
  • Published:
Pattern Analysis and Applications Aims and scope Submit manuscript

Abstract

Image aesthetic assessment is a hot issue in current research. It will be very important to find regions that affect the aesthetic assessment of the image, for which we propose a weighted multi-region aesthetic assessment network WRMA-Net, which consists of three modules: information theory-based image segmentation module uses information theory to segment images; in the feature extraction module, we connect Convolutional Neural Network(CNN) and Graph Neural Network(GNN) in tandem, using CNN to obtain shallow detail information of the image and GNN to obtain deep semantic information of the image, which can retain feature information at each level, and subsequently fuse shallow and deep features to predict aesthetic assessment scores by the region weighting module; the weighted multi-region aggregation module assigns different weights to each region adaptively to adjust the prediction results and find high-quality aesthetic regions. The network can analyze image aesthetics from multiple regions and provide constructive regional aesthetic suggestions. The experimental results show that our WMRA-Net achieves good results in some aesthetic assessment metrics.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+
from $39.99 /Month
  • Starting from 10 chapters or articles per month
  • Access and download chapters and articles from more than 300k books and 2,500 journals
  • Cancel anytime
View plans

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
The alternative text for this image may have been generated using AI.
Fig. 2
The alternative text for this image may have been generated using AI.
Fig. 3
The alternative text for this image may have been generated using AI.
Fig. 4
The alternative text for this image may have been generated using AI.
Fig. 5
The alternative text for this image may have been generated using AI.
Fig. 6
The alternative text for this image may have been generated using AI.
Fig. 7
The alternative text for this image may have been generated using AI.
Fig. 8
The alternative text for this image may have been generated using AI.
Fig. 9
The alternative text for this image may have been generated using AI.
Fig. 10
The alternative text for this image may have been generated using AI.
Fig. 11
The alternative text for this image may have been generated using AI.

Similar content being viewed by others

Data availability

All accompanying data are provided in the manuscript.

References

  1. Wu X (2022) Interpretable aesthetic analysis model for intelligent photography guidance systems. 27th Int Conf Intell User Interfaces. https://doi.org/10.1145/3490099.3511155

    Article  Google Scholar 

  2. Zhang B, Niu L, Zhang L (2021) Image composition assessment with saliency-augmented multi-pattern pooling. ArXiv

  3. Lu X, Lin Z, Shen X, Mech R, Wang JZ (2015) Deep multi-patch aggregation network for image style, aesthetics, and quality estimation. 2015 IEEE Int Conf Comput Vis (ICCV). https://doi.org/10.1109/ICCV.2015.119

    Article  Google Scholar 

  4. Ma S, Liu J, Chen CW (2017) A-Lamp: adaptive layout-aware multi-patch deep convolutional neural network for photo aesthetic assessment. 2017 IEEE Conf Comput Vis Pattern Recognit (CVPR). https://doi.org/10.1109/CVPR.2017.84

    Article  Google Scholar 

  5. Wang L, Wang X, Yamasaki T (2023) Image aesthetics prediction using multiple patches preserving the original aspect ratio of contents. Multimed Tools Appl 82:2783–2804. https://doi.org/10.1007/s11042-022-13333-w

    Article  Google Scholar 

  6. Yang J, Zhou Y, Zhao Y, Lu W, Gao X (2022) MetaMP: metalearning-based multipatch image aesthetics assessment. IEEE Trans Cybern. https://doi.org/10.1109/TCYB.2022.3169017

    Article  Google Scholar 

  7. Le Q-T, Ladret P, Nguyen H-T, Caplier A (2020) Image aesthetic assessment based on image classification and region segmentation. J Imaging 7:3. https://doi.org/10.3390/jimaging7010003

    Article  Google Scholar 

  8. Liu D, Puri R, Kamath N, Bhattacharya S (2020) Composition-aware image aesthetics assessment. 2020 IEEE Winter Conf Appl Comput Vis (WACV). https://doi.org/10.1109/WACV45572.2020.9093412

    Article  Google Scholar 

  9. Krizhevsky A, Sutskever I, Hinton GE (2017) ImageNet classification with deep convolutional neural networks. Commun ACM 60:84–90. https://doi.org/10.1145/3065386

    Article  Google Scholar 

  10. Kipf T, Welling M (2016) Semi-supervised classification with graph convolutional networks. ArXiv

  11. Lee B, Seo MK, Kim D, Shin I, Schich M, Jeong H, Han SK (2020) Dissecting landscape art history with information theory. Proc Natl Acad Sci USA 117:26580–26590. https://doi.org/10.1073/pnas.2011927117

    Article  Google Scholar 

  12. Lu X, Lin Z, Jin H, Yang J, Wang JZ (2014) RAPID: rating pictorial aesthetics using deep learning. Proc 22nd ACM Int Conf Multimedia. https://doi.org/10.1145/2647868.2654927

    Article  Google Scholar 

  13. Talebi H, Milanfar P (2018) NIMA: neural image assessment. IEEE Trans Image Process 27:3998–4011. https://doi.org/10.1109/TIP.2018.2831899

    Article  MathSciNet  Google Scholar 

  14. Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) MobileNets: efficient convolutional neural networks for mobile vision applications. ArXiv

  15. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. CoRR

  16. Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. 2015 IEEE conference on computer vision and pattern recognition (CVPR). pp 1–9. https://doi.org/10.1109/CVPR.2015.7298594

  17. Kong S, Shen X, Lin Z, Mech R, Fowlkes C (2016) Photo aesthetics ranking network with attributes and content adaptation. 9905:662–679. https://doi.org/10.1007/978-3-319-46448-0_40

  18. Gao F, Li Z, Yu J, Yu J, Huang Q, Tian Q (2020) Style-adaptive photo aesthetic rating via convolutional neural networks and multi-task learning. Neurocomputing 395:247–254. https://doi.org/10.1016/j.neucom.2018.06.099

    Article  Google Scholar 

  19. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. 2016 IEEE Conf Comput Vis Pattern Recognit (CVPR). https://doi.org/10.1109/CVPR.2016.90

    Article  Google Scholar 

  20. Murray N, Marchesotti L, Perronnin F (2012) AVA: a large-scale database for aesthetic visual analysis. 2012 IEEE Conf Comput Vis Pattern Recognit. https://doi.org/10.1109/CVPR.2012.6247954

    Article  Google Scholar 

  21. Hou J, Yang S, Lin W, Zhao B, Fang Y (2021) Learning image aesthetic assessment from object-level visual components. ArXiv

  22. She D, Lai Y-K, Yi G, Xu K (2021) Hierarchical layout-aware graph convolutional network for unified aesthetics assessment. 2021 IEEE/CVF conference on computer vision and pattern recognition (CVPR). https://doi.org/10.1109/CVPR46437.2021.00837

  23. Zeng H, Cao Z, Zhang L, Bovik AC (2020) A unified probabilistic formulation of image aesthetic assessment. IEEE Trans Image Process 29:1548–1561. https://doi.org/10.1109/TIP.2019.2941778

    Article  MathSciNet  Google Scholar 

  24. Murray N, Gordo A (2017) A deep architecture for unified aesthetic prediction. ArXiv

  25. Sheng K, Dong W, Ma C, Mei X, Huang F, Hu B-G (2018) Attention-based multi-patch aggregation for image aesthetic assessment. Proc 26th ACM Int Conf Multimedia. https://doi.org/10.1145/3240508.3240554

    Article  Google Scholar 

  26. Hosu V, Goldlucke B, Saupe D (2019) Effective aesthetics prediction with multi-level spatially pooled features. 2019 IEEE/CVF Conf Comput Vis Pattern Recognit (CVPR). https://doi.org/10.1109/CVPR.2019.00960

    Article  Google Scholar 

  27. Chen Q, Zhang W, Zhou N, Lei P, Xu Y, Zheng Y, Fan J (2020) Adaptive fractional dilated convolution network for image aesthetics assessment. 2020 IEEE/CVF Conf Comput Vis Pattern Recognit (CVPR). https://doi.org/10.1109/CVPR42600.2020.01412

    Article  Google Scholar 

  28. Li L, Zhu H, Zhao S, Ding G, Lin W (2020) Personality-assisted multi-task learning for generic and personalized image aesthetics assessment. IEEE Trans Image Process 29:3898–3910. https://doi.org/10.1109/TIP.2020.2968285

    Article  Google Scholar 

  29. Rigau J, Feixas M, Sbert M (2008) Informational aesthetics measures. IEEE Comput Grap Appl 28:24–34. https://doi.org/10.1109/MCG.2008.34

    Article  Google Scholar 

  30. Han K, Wang Y, Guo J, Tang Y, Wu E (2022) Vision GNN: an image is worth graph of nodes. https://doi.org/10.48550/ARXIV.2206.00272

  31. Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L (2009) ImageNet: a large-scale hierarchical image database. 2009 IEEE conference on computer vision and pattern recognition. pp 248–255. https://doi.org/10.1109/CVPR.2009.5206848

  32. He S et al (2022) Rethinking image aesthetics assessment: models, datasets and benchmarks. IJCAI. https://www.ijcai.org/proceedings/2022/132

  33. He S et al (2023) Eat: an enhancer for aesthetics-oriented transformers. Proceedings of the 31st ACM international conference on multimedia. https://dl.acm.org/doi/abs/10.1145/3581783.361188

  34. Ke J et al (2021) Musiq: multi-scale image quality transformer. Proceedings of the IEEE/CVF international conference on computer vision.

  35. Li L et al (2022) Psychology inspired model for hierarchical image aesthetic attribute prediction. 2022 IEEE international conference on multimedia and expo (ICME), IEEE

Download references

Funding

Received no external funding.

Author information

Authors and Affiliations

Authors

Contributions

Yin Wang: Methodology, Software, Writing - Original Draft.Jing Guo: Conceptualization, Supervision, Validation, Writing - Review & EditingYongzhen Ke: Conceptualization, Supervision, Methodology, Writing - Review & Editing.Kai Wang: Writing - Review & Editing, Formal analysis, Visualization.Shuai Yang: Resources, Validation, Data Curation.Liming Chen: Methodology, Writing - Review & Editing.

Corresponding author

Correspondence to Yongzhen Ke.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Ethical approval

Not applicable.

Consent for publication

The work described has not been published before. it is not under consideration for publication elsewhere. its publication has been approved by all co-authors.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, Y., Guo, J., Ke, Y. et al. Image aesthetic assessment with weighted multi-region aggregation based on information theory. Pattern Anal Applic 28, 115 (2025). https://doi.org/10.1007/s10044-025-01490-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Version of record:

  • DOI: https://doi.org/10.1007/s10044-025-01490-1

Keywords