Abstract
Image aesthetic assessment is a hot issue in current research. It will be very important to find regions that affect the aesthetic assessment of the image, for which we propose a weighted multi-region aesthetic assessment network WRMA-Net, which consists of three modules: information theory-based image segmentation module uses information theory to segment images; in the feature extraction module, we connect Convolutional Neural Network(CNN) and Graph Neural Network(GNN) in tandem, using CNN to obtain shallow detail information of the image and GNN to obtain deep semantic information of the image, which can retain feature information at each level, and subsequently fuse shallow and deep features to predict aesthetic assessment scores by the region weighting module; the weighted multi-region aggregation module assigns different weights to each region adaptively to adjust the prediction results and find high-quality aesthetic regions. The network can analyze image aesthetics from multiple regions and provide constructive regional aesthetic suggestions. The experimental results show that our WMRA-Net achieves good results in some aesthetic assessment metrics.











Similar content being viewed by others
Data availability
All accompanying data are provided in the manuscript.
References
Wu X (2022) Interpretable aesthetic analysis model for intelligent photography guidance systems. 27th Int Conf Intell User Interfaces. https://doi.org/10.1145/3490099.3511155
Zhang B, Niu L, Zhang L (2021) Image composition assessment with saliency-augmented multi-pattern pooling. ArXiv
Lu X, Lin Z, Shen X, Mech R, Wang JZ (2015) Deep multi-patch aggregation network for image style, aesthetics, and quality estimation. 2015 IEEE Int Conf Comput Vis (ICCV). https://doi.org/10.1109/ICCV.2015.119
Ma S, Liu J, Chen CW (2017) A-Lamp: adaptive layout-aware multi-patch deep convolutional neural network for photo aesthetic assessment. 2017 IEEE Conf Comput Vis Pattern Recognit (CVPR). https://doi.org/10.1109/CVPR.2017.84
Wang L, Wang X, Yamasaki T (2023) Image aesthetics prediction using multiple patches preserving the original aspect ratio of contents. Multimed Tools Appl 82:2783–2804. https://doi.org/10.1007/s11042-022-13333-w
Yang J, Zhou Y, Zhao Y, Lu W, Gao X (2022) MetaMP: metalearning-based multipatch image aesthetics assessment. IEEE Trans Cybern. https://doi.org/10.1109/TCYB.2022.3169017
Le Q-T, Ladret P, Nguyen H-T, Caplier A (2020) Image aesthetic assessment based on image classification and region segmentation. J Imaging 7:3. https://doi.org/10.3390/jimaging7010003
Liu D, Puri R, Kamath N, Bhattacharya S (2020) Composition-aware image aesthetics assessment. 2020 IEEE Winter Conf Appl Comput Vis (WACV). https://doi.org/10.1109/WACV45572.2020.9093412
Krizhevsky A, Sutskever I, Hinton GE (2017) ImageNet classification with deep convolutional neural networks. Commun ACM 60:84–90. https://doi.org/10.1145/3065386
Kipf T, Welling M (2016) Semi-supervised classification with graph convolutional networks. ArXiv
Lee B, Seo MK, Kim D, Shin I, Schich M, Jeong H, Han SK (2020) Dissecting landscape art history with information theory. Proc Natl Acad Sci USA 117:26580–26590. https://doi.org/10.1073/pnas.2011927117
Lu X, Lin Z, Jin H, Yang J, Wang JZ (2014) RAPID: rating pictorial aesthetics using deep learning. Proc 22nd ACM Int Conf Multimedia. https://doi.org/10.1145/2647868.2654927
Talebi H, Milanfar P (2018) NIMA: neural image assessment. IEEE Trans Image Process 27:3998–4011. https://doi.org/10.1109/TIP.2018.2831899
Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) MobileNets: efficient convolutional neural networks for mobile vision applications. ArXiv
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. CoRR
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. 2015 IEEE conference on computer vision and pattern recognition (CVPR). pp 1–9. https://doi.org/10.1109/CVPR.2015.7298594
Kong S, Shen X, Lin Z, Mech R, Fowlkes C (2016) Photo aesthetics ranking network with attributes and content adaptation. 9905:662–679. https://doi.org/10.1007/978-3-319-46448-0_40
Gao F, Li Z, Yu J, Yu J, Huang Q, Tian Q (2020) Style-adaptive photo aesthetic rating via convolutional neural networks and multi-task learning. Neurocomputing 395:247–254. https://doi.org/10.1016/j.neucom.2018.06.099
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. 2016 IEEE Conf Comput Vis Pattern Recognit (CVPR). https://doi.org/10.1109/CVPR.2016.90
Murray N, Marchesotti L, Perronnin F (2012) AVA: a large-scale database for aesthetic visual analysis. 2012 IEEE Conf Comput Vis Pattern Recognit. https://doi.org/10.1109/CVPR.2012.6247954
Hou J, Yang S, Lin W, Zhao B, Fang Y (2021) Learning image aesthetic assessment from object-level visual components. ArXiv
She D, Lai Y-K, Yi G, Xu K (2021) Hierarchical layout-aware graph convolutional network for unified aesthetics assessment. 2021 IEEE/CVF conference on computer vision and pattern recognition (CVPR). https://doi.org/10.1109/CVPR46437.2021.00837
Zeng H, Cao Z, Zhang L, Bovik AC (2020) A unified probabilistic formulation of image aesthetic assessment. IEEE Trans Image Process 29:1548–1561. https://doi.org/10.1109/TIP.2019.2941778
Murray N, Gordo A (2017) A deep architecture for unified aesthetic prediction. ArXiv
Sheng K, Dong W, Ma C, Mei X, Huang F, Hu B-G (2018) Attention-based multi-patch aggregation for image aesthetic assessment. Proc 26th ACM Int Conf Multimedia. https://doi.org/10.1145/3240508.3240554
Hosu V, Goldlucke B, Saupe D (2019) Effective aesthetics prediction with multi-level spatially pooled features. 2019 IEEE/CVF Conf Comput Vis Pattern Recognit (CVPR). https://doi.org/10.1109/CVPR.2019.00960
Chen Q, Zhang W, Zhou N, Lei P, Xu Y, Zheng Y, Fan J (2020) Adaptive fractional dilated convolution network for image aesthetics assessment. 2020 IEEE/CVF Conf Comput Vis Pattern Recognit (CVPR). https://doi.org/10.1109/CVPR42600.2020.01412
Li L, Zhu H, Zhao S, Ding G, Lin W (2020) Personality-assisted multi-task learning for generic and personalized image aesthetics assessment. IEEE Trans Image Process 29:3898–3910. https://doi.org/10.1109/TIP.2020.2968285
Rigau J, Feixas M, Sbert M (2008) Informational aesthetics measures. IEEE Comput Grap Appl 28:24–34. https://doi.org/10.1109/MCG.2008.34
Han K, Wang Y, Guo J, Tang Y, Wu E (2022) Vision GNN: an image is worth graph of nodes. https://doi.org/10.48550/ARXIV.2206.00272
Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L (2009) ImageNet: a large-scale hierarchical image database. 2009 IEEE conference on computer vision and pattern recognition. pp 248–255. https://doi.org/10.1109/CVPR.2009.5206848
He S et al (2022) Rethinking image aesthetics assessment: models, datasets and benchmarks. IJCAI. https://www.ijcai.org/proceedings/2022/132
He S et al (2023) Eat: an enhancer for aesthetics-oriented transformers. Proceedings of the 31st ACM international conference on multimedia. https://dl.acm.org/doi/abs/10.1145/3581783.361188
Ke J et al (2021) Musiq: multi-scale image quality transformer. Proceedings of the IEEE/CVF international conference on computer vision.
Li L et al (2022) Psychology inspired model for hierarchical image aesthetic attribute prediction. 2022 IEEE international conference on multimedia and expo (ICME), IEEE
Funding
Received no external funding.
Author information
Authors and Affiliations
Contributions
Yin Wang: Methodology, Software, Writing - Original Draft.Jing Guo: Conceptualization, Supervision, Validation, Writing - Review & EditingYongzhen Ke: Conceptualization, Supervision, Methodology, Writing - Review & Editing.Kai Wang: Writing - Review & Editing, Formal analysis, Visualization.Shuai Yang: Resources, Validation, Data Curation.Liming Chen: Methodology, Writing - Review & Editing.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Ethical approval
Not applicable.
Consent for publication
The work described has not been published before. it is not under consideration for publication elsewhere. its publication has been approved by all co-authors.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, Y., Guo, J., Ke, Y. et al. Image aesthetic assessment with weighted multi-region aggregation based on information theory. Pattern Anal Applic 28, 115 (2025). https://doi.org/10.1007/s10044-025-01490-1
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1007/s10044-025-01490-1
