nerc.ac.uk

Superpixel-based attention graph neural network for semantic segmentation in aerial images

Diao, Qi; Dai, Yaping; Zhang, Ce ORCID: https://orcid.org/0000-0001-5100-3584; Wu, Yan; Feng, Xiaoxue; Pan, Feng. 2022 Superpixel-based attention graph neural network for semantic segmentation in aerial images [in special issue: Advances in object-based image analysis - linked with computer vision and machine learning] Remote Sensing, 14 (2), 305. 17, pp. https://doi.org/10.3390/rs14020305

Before downloading, please read NORA policies.
[img]
Preview
Text
N531807JA.pdf - Published Version
Available under License Creative Commons Attribution 4.0.

Download (3MB) | Preview

Abstract/Summary

Semantic segmentation is one of the significant tasks in understanding aerial images with high spatial resolution. Recently, Graph Neural Network (GNN) and attention mechanism have achieved excellent performance in semantic segmentation tasks in general images and been applied to aerial images. In this paper, we propose a novel Superpixel-based Attention Graph Neural Network (SAGNN) for semantic segmentation of high spatial resolution aerial images. A K-Nearest Neighbor (KNN) graph is constructed from our network for each image, where each node corresponds to a superpixel in the image and is associated with a hidden representation vector. On this basis, the initialization of the hidden representation vector is the appearance feature extracted by a unary Convolutional Neural Network (CNN) from the image. Moreover, relying on the attention mechanism and recursive functions, each node can update its hidden representation according to the current state and the incoming information from its neighbors. The final representation of each node is used to predict the semantic class of each superpixel. The attention mechanism enables graph nodes to differentially aggregate neighbor information, which can extract higher-quality features. Furthermore, the superpixels not only save computational resources, but also maintain object boundary to achieve more accurate predictions. The accuracy of our model on the Potsdam and Vaihingen public datasets exceeds all benchmark approaches, reaching 90.23% and 89.32%, respectively.

Item Type: Publication - Article
Digital Object Identifier (DOI): https://doi.org/10.3390/rs14020305
UKCEH and CEH Sections/Science Areas: Soils and Land Use (Science Area 2017-)
ISSN: 2072-4292
Additional Information. Not used in RCUK Gateway to Research.: Open Access paper - full text available via Official URL link.
Additional Keywords: graph neural networks, superpixel, attention mechanism, semantic segmentation, aerial images
NORA Subject Terms: Electronics, Engineering and Technology
Computer Science
Date made live: 21 Jan 2022 14:44 +0 (UTC)
URI: https://nora.nerc.ac.uk/id/eprint/531807

Actions (login required)

View Item View Item

Document Downloads

Downloads for past 30 days

Downloads per month over past year

More statistics for this item...