Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images

Most of the current image semantic segmentation algorithms on UAV vision segment remote sensing images, which cannot represent ground detail information, resulting in obstacles to real-time autonomous environment perception of UAVs in low altitude flight missions. To address this problem, this paper...

Full description

Bibliographic Details
Published in:ACM International Conference Proceeding Series
Main Author: Yan Q.; Cheng G.
Format: Conference paper
Language:English
Published: Association for Computing Machinery 2023
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85191484767&doi=10.1145%2f3650400.3650664&partnerID=40&md5=de6376dc4d9f59174ebc1f6008ac69c1
id 2-s2.0-85191484767
spelling 2-s2.0-85191484767
Yan Q.; Cheng G.
Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images
2023
ACM International Conference Proceeding Series


10.1145/3650400.3650664
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85191484767&doi=10.1145%2f3650400.3650664&partnerID=40&md5=de6376dc4d9f59174ebc1f6008ac69c1
Most of the current image semantic segmentation algorithms on UAV vision segment remote sensing images, which cannot represent ground detail information, resulting in obstacles to real-time autonomous environment perception of UAVs in low altitude flight missions. To address this problem, this paper proposes a real-time image semantic segmentation method for low altitude UAVs. The network designs a novel hypernetwork structure that incorporates a context header weight generation module at the last layer of the encoder, the weights of each block in the decoder are generated before the end of encoding in the encoder to reduce the number of parameters and computation of the model to achieve real-time segmentation. In the decoder, a dynamic patch-wise convolutional algorithm is designed using the locally connection layer mechanism to take full account of the contextual semantic information when targeting large segmented objects that are in more than one piece, so that the decoder's weights change with the spatial location of the input feature map, and at the same time, the dynamic weights are used to target the segmentation of different objects, to maximise the network's adaptive nature. In order to verify the effectiveness of the method, this experiment uses the transfer learning technique to carry out pre-training on Cityscape data set, and uses the UAVid data set to validate the method of this paper, and the experimental results show that the mean intersection over union of this method is 66.3% for the images of the categories of buildings, roads, and static cars, and the prediction speed reaches 37.9 FPS, which significantly improves segmentation accuracy under the condition of guaranteeing the real-time performance. © 2023 ACM.
Association for Computing Machinery

English
Conference paper

author Yan Q.; Cheng G.
spellingShingle Yan Q.; Cheng G.
Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images
author_facet Yan Q.; Cheng G.
author_sort Yan Q.; Cheng G.
title Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images
title_short Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images
title_full Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images
title_fullStr Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images
title_full_unstemmed Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images
title_sort Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images
publishDate 2023
container_title ACM International Conference Proceeding Series
container_volume
container_issue
doi_str_mv 10.1145/3650400.3650664
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85191484767&doi=10.1145%2f3650400.3650664&partnerID=40&md5=de6376dc4d9f59174ebc1f6008ac69c1
description Most of the current image semantic segmentation algorithms on UAV vision segment remote sensing images, which cannot represent ground detail information, resulting in obstacles to real-time autonomous environment perception of UAVs in low altitude flight missions. To address this problem, this paper proposes a real-time image semantic segmentation method for low altitude UAVs. The network designs a novel hypernetwork structure that incorporates a context header weight generation module at the last layer of the encoder, the weights of each block in the decoder are generated before the end of encoding in the encoder to reduce the number of parameters and computation of the model to achieve real-time segmentation. In the decoder, a dynamic patch-wise convolutional algorithm is designed using the locally connection layer mechanism to take full account of the contextual semantic information when targeting large segmented objects that are in more than one piece, so that the decoder's weights change with the spatial location of the input feature map, and at the same time, the dynamic weights are used to target the segmentation of different objects, to maximise the network's adaptive nature. In order to verify the effectiveness of the method, this experiment uses the transfer learning technique to carry out pre-training on Cityscape data set, and uses the UAVid data set to validate the method of this paper, and the experimental results show that the mean intersection over union of this method is 66.3% for the images of the categories of buildings, roads, and static cars, and the prediction speed reaches 37.9 FPS, which significantly improves segmentation accuracy under the condition of guaranteeing the real-time performance. © 2023 ACM.
publisher Association for Computing Machinery
issn
language English
format Conference paper
accesstype
record_format scopus
collection Scopus
_version_ 1809677886588715008