Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images
Most of the current image semantic segmentation algorithms on UAV vision segment remote sensing images, which cannot represent ground detail information, resulting in obstacles to real-time autonomous environment perception of UAVs in low altitude flight missions. To address this problem, this paper...
Published in: | ACM International Conference Proceeding Series |
---|---|
Main Author: | |
Format: | Conference paper |
Language: | English |
Published: |
Association for Computing Machinery
2023
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85191484767&doi=10.1145%2f3650400.3650664&partnerID=40&md5=de6376dc4d9f59174ebc1f6008ac69c1 |
id |
2-s2.0-85191484767 |
---|---|
spelling |
2-s2.0-85191484767 Yan Q.; Cheng G. Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images 2023 ACM International Conference Proceeding Series 10.1145/3650400.3650664 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85191484767&doi=10.1145%2f3650400.3650664&partnerID=40&md5=de6376dc4d9f59174ebc1f6008ac69c1 Most of the current image semantic segmentation algorithms on UAV vision segment remote sensing images, which cannot represent ground detail information, resulting in obstacles to real-time autonomous environment perception of UAVs in low altitude flight missions. To address this problem, this paper proposes a real-time image semantic segmentation method for low altitude UAVs. The network designs a novel hypernetwork structure that incorporates a context header weight generation module at the last layer of the encoder, the weights of each block in the decoder are generated before the end of encoding in the encoder to reduce the number of parameters and computation of the model to achieve real-time segmentation. In the decoder, a dynamic patch-wise convolutional algorithm is designed using the locally connection layer mechanism to take full account of the contextual semantic information when targeting large segmented objects that are in more than one piece, so that the decoder's weights change with the spatial location of the input feature map, and at the same time, the dynamic weights are used to target the segmentation of different objects, to maximise the network's adaptive nature. In order to verify the effectiveness of the method, this experiment uses the transfer learning technique to carry out pre-training on Cityscape data set, and uses the UAVid data set to validate the method of this paper, and the experimental results show that the mean intersection over union of this method is 66.3% for the images of the categories of buildings, roads, and static cars, and the prediction speed reaches 37.9 FPS, which significantly improves segmentation accuracy under the condition of guaranteeing the real-time performance. © 2023 ACM. Association for Computing Machinery English Conference paper |
author |
Yan Q.; Cheng G. |
spellingShingle |
Yan Q.; Cheng G. Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images |
author_facet |
Yan Q.; Cheng G. |
author_sort |
Yan Q.; Cheng G. |
title |
Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images |
title_short |
Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images |
title_full |
Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images |
title_fullStr |
Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images |
title_full_unstemmed |
Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images |
title_sort |
Research on Deep Learning-based Semantic Segmentation Algorithm for UAV Images |
publishDate |
2023 |
container_title |
ACM International Conference Proceeding Series |
container_volume |
|
container_issue |
|
doi_str_mv |
10.1145/3650400.3650664 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85191484767&doi=10.1145%2f3650400.3650664&partnerID=40&md5=de6376dc4d9f59174ebc1f6008ac69c1 |
description |
Most of the current image semantic segmentation algorithms on UAV vision segment remote sensing images, which cannot represent ground detail information, resulting in obstacles to real-time autonomous environment perception of UAVs in low altitude flight missions. To address this problem, this paper proposes a real-time image semantic segmentation method for low altitude UAVs. The network designs a novel hypernetwork structure that incorporates a context header weight generation module at the last layer of the encoder, the weights of each block in the decoder are generated before the end of encoding in the encoder to reduce the number of parameters and computation of the model to achieve real-time segmentation. In the decoder, a dynamic patch-wise convolutional algorithm is designed using the locally connection layer mechanism to take full account of the contextual semantic information when targeting large segmented objects that are in more than one piece, so that the decoder's weights change with the spatial location of the input feature map, and at the same time, the dynamic weights are used to target the segmentation of different objects, to maximise the network's adaptive nature. In order to verify the effectiveness of the method, this experiment uses the transfer learning technique to carry out pre-training on Cityscape data set, and uses the UAVid data set to validate the method of this paper, and the experimental results show that the mean intersection over union of this method is 66.3% for the images of the categories of buildings, roads, and static cars, and the prediction speed reaches 37.9 FPS, which significantly improves segmentation accuracy under the condition of guaranteeing the real-time performance. © 2023 ACM. |
publisher |
Association for Computing Machinery |
issn |
|
language |
English |
format |
Conference paper |
accesstype |
|
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677886588715008 |