Attention-Based Semantic Segmentation Networks for Forest Applications
Deforestation remains one of the key concerning activities around the world due to commodity-driven extraction, agricultural land expansion, and urbanization. The effective and efficient monitoring of national forests using remote sensing technology is important for the early detection and mitigatio...
Published in: | Forests |
---|---|
Main Author: | |
Format: | Article |
Language: | English |
Published: |
Multidisciplinary Digital Publishing Institute (MDPI)
2023
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85180617333&doi=10.3390%2ff14122437&partnerID=40&md5=5476e8a8b9f13fefbd84ca3fa618f18b |
id |
2-s2.0-85180617333 |
---|---|
spelling |
2-s2.0-85180617333 Lim S.V.; Zulkifley M.A.; Saleh A.; Saputro A.H.; Abdani S.R. Attention-Based Semantic Segmentation Networks for Forest Applications 2023 Forests 14 12 10.3390/f14122437 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85180617333&doi=10.3390%2ff14122437&partnerID=40&md5=5476e8a8b9f13fefbd84ca3fa618f18b Deforestation remains one of the key concerning activities around the world due to commodity-driven extraction, agricultural land expansion, and urbanization. The effective and efficient monitoring of national forests using remote sensing technology is important for the early detection and mitigation of deforestation activities. Deep learning techniques have been vastly researched and applied to various remote sensing tasks, whereby fully convolutional neural networks have been commonly studied with various input band combinations for satellite imagery applications, but very little research has focused on deep networks with high-resolution representations, such as HRNet. In this study, an optimal semantic segmentation architecture based on high-resolution feature maps and an attention mechanism is proposed to label each pixel of the satellite imagery input for forest identification. The selected study areas are located in Malaysian rainforests, sampled from 2016, 2018, and 2020, downloaded using Google Earth Pro. Only a two-class problem is considered for this study, which is to classify each pixel either as forest or non-forest. HRNet is chosen as the baseline architecture, in which the hyperparameters are optimized before being embedded with an attention mechanism to help the model to focus on more critical features that are related to the forest. Several variants of the proposed methods are validated on 6120 sliced images, whereby the best performance reaches 85.58% for the mean intersection over union and 92.24% for accuracy. The benchmarking analysis also reveals that the attention-embedded high-resolution architecture outperforms U-Net, SegNet, and FC-DenseNet for both performance metrics. A qualitative analysis between the baseline and attention-based models also shows that fewer false classifications and cleaner prediction outputs can be observed in identifying the forest areas. © 2023 by the authors. Multidisciplinary Digital Publishing Institute (MDPI) 19994907 English Article All Open Access; Gold Open Access |
author |
Lim S.V.; Zulkifley M.A.; Saleh A.; Saputro A.H.; Abdani S.R. |
spellingShingle |
Lim S.V.; Zulkifley M.A.; Saleh A.; Saputro A.H.; Abdani S.R. Attention-Based Semantic Segmentation Networks for Forest Applications |
author_facet |
Lim S.V.; Zulkifley M.A.; Saleh A.; Saputro A.H.; Abdani S.R. |
author_sort |
Lim S.V.; Zulkifley M.A.; Saleh A.; Saputro A.H.; Abdani S.R. |
title |
Attention-Based Semantic Segmentation Networks for Forest Applications |
title_short |
Attention-Based Semantic Segmentation Networks for Forest Applications |
title_full |
Attention-Based Semantic Segmentation Networks for Forest Applications |
title_fullStr |
Attention-Based Semantic Segmentation Networks for Forest Applications |
title_full_unstemmed |
Attention-Based Semantic Segmentation Networks for Forest Applications |
title_sort |
Attention-Based Semantic Segmentation Networks for Forest Applications |
publishDate |
2023 |
container_title |
Forests |
container_volume |
14 |
container_issue |
12 |
doi_str_mv |
10.3390/f14122437 |
url |
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85180617333&doi=10.3390%2ff14122437&partnerID=40&md5=5476e8a8b9f13fefbd84ca3fa618f18b |
description |
Deforestation remains one of the key concerning activities around the world due to commodity-driven extraction, agricultural land expansion, and urbanization. The effective and efficient monitoring of national forests using remote sensing technology is important for the early detection and mitigation of deforestation activities. Deep learning techniques have been vastly researched and applied to various remote sensing tasks, whereby fully convolutional neural networks have been commonly studied with various input band combinations for satellite imagery applications, but very little research has focused on deep networks with high-resolution representations, such as HRNet. In this study, an optimal semantic segmentation architecture based on high-resolution feature maps and an attention mechanism is proposed to label each pixel of the satellite imagery input for forest identification. The selected study areas are located in Malaysian rainforests, sampled from 2016, 2018, and 2020, downloaded using Google Earth Pro. Only a two-class problem is considered for this study, which is to classify each pixel either as forest or non-forest. HRNet is chosen as the baseline architecture, in which the hyperparameters are optimized before being embedded with an attention mechanism to help the model to focus on more critical features that are related to the forest. Several variants of the proposed methods are validated on 6120 sliced images, whereby the best performance reaches 85.58% for the mean intersection over union and 92.24% for accuracy. The benchmarking analysis also reveals that the attention-embedded high-resolution architecture outperforms U-Net, SegNet, and FC-DenseNet for both performance metrics. A qualitative analysis between the baseline and attention-based models also shows that fewer false classifications and cleaner prediction outputs can be observed in identifying the forest areas. © 2023 by the authors. |
publisher |
Multidisciplinary Digital Publishing Institute (MDPI) |
issn |
19994907 |
language |
English |
format |
Article |
accesstype |
All Open Access; Gold Open Access |
record_format |
scopus |
collection |
Scopus |
_version_ |
1809677886922162176 |