Attention-Based Semantic Segmentation Networks for Forest Applications

Deforestation remains one of the key concerning activities around the world due to commodity-driven extraction, agricultural land expansion, and urbanization. The effective and efficient monitoring of national forests using remote sensing technology is important for the early detection and mitigatio...

Full description

Bibliographic Details
Published in:Forests
Main Author: Lim S.V.; Zulkifley M.A.; Saleh A.; Saputro A.H.; Abdani S.R.
Format: Article
Language:English
Published: Multidisciplinary Digital Publishing Institute (MDPI) 2023
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-85180617333&doi=10.3390%2ff14122437&partnerID=40&md5=5476e8a8b9f13fefbd84ca3fa618f18b
id 2-s2.0-85180617333
spelling 2-s2.0-85180617333
Lim S.V.; Zulkifley M.A.; Saleh A.; Saputro A.H.; Abdani S.R.
Attention-Based Semantic Segmentation Networks for Forest Applications
2023
Forests
14
12
10.3390/f14122437
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85180617333&doi=10.3390%2ff14122437&partnerID=40&md5=5476e8a8b9f13fefbd84ca3fa618f18b
Deforestation remains one of the key concerning activities around the world due to commodity-driven extraction, agricultural land expansion, and urbanization. The effective and efficient monitoring of national forests using remote sensing technology is important for the early detection and mitigation of deforestation activities. Deep learning techniques have been vastly researched and applied to various remote sensing tasks, whereby fully convolutional neural networks have been commonly studied with various input band combinations for satellite imagery applications, but very little research has focused on deep networks with high-resolution representations, such as HRNet. In this study, an optimal semantic segmentation architecture based on high-resolution feature maps and an attention mechanism is proposed to label each pixel of the satellite imagery input for forest identification. The selected study areas are located in Malaysian rainforests, sampled from 2016, 2018, and 2020, downloaded using Google Earth Pro. Only a two-class problem is considered for this study, which is to classify each pixel either as forest or non-forest. HRNet is chosen as the baseline architecture, in which the hyperparameters are optimized before being embedded with an attention mechanism to help the model to focus on more critical features that are related to the forest. Several variants of the proposed methods are validated on 6120 sliced images, whereby the best performance reaches 85.58% for the mean intersection over union and 92.24% for accuracy. The benchmarking analysis also reveals that the attention-embedded high-resolution architecture outperforms U-Net, SegNet, and FC-DenseNet for both performance metrics. A qualitative analysis between the baseline and attention-based models also shows that fewer false classifications and cleaner prediction outputs can be observed in identifying the forest areas. © 2023 by the authors.
Multidisciplinary Digital Publishing Institute (MDPI)
19994907
English
Article
All Open Access; Gold Open Access
author Lim S.V.; Zulkifley M.A.; Saleh A.; Saputro A.H.; Abdani S.R.
spellingShingle Lim S.V.; Zulkifley M.A.; Saleh A.; Saputro A.H.; Abdani S.R.
Attention-Based Semantic Segmentation Networks for Forest Applications
author_facet Lim S.V.; Zulkifley M.A.; Saleh A.; Saputro A.H.; Abdani S.R.
author_sort Lim S.V.; Zulkifley M.A.; Saleh A.; Saputro A.H.; Abdani S.R.
title Attention-Based Semantic Segmentation Networks for Forest Applications
title_short Attention-Based Semantic Segmentation Networks for Forest Applications
title_full Attention-Based Semantic Segmentation Networks for Forest Applications
title_fullStr Attention-Based Semantic Segmentation Networks for Forest Applications
title_full_unstemmed Attention-Based Semantic Segmentation Networks for Forest Applications
title_sort Attention-Based Semantic Segmentation Networks for Forest Applications
publishDate 2023
container_title Forests
container_volume 14
container_issue 12
doi_str_mv 10.3390/f14122437
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-85180617333&doi=10.3390%2ff14122437&partnerID=40&md5=5476e8a8b9f13fefbd84ca3fa618f18b
description Deforestation remains one of the key concerning activities around the world due to commodity-driven extraction, agricultural land expansion, and urbanization. The effective and efficient monitoring of national forests using remote sensing technology is important for the early detection and mitigation of deforestation activities. Deep learning techniques have been vastly researched and applied to various remote sensing tasks, whereby fully convolutional neural networks have been commonly studied with various input band combinations for satellite imagery applications, but very little research has focused on deep networks with high-resolution representations, such as HRNet. In this study, an optimal semantic segmentation architecture based on high-resolution feature maps and an attention mechanism is proposed to label each pixel of the satellite imagery input for forest identification. The selected study areas are located in Malaysian rainforests, sampled from 2016, 2018, and 2020, downloaded using Google Earth Pro. Only a two-class problem is considered for this study, which is to classify each pixel either as forest or non-forest. HRNet is chosen as the baseline architecture, in which the hyperparameters are optimized before being embedded with an attention mechanism to help the model to focus on more critical features that are related to the forest. Several variants of the proposed methods are validated on 6120 sliced images, whereby the best performance reaches 85.58% for the mean intersection over union and 92.24% for accuracy. The benchmarking analysis also reveals that the attention-embedded high-resolution architecture outperforms U-Net, SegNet, and FC-DenseNet for both performance metrics. A qualitative analysis between the baseline and attention-based models also shows that fewer false classifications and cleaner prediction outputs can be observed in identifying the forest areas. © 2023 by the authors.
publisher Multidisciplinary Digital Publishing Institute (MDPI)
issn 19994907
language English
format Article
accesstype All Open Access; Gold Open Access
record_format scopus
collection Scopus
_version_ 1809677886922162176