改进DeepLab v3+模型下的梯田遥感提取研究OACSTPCD
Remote Sensing Extraction Method of Terraced Fields Based on Improved DeepLab v3+
[目的与意义]梯田作为农业生产的关键要素之一,其面积估算对于农业政策制定、土地规划和资源管理至关重要.为解决复杂的地形条件、种植环境导致传统遥感数据和监测方法难以开展梯田自动化提取问题,探索一种利用深度学习技术在高分辨率遥感影像中精准提取梯田面积的方法.[方法]以休耕期梯田高分六号影像构建语义分割数据集,同时提出一种改进的DeepLab v3+模型.该模型使用轻量级网络MobileNet v2作为骨干网络,为了同时兼顾局部细节和全局语境,使用多尺度特征融合(Multi-scale Feature Fusion module,MSFF)模块代替空洞空间金字塔池化(Atrous Spatial Pyramid Pooling,ASPP)模块,利用扩张率依次增大的空洞卷积级联模式改善信息丢失的问题.此外,对浅层特征和深层特征使用坐标注意力机制以加强网络对于目标的学习.[结果与讨论]利用红、绿和近红外波段组合方式在梯田提取的精度和效果上表现最佳.相比于原始DeepLab v3+网络,精确率、召回率、F1评分和交并比指标分别提升4.62%、2.61%、3.81%和2.81%.此外,与UNet和原始DeepLab v3+相比,改进的DeepLab v3+在参数量上和浮点运算数有着更为优越的性能,其参数量仅为UNet的28.6%和原始DeepLab v3+的19.5%,同时浮点运算数仅为UNet和DeepLab v3+的1/5.这不仅提高了计算效率,也使得改进后的模型更适用于资源有限或计算能力较低的环境中.[结论]深度学习在高分辨率遥感影像梯田识别中具有较高的精度,有利于为梯田精细化监测和管理提供参考依据.
[Objective]The accurate estimation of terraced field areas is crucial for addressing issues such as slope erosion control,water reten-tion,soil conservation,and increasing food production.The use of high-resolution remote sensing imagery for terraced field informa-tion extraction holds significant importance in these aspects.However,as imaging sensor technologies continue to advance,traditional methods focusing on shallow features may no longer be sufficient for precise and efficient extraction in complex terrains and environ-ments.Deep learning techniques offer a promising solution for accurately extracting terraced field areas from high-resolution remote sensing imagery.By utilizing these advanced algorithms,detailed terraced field characteristics with higher levels of automation can be better identified and analyzed.The aim of this research is to explore a proper deep learning algorithm for accurate terraced field area extraction in high-resolution remote sensing imagery. [Methods]Firstly,a terraced dataset was created using high-resolution remote sensing images captured by the Gaofen-6 satellite dur-ing fallow periods.The dataset construction process involved data preprocessing,sample annotation,sample cropping,and dataset par-titioning with training set augmentation.To ensure a comprehensive representation of terraced field morphologies,14 typical regions were selected as training areas based on the topographical distribution characteristics of Yuanyang county.To address misclassifica-tions near image edges caused by limited contextual information,a sliding window approach with a size of 256 pixels and a stride of 192 pixels in each direction was utilized to vary the positions of terraced fields in the images.Additionally,geometric augmentation techniques were applied to both images and labels to enhance data diversity,resulting in a high-resolution terraced remote sensing da-taset.Secondly,an improved DeepLab v3+model was proposed.In the encoder section,a lightweight MobileNet v2 was utilized in-stead of Xception as the backbone network for the semantic segmentation model.Two shallow features from the 4th and 7th layers of the MobileNet v2 network were extracted to capture relevant information.To address the need for local details and global context si-multaneously,the multi-scale feature fusion(MSFF)module was employed to replace the atrous spatial pyramid pooling(ASPP)mod-ule.The MSFF module utilized a series of dilated convolutions with increasing dilation rates to handle information loss.Furthermore,a coordinate attention mechanism was applied to both shallow and deep features to enhance the network's understanding of targets.This design aimed to lightweight the DeepLab v3+model while maintaining segmentation accuracy,thus improving its efficiency for practical applications. [Results and Discussions]The research findings reveal the following key points:(1)The model trained using a combination of near-in-frared,red,and green(NirRG)bands demonstrated the optimal overall performance,achieving precision,recall,F1-Score,and inter-section over union(IoU)values of 90.11%,90.22%,90.17%and 82.10%,respectively.The classification results indicated higher accu-racy and fewer discrepancies,with an error in reference area of only 12 hm2.(2)Spatial distribution patterns of terraced fields in Yuan-yang county were identified through the deep learning model.The majority of terraced fields were found within the slope range of 8º to 25º,covering 84.97%of the total terraced area.Additionally,there was a noticeable concentration of terraced fields within the alti-tude range of 1 000 m to 2 000 m,accounting for 95.02%of the total terraced area.(3)A comparison with the original DeepLab v3+network showed that the improved DeepLab v3+model exhibited enhancements in terms of precision,recall,F1-Score,and IoU by 4.62%,2.61%,3.81%and 2.81%,respectively.Furthermore,the improved DeepLab v3+outperformed UNet and the original Deep-Lab v3+in terms of parameter count and floating-point operations.Its parameter count was only 28.6%of UNet and 19.5%of the original DeepLab v3+,while the floating-point operations were only 1/5 of UNet and DeepLab v3+.This not only improved computa-tional efficiency but also made the enhanced model more suitable for resource-limited or computationally less powerful environments.The lightweighting of the DeepLab v3+network led to improvements in accuracy and speed.However,the slection of the NirGB band combination during fallow periods significantly impacted the model's generalization ability. [Conclusions]The research findings highlights the significant contribution of the near-infrared(NIR)band in enhancing the model's ability to learn terraced field features.Comparing different band combinations,it was evident that the NirRG combination resulted in the highest overall recognition performance and precision metrics for terraced fields.In contrast to PSPNet,UNet,and the original DeepLab v3+,the proposed model showcased superior accuracy and performance on the terraced field dataset.Noteworthy improve-ments were observed in the total parameter count,floating-point operations,and the Epoch that led to optimal model performance,out-performing UNet and DeepLab v3+.This study underscores the heightened accuracy of deep learning in identifying terraced fields from high-resolution remote sensing imagery,providing valuable insights for enhanced monitoring and management of terraced land-scapes.
张俊;陈雨艳;秦震宇;张梦瑶;张军
云南大学 地球科学学院,云南昆明 650500,中国云南大学 国际河流与生态安全研究院,云南昆明 650500,中国
农业科学
梯田提取遥感卷积神经网络高分六号卫星DeepLab v3+
terrace extractionremote sensingconvolutional neural networkGF-6 satelliteDeepLab v3+
《智慧农业(中英文)》 2024 (003)
46-57 / 12
国防科技工业局高分专项云南省政府综合治理深度应用与规模化产业化示范项目(89-Y50G31-9001-22/23);云南大学研究生科研创新基金(KC-22222840) State Administration of Science,Technology and Industry for National Defense Gaofen Special Yunnan Provincial Government Comprehensive Management of Deep Application and Large-Scale Industrialization Demonstration Projects(89-Y50G31-9001-22/23);Yunnan University Graduate Research Innovation Fund(KC-22222840)
评论