| 注册
首页|期刊导航|液晶与显示|基于CNN-Transformer结构的遥感影像变化检测

基于CNN-Transformer结构的遥感影像变化检测

潘梦洋 杨航 范祥晖

液晶与显示2024,Vol.39Issue(10):1361-1379,19.
液晶与显示2024,Vol.39Issue(10):1361-1379,19.DOI:10.37188/CJLCD.2024-0086

基于CNN-Transformer结构的遥感影像变化检测

Remote sensing image change detection based on CNN-Transformer structure

潘梦洋 1杨航 2范祥晖1

作者信息

  • 1. 中国科学院 长春光学精密机械与物理研究所,吉林 长春 130033||中国科学院大学,北京 100049
  • 2. 中国科学院 长春光学精密机械与物理研究所,吉林 长春 130033
  • 折叠

摘要

Abstract

Modern high-resolution remote sensing images have achieved remarkable results in change detection with the aid of convolutional neural network(CNN).However,the limited receptive field of convolution operations leads to insufficient learning of global context and long-distance spatial relationships.While visual Transformers effectively capture dependencies in remote features,their handling of details in image changes is insufficient,resulting in limited spatial localization capabilities and low computational efficiency.To address these issues,this paper proposes a multi-level cross-layer linear fusion end-to-end encoding-decoding hybrid CNN-Transformer change detection model based on dilated spatial pyramid pooling,combining the advantages of visual Transformers and CNN.Firstly,image features are extracted using Siamese CNN,refined through dilated pyramid pooling to better capture detailed feature information.Secondly,the extracted attributes are converted into visual words,and a Transformer encoder models the compact visual words,feeding the learned context-rich labels back into visual space through a Transformer decoder to reinforce the original features.Thirdly,CNN features are fused with the features from Transformer encoding-decoding through skip connections,facilitating the fusion of position and semantic information by connecting features of different resolutions through upsampling.Finally,a difference enhancement module generates difference feature maps containing rich change information.Comprehensive experiments conducted on four publicly accessible remote sensing datasets,including LEVIR,CDD,DSIFN and WHUCD,confirm the efficacy of the proposed approach.Compared with other cutting-edge techniques for detecting changes,the model presented in this paper achieves superior classification performance,effectively addressing issues such as under-segmentation,over-segmentation and rough edge segmentation in change detection results.

关键词

遥感图像/变化检测/卷积神经网络/Transformer/空间空洞金字塔池化

Key words

remote sensing images/change detection/convolutional neural network/transformer/atrous spatial pyramid pooling

分类

信息技术与安全科学

引用本文复制引用

潘梦洋,杨航,范祥晖..基于CNN-Transformer结构的遥感影像变化检测[J].液晶与显示,2024,39(10):1361-1379,19.

基金项目

中国科学院青年创新促进会(No.2020220)Supported by Youth Innovation Promotion Association,Chinese Academy of Sciences(No.2020220) (No.2020220)

液晶与显示

OA北大核心CSTPCD

1007-2780

访问量0
|
下载量0
段落导航相关论文