| 注册
首页|期刊导航|火力与指挥控制|一种无人机航拍图像多尺度目标语义分割方法

一种无人机航拍图像多尺度目标语义分割方法

张蔚

火力与指挥控制2026,Vol.51Issue(2):54-60,69,8.
火力与指挥控制2026,Vol.51Issue(2):54-60,69,8.DOI:10.3969/j.issn.1002-0640.2026.02.008

一种无人机航拍图像多尺度目标语义分割方法

A Multi-scale Target Semantic Segmentation Method for UAV Aerial Images

张蔚1

作者信息

  • 1. 河南应用技术职业学院,郑州 450042
  • 折叠

摘要

Abstract

To address the limitations of small target segmentation accuracy and cross-scale feature alignment in multi-scale target semantic segmentation tasks for UAV aerial images,this paper proposes the TransSeg model.The model integrates the global modeling capability of Swin Transformer with the symmetric encoder-decoder architecture of U-Net,and introduces D-ASPP and residual attention mechanisms.The loss function employs a collaborative optimization strategy combining Dice Loss and Focal Loss.Experiments were conducted on the UAVid dataset,with comparisons made against mainstream models including DANet,BiSeNet,and SegFormer,validating the effectiveness of TransSeg in segmenting 8 types of urban scene targets,and demonstrating strong performance on a custom dataset.The results show that the model achieves an average IoU of 66.9%,an improvement of 0.9%over SegFormer,with IoU increases of 4.1%and 5.6%for small targets such as moving vehicles and pedestrians,respectively.The research highlights the critical role of global context modeling and dynamic multi-scale fusion mechanisms in enhancing segmentation performance,providing a new solution for UAV remote sensing image processing.

关键词

无人机航拍图像/语义分割/多尺度目标/Swin Transformer/D-ASPP/残差注意力机制

Key words

UAV aerial images/semantic segmentation/multi-scale targets/Swin Transformer/D-ASPP/residual attention mechanism

分类

航空航天

引用本文复制引用

张蔚..一种无人机航拍图像多尺度目标语义分割方法[J].火力与指挥控制,2026,51(2):54-60,69,8.

基金项目

河南省教育厅资助项目(ZJC18020,ZJC17098) (ZJC18020,ZJC17098)

火力与指挥控制

1002-0640

访问量2
|
下载量0
段落导航相关论文