| 注册
首页|期刊导航|计算机技术与发展|基于白盒Transformer与动态卷积的弱监督语义分割

基于白盒Transformer与动态卷积的弱监督语义分割

严格 刘进锋

计算机技术与发展2026,Vol.36Issue(1):38-45,8.
计算机技术与发展2026,Vol.36Issue(1):38-45,8.DOI:10.20165/j.cnki.ISSN1673-629X.2025.0195

基于白盒Transformer与动态卷积的弱监督语义分割

Weakly Supervised Semantic Segmentation Based on White Box Transformer and Dynamic Convolution

严格 1刘进锋1

作者信息

  • 1. 宁夏大学 信息工程学院,宁夏 银川 750021
  • 折叠

摘要

Abstract

The weakly supervised semantic segmentation method based on image level labels has attracted much attention because it can train the network with a small number of image level labels to reduce annotation burden.Class activation map generation is a commonly used method in this field,but its quality is limited by the sparsity of initial localization and insufficient feature expression ability.Although existing methods based on visual Transformers optimize class activation maps through self attention,their black box characteristics lead to scattered attention regions,making static convolution difficult to adapt to multi-scale targets,and cross entropy loss is easily dominated by simple samples.To address the aforementioned issues,we propose a weakly supervised semantic segmentation method based on white box Transformer and dynamic convolution.Firstly,a sparse coding white box Transformer module is constructed to generate high-precision class activation maps through interpretable sparse coding mechanisms,effectively suppressing background noise.Secondly,a dynamic conditional convolution module is designed to achieve accurate feature extraction of multi-scale targets by a-daptively adjusting the convolution kernel parameters.Finally,the introduction of Focal Loss improves the segmentation accuracy of the model for difficult to distinguish samples by dynamically suppressing the weights of easily separable samples.Compared to mainstream methods in PASCAL VOC 2012 and MS COCO 2014 validation sets,the proposed method is improved by 1.6 percentage points and 1.3 percentage points in terms of performance,respectively.The experimental results indicate that the model proposed can obtain a more complete class activation graph.

关键词

弱监督学习/语义分割/图像级标签/白盒Transformer/动态卷积/类激活图

Key words

weakly supervised learning/semantic segmentation/image level labels/white box Transformer/dynamic convolution/class ac-tivation map

分类

信息技术与安全科学

引用本文复制引用

严格,刘进锋..基于白盒Transformer与动态卷积的弱监督语义分割[J].计算机技术与发展,2026,36(1):38-45,8.

基金项目

宁夏自然科学基金(2023AAC03126) (2023AAC03126)

计算机技术与发展

1673-629X

访问量0
|
下载量0
段落导航相关论文