| 注册
首页|期刊导航|计算机应用研究|WKCN:基于小波-KAN协同优化的自适应图像描述算法

WKCN:基于小波-KAN协同优化的自适应图像描述算法

何世鹏 郑豪

计算机应用研究2026,Vol.43Issue(3):696-703,8.
计算机应用研究2026,Vol.43Issue(3):696-703,8.DOI:10.19734/j.issn.1001-3695.2025.07.0256

WKCN:基于小波-KAN协同优化的自适应图像描述算法

WKCN:adaptive image description algorithm based on wavelet-KAN collaborative optimization

何世鹏 1郑豪2

作者信息

  • 1. 南京中医药大学人工智能与信息技术学院/江苏省智慧中医药健康服务工程研究中心,南京 210023
  • 2. 南京中医药大学人工智能与信息技术学院/江苏省智慧中医药健康服务工程研究中心,南京 210023||南京晓庄学院信息工程学院,南京 211171
  • 折叠

摘要

Abstract

This study addressed key challenges in image captioning,including inadequate multi-scale texture representation,redundant feature fusion,and limited dynamic semantic modeling.This paper proposed a novel algorithm named WKCN based on wavelet-KAN collaborative optimization.It designed the wavelet-KAN multi-scale nonlinear enhancement(WKMNE)modu-le to decompose image features using Daubechies-4 wavelet bases and enhance textures via B-spline interpolation in KAN.A KAN-based adaptive feature fusion(KAN-AFF)mechanism dynamically generated spatial and channel weights to integrate global features from ResNet50 and frequency-domain features from WKMNE.Finally,a KAN-enhanced dynamic decoder(KED)replaced the static feed-forward network in Transformer with a learnable KAN activation module to strengthen semantic mapping.Experiments on the MSCOCO dataset show that WKCN achieves optimal scores on BLEU-1(81.1),ROUGE-L(59.1)and CIDEr(133.6).Ablation studies confirme the synergy between multi-scale feature extraction and dynamic deco-ding.Hyperparameter analysis verifies the rationality of parameter selection.Cross-dataset tests on Flickr30k and NoCaps demonstrate strong generalization capability.Visualization analyses illustrate the model's effectiveness intuitively.This work provides a verifiable nonlinear optimization paradigm for cross-modal semantic generation tasks.

关键词

图像描述/小波卷积/Kolmogorov-Arnold/动态特征融合/多尺度建模/动态解码

Key words

image captioning/wavelet convolution/Kolmogorov-Arnold/adaptive feature fusion/multi-scale modeling/dynamic decoding

分类

信息技术与安全科学

引用本文复制引用

何世鹏,郑豪..WKCN:基于小波-KAN协同优化的自适应图像描述算法[J].计算机应用研究,2026,43(3):696-703,8.

基金项目

国家自然科学基金资助项目(61976118) (61976118)

江苏省研究生科研创新计划资助项目(KYCX25_2266) (KYCX25_2266)

计算机应用研究

1001-3695

访问量0
|
下载量0
段落导航相关论文