| 注册
首页|期刊导航|光学精密工程|多尺度特征增强的多帧自监督单目深度估计

多尺度特征增强的多帧自监督单目深度估计

寇旗旗 王伟臣 韩成功 吕晨 程德强 姬玉成

光学精密工程2024,Vol.32Issue(24):3603-3615,13.
光学精密工程2024,Vol.32Issue(24):3603-3615,13.DOI:10.37188/OPE.20243224.3603

多尺度特征增强的多帧自监督单目深度估计

Multi-frame self-supervised monocular depth estimation with multi-scale feature enhancement

寇旗旗 1王伟臣 2韩成功 2吕晨 2程德强 2姬玉成3

作者信息

  • 1. 中国矿业大学 计算机科学与技术学院,江苏 徐州 221116
  • 2. 中国矿业大学 信息与控制工程学院,江苏 徐州 221116
  • 3. 应急管理部 大数据中心,北京 100013
  • 折叠

摘要

Abstract

The current depth estimation networks do not sufficiently extract spatial features from images in outdoor scenes,leading to issues such as object edge distortion,blurriness,and regional pseudo-shadows in the output depth maps.To address these problems,this paper proposed a multi-frame self-supervised monocular depth estimation model with multi-scale feature enhancement.Firstly,the model's encoder in-corporated an activation module based on large kernel attention to enhance its ability to extract global spa-tial features from the input image,preserving more spatial context information.Simultaneously,a structur-al enhancement module was introduced that can discriminate important features across channel dimen-sions,enhancing the network's perception of the structural characteristics of the image.Finally,the decod-er used a dynamic upsampling method instead of the traditional nearest interpolation upsampling method to restore detailed information,thereby optimizing the pseudo-shadow phenomenon in the depth map to some extent.Experimental results demonstrate that the depth estimation network proposed in this paper outper-forms current mainstream algorithms in tests on the KITTI and CityScapes outdoor datasets,particularly achieving a prediction accuracy rate of 90.3%on the KITTI dataset.Visualization results also indicate that the depth maps generated by our network model have clearer and more precise edges,effectively im-proving the prediction accuracy of the depth estimation network.

关键词

单目深度估计/自监督/多帧/大核注意力/特征增强

Key words

monocular depth estimation/self-supervision/multi-frame/large kernel attention/feature enhancement

分类

信息技术与安全科学

引用本文复制引用

寇旗旗,王伟臣,韩成功,吕晨,程德强,姬玉成..多尺度特征增强的多帧自监督单目深度估计[J].光学精密工程,2024,32(24):3603-3615,13.

基金项目

国家自然科学基金项目(No.52204177) (No.52204177)

徐州市基础研究计划青年科技人才项目(No.KC23026) (No.KC23026)

光学精密工程

OA北大核心CSTPCD

1004-924X

访问量0
|
下载量0
段落导航相关论文