首页|期刊导航|光学精密工程|多尺度特征增强的多帧自监督单目深度估计

多尺度特征增强的多帧自监督单目深度估计

寇旗旗王伟臣韩成功吕晨程德强姬玉成

光学精密工程2024，Vol.32Issue(24)：3603-3615,13.

光学精密工程2024，Vol.32Issue(24)：3603-3615,13.DOI:10.37188/OPE.20243224.3603

多尺度特征增强的多帧自监督单目深度估计

Multi-frame self-supervised monocular depth estimation with multi-scale feature enhancement

寇旗旗 ¹王伟臣 ²韩成功 ²吕晨 ²程德强 ²姬玉成³

作者信息

1. 中国矿业大学计算机科学与技术学院,江苏徐州 221116
2. 中国矿业大学信息与控制工程学院,江苏徐州 221116
3. 应急管理部大数据中心,北京 100013
折叠

摘要

Abstract

The current depth estimation networks do not sufficiently extract spatial features from images in outdoor scenes,leading to issues such as object edge distortion,blurriness,and regional pseudo-shadows in the output depth maps.To address these problems,this paper proposed a multi-frame self-supervised monocular depth estimation model with multi-scale feature enhancement.Firstly,the model's encoder in-corporated an activation module based on large kernel attention to enhance its ability to extract global spa-tial features from the input image,preserving more spatial context information.Simultaneously,a structur-al enhancement module was introduced that can discriminate important features across channel dimen-sions,enhancing the network's perception of the structural characteristics of the image.Finally,the decod-er used a dynamic upsampling method instead of the traditional nearest interpolation upsampling method to restore detailed information,thereby optimizing the pseudo-shadow phenomenon in the depth map to some extent.Experimental results demonstrate that the depth estimation network proposed in this paper outper-forms current mainstream algorithms in tests on the KITTI and CityScapes outdoor datasets,particularly achieving a prediction accuracy rate of 90.3%on the KITTI dataset.Visualization results also indicate that the depth maps generated by our network model have clearer and more precise edges,effectively im-proving the prediction accuracy of the depth estimation network.

关键词

单目深度估计/自监督/多帧/大核注意力/特征增强

Key words

monocular depth estimation/self-supervision/multi-frame/large kernel attention/feature enhancement

分类

信息技术与安全科学

引用本文复制引用

寇旗旗,王伟臣,韩成功,吕晨,程德强,姬玉成..多尺度特征增强的多帧自监督单目深度估计[J].光学精密工程,2024,32(24):3603-3615,13.

基金项目

国家自然科学基金项目(No.52204177) （No.52204177）

徐州市基础研究计划青年科技人才项目(No.KC23026) （No.KC23026）

光学精密工程

OA北大核心CSTPCD

ISSN：1004-924X

访问量0

下载量0

段落导航