| 注册
首页|期刊导航|中南大学学报(自然科学版)|一种基于深度学习的异构多模态目标识别方法

一种基于深度学习的异构多模态目标识别方法

孟飞 胡超 刘伟荣

中南大学学报(自然科学版)2016,Vol.47Issue(5):1580-1587,8.
中南大学学报(自然科学版)2016,Vol.47Issue(5):1580-1587,8.DOI:10.11817/j.issn.1672-7207.2016.05.018

一种基于深度学习的异构多模态目标识别方法

Heterogeneous multimodal object recognition method based on deep learning

孟飞 1胡超 2刘伟荣3

作者信息

  • 1. 中南大学 信息科学与工程学院,湖南 长沙,410083
  • 2. 湖南省教育科学研究院,湖南 长沙,410005
  • 3. 中南大学 信息与网络中心,湖南 长沙,410083
  • 折叠

摘要

Abstract

The heterogeneous multimodal object recognition method was proposed based on deep learning. Firstly, based on the video and audio co-existing feature of media data, a heterogeneous multimodal structure was constructed to incorporate the convolutional neural network(CNN) and the restricted boltzmann machine(RBM). The audio and video information were processed respectively, generating the share characteristic representation by using the canonical correlation analysis(CCA). Then the temporal coherence of video frame was utilized to improve the recognizing accuracy further. The experiments were implemented based on the standard audio & face library and the actual movie video fragments. The results show thatforboth the two kinds ofvideo sources, the proposed method improves the accuracy of target recognition significantly.

关键词

目标识别/深度学习/卷积神经网络/限制玻尔兹曼机/典型关联分析

Key words

object recognition/deep learning/restricted boltzmann machine/convolutional neural network/canonical co rrelation analysis

分类

信息技术与安全科学

引用本文复制引用

孟飞,胡超,刘伟荣..一种基于深度学习的异构多模态目标识别方法[J].中南大学学报(自然科学版),2016,47(5):1580-1587,8.

基金项目

湖南省教育科学“十二五”规划重点项目(XJK014AJC001);国家自然科学基金资助项目(61379111,61003233,61202342);教育部-中国移动科研基金资助项目(MCM20121031)(Project(XJK014AJC001) supported by the HunanProvincialEducationScienceKey Foundationduring12thFive-YearPlan (XJK014AJC001)

Projects(61379111,61003233,61202342) supported by the National Natural Science Foundation of China (61379111,61003233,61202342)

Project(MCM20121031) supported by theScienceFund ofEducationDepartment-ChinaMobile) (MCM20121031)

中南大学学报(自然科学版)

OA北大核心CSCDCSTPCD

1672-7207

访问量4
|
下载量0
段落导航相关论文