首页|期刊导航|自动化学报|基于改进并行回火算法的RBM网络训练研究

基于改进并行回火算法的RBM网络训练研究

李飞高晓光万开方

自动化学报2017，Vol.43Issue(5)：753-764,12.

自动化学报2017，Vol.43Issue(5)：753-764,12.DOI:10.16383/j.aas.2017.c160326

基于改进并行回火算法的RBM网络训练研究

Research on RBM Networks Training Based on Improved Parallel Tempering Algorithm

李飞 ¹高晓光 ¹万开方¹

作者信息

1. 西北工业大学电子信息学院西安 710129
折叠

摘要

Abstract

Currently, most algorithms for training restricted Boltzmann machines (RBMs) are based on multi-step Gibbs sampling. When the sampling algorithm is used to calculate gradient, the sampling gradient is an approximate value of the true gradient, and there is a big error between the sampling gradient and the true gradient, which seriously affects training effect of network. This article focuses on the problems mentioned above. Firstly, numerical error and direction error between gradient and true gradient sampling are analyzed, as well as their influences on the performance of network training. The problems are theoretically analyzed from the angle of Markov sampling. Then a gradient modification model is established to adjust the numerical value and direction of sampling gradient. Furthermore, improved tempering learning based algorithm is put forward, that is, GFPT (Gradient fixing parallel tempering) algorithm. Finally, a comparative experiment on the GFPT algorithm and existing algorithms is given. It demonstrated that GFPT algorithm can greatly reduce the sampling error between sampling gradient and true gradient, and improve RBM network training precision.

关键词

深度学习/受限玻尔兹曼机/采样算法/马尔科夫理论/并行回火/GFPT

Key words

Deep learning/restricted Boltzmann machine (RBM)/sampling algorithm/Markov theory/parallel temper-ing/GFPT (Gradient fixing parallel tempering)

引用本文复制引用

李飞,高晓光,万开方..基于改进并行回火算法的RBM网络训练研究[J].自动化学报,2017,43(5):753-764,12.

基金项目

国家自然科学基金(61573285, 61305133), 中央高校基本科研业务费专项基金(3102015BJ(Ⅱ)GH01) 资助 Supported by National Natural Science Foundation of China(61573285, 61305133), Fundamental Research Funds for the Central Universities(3102015BJ(Ⅱ)GH01) （61573285, 61305133）

自动化学报

OA北大核心CSCDCSTPCD

ISSN：0254-4156

访问量0

下载量0

段落导航