数据采集与处理Issue(6):1310-1317,8.DOI:10.16337/j.1004-9037.2015.06.021
一种新的连续动作集学习自动机
New Continuous Action-set Learning Automation
刘晓 1毛宁1
作者信息
- 1. 中航工业西安航空计算技术研究所,西安,710065
- 折叠
摘要
Abstract
Learning automaton (LA) is an adaptive decision maker that learns to choose the optimal action from a set of allowable actions through repeated interactions with a random environment .In most of the traditional LA ,the action set is always taken to be finite . Hence ,for continuous parameter learning problems ,the action space needs to be discretized ,and the accuracy of the solutions depends on the level of the discretization .A new continuous action‐set learning automaton (CALA)is proposed .The action set of the automaton is a variable interval ,and actions are selected according to a uniform distribution o‐ver this interval .The end‐points of the interval are updated using the binary feedback signal from the en‐vironment .Simulation results with a multi‐modal learning problem experiment demonstrate the superior‐ity of the new algorithm over three existing CALA algorithms .关键词
机器学习/强化学习/在线学习/学习自动机/连续动作集学习自动机Key words
machine learning/reinforcement learning/online learning/learning automata/continuous ac-tion-set learning automata分类
信息技术与安全科学引用本文复制引用
刘晓,毛宁..一种新的连续动作集学习自动机[J].数据采集与处理,2015,(6):1310-1317,8.