A novel artificial hydrocarbon networks based value function approximation in hierarchical reinforcement learning
Abstract
Reinforcement learning aims to learn optimal or near-optimal decision-making policies for a given domain problem. However, increasing the dimensionality of the input space (i.e., the environment) increases the complexity of the learning algorithms, a problem known as the curse of dimensionality. Value function approximation and hierarchical reinforcement learning are two different approaches that have been proposed to alleviate this problem. In that sense, this paper proposes a new value function approximation using artificial hydrocarbon networks (a supervised learning method inspired by chemical carbon networks) with regularization at each subtask in a hierarchical reinforcement learning framework. Comparative results using a greedy sparse value function approximation over the MAXQ hierarchical method were computed, showing that artificial hydrocarbon networks improve the accuracy and efficiency of the value function approximation. © Springer International Publishing AG 2017.
Related items
Showing items related by title, author, creator and subject.
- Stochastic parallel extreme artificial hydrocarbon networks: an implementation for fast and robust supervised machine learning in high-dimensional data
  Ponce, Hiram; González Mora, José Guillermo (Elsevier Ltd., 2020-03)
  Artificial hydrocarbon networks (AHN), a supervised learning method inspired by organic chemical structures and mechanisms, have shown improvements in predictive power and interpretability in comparison with other ...
- A method to improve speed of training algorithm in artificial hydrocarbon networks
  Martinez-Villaseñor, Lourdes; Ponce, Hiram (Institute of Electrical and Electronics Engineers Inc., 2020)
  Artificial hydrocarbon networks (AHN) is a supervised machine learning method inspired by chemical carbon networks that simulate heuristic chemical rules involved within organic molecules to represent the structure and ...
- A methodology based on deep learning for advert value calculation in CPM, CPC and CPA networks
  Miralles, Luis (Springer Verlag, 2017)
  In this research, we propose a methodology for advert value calculation in CPM, CPC and CPA networks. Accurately estimating this value increases the three previous networks’ incomes by selecting the most profitable advert. ...