A novel artificial hydrocarbon networks based value function approximation in hierarchical reinforcement learning
MetadataShow full item record
Reinforcement learning aims to solve the problem of learning optimal or near-optimal decision-making policies for a given domain problem. However, it is known that increasing the dimensionality of the input space (i.e. environment) will increase the complexity for the learning algorithms, falling into the curse of dimensionality. Value function approximation and hierarchical reinforcement learning have been two different approaches proposed to alleviate reinforcement learning from this illness. In that sense, this paper proposes a new value function approximation using artificial hydrocarbon networks –a supervised learning method inspired on chemical carbon networks– with regularization at each subtask in a hierarchical reinforcement learning framework. Comparative results using a greedy sparse value function approximation over the MAXQ hierarchical method was computed, proving that artificial hydrocarbon networks improves accuracy and efficiency on the value function approximation. © Springer International Publishing AG 2017.
Showing items related by title, author, creator and subject.
Stochastic parallel extreme artificial hydrocarbon networks : an implementation for fast and robust supervised machine learning in high-dimensional data Ponce, Hiram; González Mora, José Guillermo (Elsevier Ltd., 2020-03)Artificial hydrocarbon networks (AHN) – a supervised learning method inspired on organic chemical structures and mechanisms – have shown improvements in predictive power and interpretability in comparison with other ...
Martinez-Villaseñor, Lourdes; Ponce, Hiram (Institute of Electrical and Electronics Engineers Inc., 2020)Artificial hydrocarbon networks (AHN) is a supervised machine learning method inspired on chemical carbon networks that simulate heuristic chemical rules involved within organic molecules to represent the structure and ...
Miralles, Luis (Springer Verlag, 2017)In this research, we propose a methodology for advert value calculation in CPM, CPC and CPA networks. Accurately estimating this value increases the three previous networks’ incomes by selecting the most profitable advert. ...