Repository logo
  • English
  • Deutsch
  • Español
  • Français
  • Log In
    New user? Click here to register.Have you forgotten your password?
Universidad Panamericana
  • Communities & Collections
  • Research Outputs
  • Fundings & Projects
  • Researchers
  • Statistics
  • Feedback
  • English
  • Deutsch
  • Español
  • Français
  1. Home
  2. CRIS
  3. Publications
  4. A Novel Artificial Hydrocarbon Networks Based Value Function Approximation in Hierarchical Reinforcement Learning
 
  • Details
Options

A Novel Artificial Hydrocarbon Networks Based Value Function Approximation in Hierarchical Reinforcement Learning

Journal
Advances in Soft Computing
Lecture Notes in Computer Science
ISSN
0302-9743
1611-3349
Date Issued
2017
Author(s)
Ponce, Hiram  
Facultad de Ingeniería - CampCM  
Type
Resource Types::text::book::book part
DOI
10.1007/978-3-319-62428-0_18
URL
https://scripta.up.edu.mx/handle/123456789/4424
Abstract
Reinforcement learning aims to solve the problem of learning optimal or near-optimal decision-making policies for a given domain problem. However, it is known that increasing the dimensionality of the input space (i.e. environment) will increase the complexity for the learning algorithms, falling into the curse of dimensionality. Value function approximation and hierarchical reinforcement learning have been two different approaches proposed to alleviate reinforcement learning from this illness. In that sense, this paper proposes a new value function approximation using artificial hydrocarbon networks –a supervised learning method inspired on chemical carbon networks– with regularization at each subtask in a hierarchical reinforcement learning framework. Comparative results using a greedy sparse value function approximation over the MAXQ hierarchical method was computed, proving that artificial hydrocarbon networks improves accuracy and efficiency on the value function approximation. © Springer International Publishing AG 2017.
Subjects

Artificial organic ne...

Machine learning

Regularization

Reinforcement learnin...

Carbon

Decision making

Hydrocarbons

Learning algorithms

Learning systems

Soft computing

Curse of dimensionali...


Copyright 2024 Universidad Panamericana
Términos y condiciones | Política de privacidad | Reglamento General

Built with DSpace-CRIS software - Extension maintained and optimized by - Hosting & support SCImago Lab

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback