Echo's learning mimics the way the hippocampus in our brain works.
Basically, the hippocampus is a component of our brain that is key for memory, imagination and learning. To create abstract knowledge, the hippocampus retrieves recent experiences from the memory, and repeats them at much higher speed to which those experiences occurred, giving priority to those experiences that lead to greater reward, activating the imagination to construct scenes for the future, and comparing those imagined experiences with what really happened in reality, all of this with the sole purpose of adjusting the responses of the neural network system.
Now, in Echo, the imagination has been removed, because it is an expensive process that requires a lot of computer power to make the avatar work. In this way, Echo goes beyond the design of our brain, to take advantage of the possibilities that comes from creating intelligence artificially.
This evolution in the design of the digital brain offers improved speed, making the avatar super light in terms of requirements of computer power for learning, and also much faster response when operating.
After all, there is no need to implement the exact same design as in our human brain, that operates with carbon material, to simulate an entity that operates with silicon material.
Learning Loop
Please have a closer look at the image at the top of the page, which represents a learning loop with two components: the actors, that generate experiences of interaction between the avatar and the environment, and the learner that improves the responses of the deep neural network.
Initially, the behavior of the avatar is random. Every certain period of time (learning steps) the learner updates the neural network that the actors use to generate experiences. With every new version of the neural network, the actors generate experiences that are slightly better than the ones generated with the previous version. This improvement from one version of the neural network to the next one is imperceptible, but over time, the avatar improves the responses of the neural network to the point of optimizing the performance.
Key points:
Echo, a machine learning model, learns in a similar way to humans. This is because it incorporates the learning principles of the hippocampus, a part of the brain responsible for memory, imagination and learning.
The responses of the neural network used by Echo are initially random. Every set number of learning steps, the learner updates the neural network. This updated version is then used by the Self-Play agent to generate new experiences.
With each new version of the neural network, the quality of experiences generated by the Self-Play agent improves. Although the improvement may not be noticeable with each iteration, over time, the experiences become better and better. Eventually, the avatar will have mastered the hidden dynamics of the environment and will maximize the rewards received at the end of each episode.
Want to find out details about what the Actors do? An actor is a machine that is autonomously generating experiences that are stored in the avatar's memory. Many actors work as team to generate enough experiences to feed the learner.
Echo Brains
Copyright © 2023 Echo Brains - All rights reserved.
Usamos cookies para analizar el tráfico del sitio web y optimizar tu experiencia en el sitio. Al aceptar nuestro uso de cookies, tus datos se agruparán con los datos de todos los demás usuarios.