Esc

Model-free Reinforcement Learning

D3A-MFRL (Model-free Reinforcement Learning)

Definition

In reinforcement learning (RL), a model-free algorithm (as opposed to a model-based one) is an algorithm which does not use the transition probability distribution (and the reward function) associated with the Markov decision process (MDP),which, in RL, represents the problem to be solved. The transition probability distribution (or transition model) and the reward function are often collectively called the "model" of the environment (or MDP), hence the name "model-free". A model-free RL algorithm can be thought of as an "explicit" trial-and-error algorithm. An example of a model-free algorithm is Q-learning.

References

Model-free (reinforcement learning). Wikipedia. Link.)

json

D3FEND^™

A knowledge graph of cybersecurity countermeasures

1.3.0