Esc  
              
   
 Policy Gradient
Definition
The objective of a Reinforcement Learning Policy Gradient agent is to maximize the “expected” reward when following a policy
References
Policy Gradients in a Nutshell. Towards Data Science. Link.
        loading...
      
 loading...
  D3FEND™ 
 A knowledge graph of cybersecurity countermeasures