
Machine Learning and Data Mining

Reinforcement Learning

Agent → action → Environment
Agent ← state  ← Environment
Agent ← reward ← Environment
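
The loop in the diagram can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `CorridorEnv`, `run_episode`, and `always_right` are hypothetical names made up for this example and do not come from the lecture or from any library. The agent sends an action, and the environment answers with the next state and a reward.

```python
class CorridorEnv:
    """Hypothetical toy environment: walk from cell 0 to the last cell."""

    def __init__(self, length=5):
        self.length = length
        self.state = 0

    def reset(self):
        # Start every episode in the leftmost cell.
        self.state = 0
        return self.state

    def step(self, action):
        # action is -1 (left) or +1 (right); the position is clipped
        # so the agent cannot leave the corridor.
        self.state = min(max(self.state + action, 0), self.length - 1)
        done = self.state == self.length - 1
        reward = 1.0 if done else 0.0  # reward arrives only at the goal
        return self.state, reward, done


def run_episode(env, policy, max_steps=100):
    """Agent -> action -> environment -> (state, reward) -> agent."""
    state = env.reset()
    total_reward = 0.0
    for _ in range(max_steps):
        action = policy(state)                  # agent chooses an action
        state, reward, done = env.step(action)  # environment responds
        total_reward += reward
        if done:
            break
    return total_reward


def always_right(state):
    # Hypothetical fixed policy: ignore the state and move right.
    return 1


if __name__ == "__main__":
    print(run_episode(CorridorEnv(5), always_right))
```

In this sketch the reward is zero on every step except the last one, so the agent only finds out at the end of the episode whether its earlier actions were useful.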

- The learner is not told what to do
- Trial-and-error search
- Delayed reward
- We need to both explore and exploit
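
One common way to combine exploration and exploitation is an epsilon-greedy rule on a multi-armed bandit. The sketch below is an assumption added for illustration, not a method named in these notes: the function `epsilon_greedy_bandit`, the win probabilities, and the parameter values are all hypothetical. With probability `epsilon` the learner tries a random arm (explore); otherwise it picks the arm with the best reward estimate so far (exploit).

```python
import random


def epsilon_greedy_bandit(win_probs, steps=5000, epsilon=0.1, seed=0):
    """Hypothetical epsilon-greedy learner on a Bernoulli bandit.

    win_probs holds the true payout probability of each arm. The learner
    never sees these values; it only observes sampled rewards.
    """
    rng = random.Random(seed)
    n_arms = len(win_probs)
    counts = [0] * n_arms    # how often each arm was pulled
    values = [0.0] * n_arms  # running mean reward per arm

    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: random arm
        else:
            # exploit: arm with the highest estimate so far
            arm = max(range(n_arms), key=lambda a: values[a])

        # Trial and error: pull the arm and observe a 0/1 reward.
        reward = 1.0 if rng.random() < win_probs[arm] else 0.0

        # Incremental update of the running mean for this arm.
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]

    return counts, values


if __name__ == "__main__":
    counts, values = epsilon_greedy_bandit([0.2, 0.5, 0.8])
    print(counts)
    print([round(v, 2) for v in values])
```

Setting `epsilon=0` makes the learner purely greedy, so it can lock onto the first arm that happens to pay out. Setting `epsilon=1` makes it explore forever without ever using what it has learned.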

uni/8/ml/start.1434007336.txt.gz · Last modified: 2020-11-18 18:10 (external edit)