uni:8:ml:start
This is an old revision of the document!
Maschinelles Lernen und Data Mining
Reinforcement Learning
Agent | → Actions → | Environment |
---|---|---|
← state ← | ||
← reward ← |
- Learner is not told what to do
- Trial and error search
- Delayed reward
- We need to explore round exploit
uni/8/ml/start.1434007740.txt.gz · Last modified: 2020-11-18 18:10 (external edit)