This is an old revision of the document!

Maschinelles Lernen und Data Mining

Reinforcement Learning

- Learner is not told what to do - Trial and error search - Delayed reward - We need to explore round exploit