uni:8:ml:start

This is an old revision of the document!

Maschinelles Lernen und Data Mining

Reinforcement Learning

Agent	→ Actions →	Environment
	← state ←
	← reward ←

Learner is not told what to do
Trial and error search
Delayed reward
We need to explore round exploit

uni/8/ml/start.1434007740.txt.gz · Last modified: 2020-11-18 18:10 (external edit)