Classical ML1989demo planned
Q-Learning
Learn from reward, not labels
Q-Learning estimates how valuable each action is in each state and improves behavior through trial, error, and delayed reward.
Roadmap node · demo on the way
Read the wiki entry for Q-Learning
The interactive demo for this model is on the build list — the live linear regression demo is the template the rest will follow. The wiki entry already has the full concept, history, and where this fits in the chain.