← Back to timeline
Classical ML1989demo planned

Q-Learning

Learn from reward, not labels

Q-Learning estimates how valuable each action is in each state and improves behavior through trial, error, and delayed reward.

Roadmap node · demo on the way

Read the wiki entry for Q-Learning

The interactive demo for this model is on the build list — the live linear regression demo is the template the rest will follow. The wiki entry already has the full concept, history, and where this fits in the chain.