Classical ML1989demo planned

Q-Learning

Learn from reward, not labels

Q-Learning estimates how valuable each action is in each state and improves behavior through trial, error, and delayed reward.

Roadmap node · demo on the way

Read the wiki entry for Q-Learning

The interactive demo for this model is on the build list — the live linear regression demo is the template the rest will follow. The wiki entry already has the full concept, history, and where this fits in the chain.

Read the wiki →See the live demo pattern Back to timeline