🎓 Learning corner

Evaluation metrics without a statistics background

A metric is a score that answers one specific question about how well a system is doing. Start with the kind of system you are evaluating, then choose the metric family that matches it.

One-page cheat sheet

| Metric | Plain-English question | Best for |
|---|---|---|
| 🎯 Precision | When the system says yes, how often is it right? | Avoiding false alarms. |
| 🔍 Recall | Of everything we wanted to find, how much did we find? | Avoiding missed items. |
| ⚖️ F1 | Is the system both careful and complete? | Balancing precision and recall. |
| 🔍 Recall@1 | Was the correct item first? | Strict top-result evaluation. |
| ⚔️ Spearman | Do two rankings mostly agree? | Comparing ordered lists. |
| 🥇 MRR@10 | How high was the first correct item within the top 10? | Search, recommendation, and retrieval systems. |
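
To make the plain-English questions concrete, here is a minimal Python sketch of each metric using only the standard library. The function names and the toy data in the usage note are illustrative assumptions, not part of any particular evaluation library. The Spearman version assumes both rankings contain the same items with no ties, which lets it use the simple squared-rank-difference formula.

```python
def precision_recall_f1(predicted, relevant):
    """Precision, recall, and F1 for a set of 'yes' answers vs. the true set."""
    predicted, relevant = set(predicted), set(relevant)
    true_positives = len(predicted & relevant)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(relevant) if relevant else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def recall_at_1(ranked, relevant):
    """1.0 if the top-ranked item is a correct one, else 0.0."""
    return 1.0 if ranked and ranked[0] in set(relevant) else 0.0


def reciprocal_rank_at_k(ranked, relevant, k=10):
    """1 / position of the first correct item within the top k, else 0.0."""
    relevant = set(relevant)
    for position, item in enumerate(ranked[:k], start=1):
        if item in relevant:
            return 1.0 / position
    return 0.0


def mrr_at_k(queries, k=10):
    """Mean reciprocal rank over (ranked_list, relevant_items) pairs."""
    scores = [reciprocal_rank_at_k(ranked, relevant, k) for ranked, relevant in queries]
    return sum(scores) / len(scores) if scores else 0.0


def spearman(ranking_a, ranking_b):
    """Spearman correlation of two orderings of the same items (no ties, n >= 2)."""
    n = len(ranking_a)
    position_in_b = {item: i for i, item in enumerate(ranking_b)}
    squared_diffs = sum((i - position_in_b[item]) ** 2 for i, item in enumerate(ranking_a))
    return 1 - 6 * squared_diffs / (n * (n ** 2 - 1))
```

For example, `precision_recall_f1(["a", "b", "c", "d"], ["a", "b", "e"])` scores a system that said yes four times, was right twice, and missed one item. `spearman(["a", "b", "c"], ["c", "b", "a"])` compares a ranking with its exact reverse, the strongest possible disagreement.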