Glossary

Recall

Learn what Recall is in machine learning, why it matters, and how it ensures AI models capture critical positive instances effectively.

Recall is a crucial performance metric in machine learning (ML) and statistical classification, measuring a model's ability to identify all relevant instances within a dataset. Specifically, it quantifies the proportion of actual positive cases that were correctly predicted as positive by the model. Also known as sensitivity or the true positive rate (TPR), Recall is particularly important in scenarios where failing to detect a positive instance (a False Negative) carries significant consequences. It helps answer the question: "Of all the actual positive instances, how many did the model correctly identify?" Evaluating models requires understanding various metrics, and Recall provides a vital perspective on completeness.

How Recall Is Calculated

Recall is calculated by dividing the number of True Positives (TP) by the sum of True Positives and False Negatives (FN). True Positives are the instances correctly identified as positive, while False Negatives are the positive instances that the model incorrectly classified as negative. A high Recall score indicates that the model is effective at finding most of the positive instances in the data. This metric is fundamental for assessing model performance, especially in tasks like object detection and image classification. Tools and platforms like Ultralytics HUB often display Recall alongside other metrics during model evaluation.

Recall Vs. Related Metrics

Understanding Recall often involves comparing it with other common evaluation metrics:

Precision: While Recall focuses on capturing all actual positive instances, Precision measures the accuracy of the positive predictions made by the model (TP / (TP + False Positives)). There is often a trade-off between Precision and Recall; improving one may decrease the other. This is known as the Precision-Recall tradeoff.
Accuracy: Accuracy measures the overall correctness of the model across all classes ((TP + True Negatives) / Total Predictions). However, accuracy can be misleading, especially with imbalanced datasets where one class significantly outnumbers the others. In such cases, Recall for the minority class is often a more informative metric.
F1-Score: The F1-Score is the harmonic mean of Precision and Recall, providing a single metric that balances both concerns. It's useful when you need a compromise between finding all positive instances (Recall) and ensuring the identified instances are indeed positive (Precision). You can explore detailed guides on YOLO performance metrics for more insights.

Importance and Applications

High Recall is critical in applications where missing positive instances is costly or dangerous. The focus is on minimizing False Negatives.

Medical Image Analysis: In diagnostic systems, such as detecting cancerous tumors from scans, maximizing Recall is paramount. Failing to detect a disease (a False Negative) can have severe health consequences for the patient, making it preferable to have some False Positives (flagging healthy tissue as potentially diseased) that can be ruled out by further testing. AI in healthcare heavily relies on high-Recall models for screening, contributing to advancements discussed in journals like Radiology: Artificial Intelligence. For example, detecting brain tumors requires high sensitivity.
Fraud Detection: In financial systems, identifying fraudulent transactions is crucial. A high Recall model ensures that most actual fraud attempts are caught, minimizing financial losses. While this might lead to flagging some legitimate transactions for review (False Positives), the cost of missing a fraudulent transaction (False Negative) is typically much higher. Many AI in finance applications prioritize Recall.
Security Systems: For surveillance systems designed to detect intruders or threats, high Recall ensures that potential dangers are not missed. Missing an actual threat (False Negative) could compromise safety, as seen in AI-driven security applications.
Quality Control in Manufacturing: In automated inspection systems, high Recall helps identify almost all defective products on a production line, preventing faulty items from reaching consumers. Missing a defect (False Negative) can lead to customer dissatisfaction and safety issues. Learn more about AI in manufacturing.

Recall In Ultralytics YOLO Models

In the context of computer vision (CV) and models like Ultralytics YOLO, Recall is a key metric used alongside Precision and mean Average Precision (mAP) to evaluate performance on tasks like object detection and instance segmentation. Achieving a good balance between Recall and Precision is often essential for robust real-world performance. For instance, when comparing models like YOLOv8 vs YOLO11, Recall helps understand how well each model identifies all target objects. Users can train custom models using frameworks like PyTorch or TensorFlow and track Recall using tools like Weights & Biases or the integrated features in Ultralytics HUB. Understanding Recall helps optimize models for specific use cases, potentially involving hyperparameter tuning or exploring different model architectures like YOLOv10 or the latest YOLO11. Resources like the Ultralytics documentation offer comprehensive guides on training and evaluation.

Recall

Flexible enterprise licensing solution to power your innovation

Train AI models in seconds with Ultralytics YOLO

Train YOLO models simply with Ultralytics HUB

How Recall Is Calculated

Recall Vs. Related Metrics

Importance and Applications

Recall In Ultralytics YOLO Models

Read more in this category

Seamlessly deploy Ultralytics YOLO11 using OpenVINO™

Battery manufacturing is being reinvented by computer vision

Seamlessly deploy Ultralytics YOLO11 using the MNN integration

Join the Ultralytics community