Back to Glossary
Test Data
Fundamentals
Untouched data for objective performance evaluation.
Test data are used only at the end to measure true generalization.
- Requirements: Clean holdout, representative of deployment, no data leakage.
- Metrics: Task-appropriate (e.g., F1 instead of accuracy for imbalanced data).
- Practice: Report with confidence intervals, error analysis, and limitation notes.