Back to Glossary

Test Data

Fundamentals

Untouched data for objective performance evaluation.


Test data are used only at the end to measure true generalization.

  • Requirements: Clean holdout, representative of deployment, no data leakage.
  • Metrics: Task-appropriate (e.g., F1 instead of accuracy for imbalanced data).
  • Practice: Report with confidence intervals, error analysis, and limitation notes.