Test vs Validation
Dev set or validation set:
Same data as train, use in train model, maybe compare two different models' performance
Test set:
Same or different data as train, use in fine-tuning (after train model) unbiased estimate performance of model (can skip test set)
Mismatch train & test: cropped photo vs photo in the wild (low resolution, weird angle)
How to split data:
Train | Dev | Test | |
Small dataset | 60% (6,000) | 20% (2,000) | 20% (2,000) |
Big dataset | 98% (1,000,000) | 1% (10,000) | 1% (10,000) |
Last updated