Why data splits exist
Machine learning models can look good during development and still fail in the real world. Data splits help reduce that risk.
The three roles
Training data is what the model learns from. Validation data helps you compare model choices. Test data is held back until the end so you can estimate how the model performs on unseen data.
- Do not tune your model directly on the test set.
- Use validation results to make development decisions.
- Treat the test result as a final check, not a playground.
The builder mindset
Good evaluation is a product habit, not only a research habit. If an AI feature affects users, your evaluation setup should match the real use case as closely as possible.