Test Set estimate real-world performance
No decisions should be made using the test set
For unbiased estimate, test data should be used as a metric since they are unseen dataset.
The test set is used to evaluate the model's final performance and is not involved in the training process. In real-world applications, the test distribution often does not match the training distribution.