Datasets for AI come in three types:
- Background information - Pretraining
- Problems with solution - SFT
- Practice problems - Reinforcement Learning
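As a rough illustration of the three types above, one record of each kind might look like the sketch below. The class names, field names, and the `training_stage` helper are hypothetical and chosen only for this example. Modelling the "practice problem" as a prompt plus a reward function is an assumption about how such data is commonly set up, not something stated in these notes.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class PretrainingExample:
    """Background information: raw text with no labels (hypothetical shape)."""
    text: str


@dataclass
class SFTExample:
    """A problem together with its worked solution (hypothetical shape)."""
    prompt: str
    solution: str


@dataclass
class RLExample:
    """A practice problem: no solution is given, only a way to score an attempt.

    Assumption: the score comes from a reward function that maps an answer to a float.
    """
    prompt: str
    reward_fn: Callable[[str], float]


def training_stage(example: object) -> str:
    """Map a record to the training stage named in the list above."""
    if isinstance(example, PretrainingExample):
        return "pretraining"
    if isinstance(example, SFTExample):
        return "sft"
    if isinstance(example, RLExample):
        return "reinforcement learning"
    raise TypeError(f"unknown example type: {type(example).__name__}")


# One illustrative record per type.
background = PretrainingExample(text="Addition combines two numbers into a sum.")
worked = SFTExample(prompt="What is 2 + 2?", solution="4")
practice = RLExample(
    prompt="What is 3 + 5?",
    reward_fn=lambda answer: 1.0 if answer.strip() == "8" else 0.0,
)
```

The design point is that only the SFT record carries a reference solution. The RL record can only score an attempt after it has been made.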
We typically say that a dataset is high-dimensional if the number of data points N is
smaller than the dimensionality D, i.e. N < D.
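The N < D criterion can be checked directly from the shape of a dataset. The sketch below assumes the data is stored as a list of rows, one row per data point, each with D features. The function names `shape_of` and `is_high_dimensional` are made up for this example.

```python
from typing import List, Sequence, Tuple


def shape_of(rows: Sequence[Sequence[float]]) -> Tuple[int, int]:
    """Return (N, D): number of data points and number of features per point."""
    n_points = len(rows)
    n_dims = len(rows[0]) if rows else 0
    return n_points, n_dims


def is_high_dimensional(n_points: int, n_dims: int) -> bool:
    """Apply the criterion from the text: high-dimensional when N < D."""
    return n_points < n_dims


# Three data points with five features each: N = 3, D = 5.
small_wide: List[List[float]] = [
    [0.1, 0.2, 0.3, 0.4, 0.5],
    [1.1, 1.2, 1.3, 1.4, 1.5],
    [2.1, 2.2, 2.3, 2.4, 2.5],
]

n, d = shape_of(small_wide)
print(n, d, is_high_dimensional(n, d))
```

With N = 3 and D = 5 the check reports the dataset as high-dimensional. A dataset with 100 points and 10 features would not qualify.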
- not cheatable
- large degree of intra-class variability
Datasets
Dataset Usages