Training speed technique
model_args
- n_layer
- n_head
- n_embd
- block_size
- bias
- dropout
functions
estimate_loss()- get loss
get_batch()- get batch
get_lr()- learning_rate
Seonglae Cho
Seonglae Choestimate_loss() - get lossget_batch() - get batchget_lr() - learning_rate