Internalizing CoT stepsStepwise InternalizationTrain to predict CoT tokens and result tokensReduce CoT tokens for each epochRemoval SmoothingReset Optimizer arxiv.orghttps://arxiv.org/pdf/2405.14838