Model Layer Scaling TechniquesDepth Up-ScalingCOCONUT Model Layer OptimizationMoDLayerSkipRecursive TransformersTranskimmer arxiv.orghttps://arxiv.org/pdf/2203.00555