Recurrent TransformerWeight Sharing with LoRA variantsRecursive Transformer MethodsMixture-of-RecursionsHierarchical Reasoning Model