Scaled Dot-Product Attention

FlashAttention and Memory-efficient attention

PyTorch docs (torch.backends.cuda.sdp_kernel): https://pytorch.org/docs/master/backends.html#torch.backends.cuda.sdp_kernel
PyTorch blog: https://pytorch.org/blog/out-of-the-box-acceleration/
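
For reference, scaled dot-product attention computes softmax(Q K^T / sqrt(d)) V. The sketch below is a plain-Python illustration of that formula on nested lists. It is not PyTorch's implementation, and the function name and list-based layout are assumptions chosen for readability. Fused kernels such as FlashAttention and memory-efficient attention aim to produce the same result (up to floating-point differences) without materializing the full score matrix.

```python
import math


def scaled_dot_product_attention(q, k, v):
    """Reference softmax(Q K^T / sqrt(d)) V on nested lists.

    q: n_q rows of length d, k: n_k rows of length d, v: n_k rows of length d_v.
    Hypothetical helper for illustration only; no masking or dropout.
    """
    d = len(q[0])
    scale = 1.0 / math.sqrt(d)
    out = []
    for q_row in q:
        # Scaled dot product of this query with every key.
        scores = [scale * sum(a * b for a, b in zip(q_row, k_row)) for k_row in k]
        # Numerically stable softmax: subtract the row maximum before exp.
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Weighted sum of the value rows.
        out.append(
            [sum(w * v_row[j] for w, v_row in zip(weights, v)) for j in range(len(v[0]))]
        )
    return out
```

With two identical keys, the attention weights are equal, so the output is the mean of the two value rows. In PyTorch itself, the corresponding entry point is `torch.nn.functional.scaled_dot_product_attention`, and the documentation linked above covers how the backend kernels are selected.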