Scaled Dot-Product Attention
FlashAttention과 Memory-efficient attention
PyTorch
An open source machine learning framework that accelerates the path from research prototyping to production deployment.
https://pytorch.org/blog/out-of-the-box-acceleration/


Seonglae Cho