torch.backends.cuda.sdp_kernel

Creator

Seonglae Cho

Created

2023 Nov 20 15:4

Editor

Seonglae Cho

Edited

2023 Nov 20 15:7

Refs

Flash Attention

Dot-Product Attention

Scaled Dot-Product Attention

FlashAttention and Memory-efficient attention

https://pytorch.org/docs/master/backends.html#torch.backends.cuda.sdp_kernel

An open source machine learning framework that accelerates the path from research prototyping to production deployment.

https://pytorch.org/blog/out-of-the-box-acceleration/
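As a rough illustration of what the context manager does, here is a minimal sketch, assuming PyTorch 2.0 or later. The tensor shapes and the choice to allow only the math backend are illustrative assumptions, picked so the snippet also runs on a machine without a GPU. Newer PyTorch releases mark `torch.backends.cuda.sdp_kernel` as deprecated in favor of `torch.nn.attention.sdpa_kernel`, so a deprecation warning may appear.

```python
import torch
import torch.nn.functional as F

# Fall back to CPU so the sketch runs anywhere; the fused kernels need CUDA.
device = "cuda" if torch.cuda.is_available() else "cpu"
dtype = torch.float16 if device == "cuda" else torch.float32

# Illustrative shapes (assumption): (batch, heads, seq_len, head_dim)
q = torch.randn(2, 4, 128, 64, device=device, dtype=dtype)
k = torch.randn_like(q)
v = torch.randn_like(q)

# Inside the block, only the backends enabled here may be picked by
# scaled_dot_product_attention. Here only the plain math backend is allowed.
with torch.backends.cuda.sdp_kernel(
    enable_flash=False,
    enable_math=True,
    enable_mem_efficient=False,
):
    flash_enabled = torch.backends.cuda.flash_sdp_enabled()
    math_enabled = torch.backends.cuda.math_sdp_enabled()
    out = F.scaled_dot_product_attention(q, k, v)

print(out.shape, flash_enabled, math_enabled)
```

To request FlashAttention instead, one would set `enable_flash=True` and the other two flags to `False`. In that case the call raises an error when the inputs or hardware do not meet the flash kernel's requirements, which makes the pattern useful for checking which backend is actually available.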
