WGMMA

Creator

Creator

Seonglae Cho

Created

Created

2024 May 16 16:11

Editor

Editor

Seonglae Cho

Edited

Edited

2024 May 16 16:12

Refs

Refs

WGMMA (warp group matrix multiply accumulate)

The programming guide to using PTX (Parallel Thread Execution) and ISA (Instruction Set Architecture).

https://docs.nvidia.com/cuda/parallel-thread-execution/index.html

how make gpu fast?

https://hazyresearch.stanford.edu/blog/2024-05-12-tk

Recommendations

////////