Triton

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2023 May 16 4:1
Editor
Edited
Edited
2025 Jan 27 22:9

Convert triton GPU to LLVM

currently triton only supports linux; 3-bit quantization is not supported when using triton
https://github.com/kimbochen/md-blogs/tree/main/triton
https://www.youtube.com/watch?v=AtbnRIzpwho
Triton Notion
 
https://www.youtube.com/watch?v=AtbnRIzpwho
https://www.youtube.com/watch?v=AtbnRIzpwho
https://github.com/kimbochen/md-blogs/tree/main/triton
  • API
    • Python
  • Supported Platforms:
    • Linux
  • Supported Hardware:
    • NVIDIA GPUs (Compute Capability 8.0+)
    • AMD GPUs (ROCm 6.2+)
    • Under development: CPUs
 
 
 

Documentations

Welcome to Triton’s documentation! — Triton documentation
Triton is a language and compiler for parallel programming. It aims to provide a Python-based programming environment for productively writing custom DNN compute kernels capable of running at maximal throughput on modern GPU hardware.
www.jokeren.tech
github.com
Welcome to Triton's documentation! - Triton documentation
Triton is an language and compiler for parallel programming. It aims to provide a Python-based programming environment for productively writing custom DNN compute kernels capable of running at maximal throughput on modern GPU hardware.
Introducing Triton: Open-Source GPU Programming for Neural Networks
We're releasing Triton 1.0, an open-source Python-like programming language which enables researchers with no CUDA experience to write highly efficient GPU code-most of the time on par with what an expert would be able to produce. Triton makes it possible to reach peak hardware performance with relatively little
Introducing Triton: Open-Source GPU Programming for Neural Networks

Paper

www.eecs.harvard.edu
 
 
 

Recommendations