Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Development/AI Inference/
FasterTransformer
Search

FasterTransformer

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2023 Nov 20 15:32
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2026 Mar 24 14:13
Refs
Refs
Tensor Parallelism
Pipeline parallelism
FasterTransformer
NVIDIA • Updated 2024 Jan 30 6:23
 
 
 
 
 
 
 
 
arxiv.org
https://arxiv.org/pdf/1909.08053.pdf
 

Backlinks

In-Flight BatchingAI Inference

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Development/AI Inference/
FasterTransformer
Copyright Seonglae Cho