FasterTransformer
Creator: Seonglae Cho
Created: 2023 Nov 20 15:32
Editor: Seonglae Cho
Edited: 2026 Mar 24 14:13
Refs
Tensor Parallelism
Pipeline parallelism
FasterTransformer (NVIDIA), updated 2024 Jan 30 6:23
arxiv.org
https://arxiv.org/pdf/1909.08053.pdf
Backlinks
In-Flight Batching
AI Inference