AI Optimization, Inference Optimization

Model Inference Tools
  Triton Inference
  vLLM
  Exo
  TensorRT
  DeepSparse
  OpenVINO
  Sparsify
  PowerInfer
  FlexFlow
  Transformer Engine
  Faster Transformer
  TensorIR
  xFormers
  Torchchat
  Exo Inference
  AirLLM
  Lingua

AI Server

Model Inference Servers
  TGI
  TEI
  ONNX Server
  TorchServe
  KServe
  Truss
  BentoML
  Nvidia NIM

AI Performance Libraries
  GGML
  Flashlight
  FastAI

LLM inference cost is going down fast

Welcome to LLMflation - LLM inference cost is going down fast ⬇️ | Andreessen Horowitz
For an LLM of equivalent performance, the inference cost is decreasing by 10x every year. What cost $60 per million tokens in 2021 costs $0.06 per million tokens today.
https://a16z.com/llmflation-llm-inference-cost
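
The two numbers quoted in the a16z summary can be checked against each other with a little arithmetic. The sketch below is only an illustration: it assumes a constant 10x drop per year, and the function names (projected_cost, implied_years) are hypothetical helpers, not taken from the article. It shows that going from $60 to $0.06 per million tokens is a 1000x drop, which corresponds to about three years at that rate.

```python
import math


def projected_cost(start_cost: float, years: float, yearly_factor: float = 10.0) -> float:
    """Cost per million tokens after `years`, assuming it falls by
    `yearly_factor` every year (an assumed constant rate)."""
    return start_cost / yearly_factor ** years


def implied_years(start_cost: float, end_cost: float, yearly_factor: float = 10.0) -> float:
    """Number of years needed to go from start_cost to end_cost
    at a constant `yearly_factor` drop per year."""
    return math.log(start_cost / end_cost) / math.log(yearly_factor)


if __name__ == "__main__":
    # Figures quoted in the summary: $60 and $0.06 per million tokens.
    ratio = 60.0 / 0.06
    print(f"total drop: {ratio:.0f}x")
    print(f"implied years at 10x/year: {implied_years(60.0, 0.06):.2f}")
    print(f"cost after 3 years from $60: ${projected_cost(60.0, 3):.2f}")
```

The rate is a parameter rather than a constant so the same check can be rerun with a different assumed yearly factor.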