Speculative Decoding — SGLang
SGLang now provides an EAGLE2-based speculative decoding option. Our implementation aims to maximize speed and efficiency and is considered to be among the fastest in open-source LLM engines.
https://docs.sglang.ai/backend/speculative_decoding.html