[CL] Draft-based Approximate Inference for LLMs [FuriosaAI & UW-Madison] https://arxiv.org/abs/2506.08373