Lossless Acceleration of LLM via Adaptive N-Gram Parallel Decoding
27 points by PaulHoule | 2 comments on Hacker News.