Boggle Nogs
Compiling LLMs into a MegaKernel: A path to low-latency inference
(zhihaojia.medium.com)
306 points
76 comments