LLM Systems Seminar (CS7670): Week08a, KV Cache (paper, HotCRP, lottery)
3. CacheGen
- local KV cache (previous discussion)
- remote KV cache (this work)
- think of this as a database
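One way to picture the "database" framing is a key-value store where the key identifies a context and the value is the serialized KV cache for it. The sketch below is only an illustration under assumptions: the class `RemoteKVCacheStore`, the method names, and the choice of a SHA-256 hash over token IDs as the key are all hypothetical, and an in-memory dict stands in for the remote store. It is not how CacheGen itself is implemented.

```python
import hashlib
from typing import Dict, List, Optional


class RemoteKVCacheStore:
    """Hypothetical stand-in for a remote KV cache store.

    A plain dict plays the role of the remote database here.
    """

    def __init__(self) -> None:
        self._blobs: Dict[str, bytes] = {}

    @staticmethod
    def context_key(token_ids: List[int]) -> str:
        # Assumption: a context is identified by a hash of its token IDs.
        raw = ",".join(map(str, token_ids)).encode("utf-8")
        return hashlib.sha256(raw).hexdigest()

    def put(self, token_ids: List[int], kv_blob: bytes) -> str:
        # Store the serialized KV cache under the context's key.
        key = self.context_key(token_ids)
        self._blobs[key] = kv_blob
        return key

    def get(self, token_ids: List[int]) -> Optional[bytes]:
        # Return the stored blob, or None on a cache miss.
        return self._blobs.get(self.context_key(token_ids))


store = RemoteKVCacheStore()
context = [101, 2023, 2003, 1037, 2146, 6254, 102]  # made-up token IDs
store.put(context, b"serialized-kv-tensors")

hit = store.get(context)            # same context: cache hit
miss = store.get(context + [999])   # different context: cache miss
```

On a hit the server can load the stored KV cache instead of recomputing the prefill; on a miss it falls back to computing the cache locally.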
4. Debate
Q: Is it okay for systems (e.g., CacheGen) to change the numerical outputs of LLMs?
Should systems that serve LLMs be permitted to introduce numerical approximations that change the output of LLMs, as long as quality remains empirically "close enough"?
Pro side: Yes, systems serve users, not equations.
Con side: No, it violates computational integrity.
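To make the debate concrete, here is a minimal sketch of how a lossy encoding changes stored values. Uniform quantization is used purely as an illustrative stand-in for a lossy codec; the functions `quantize` and `dequantize`, the 4-bit setting, and the sample values are all assumptions for this example, not CacheGen's actual encoder or data.

```python
from typing import List, Tuple


def quantize(values: List[float], bits: int = 8) -> Tuple[List[int], float, float]:
    """Map floats to integer codes on a uniform grid over [min, max]."""
    lo, hi = min(values), max(values)
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((v - lo) / scale) for v in values]
    return codes, lo, scale


def dequantize(codes: List[int], lo: float, scale: float) -> List[float]:
    """Reconstruct approximate floats from integer codes."""
    return [lo + c * scale for c in codes]


kv_values = [0.1234, -0.5678, 0.9012, -0.3456, 0.0]  # made-up KV entries
codes, lo, scale = quantize(kv_values, bits=4)
restored = dequantize(codes, lo, scale)

# The round trip is not exact: each value may move by up to half a grid step.
max_error = max(abs(a - b) for a, b in zip(kv_values, restored))
print(f"max error {max_error:.4f}, bound {scale / 2:.4f}")
```

The restored values differ from the originals, but the error is bounded by half the grid step. The Pro side would point to that bound; the Con side would point out that the outputs are no longer those of the original computation.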