We're thrilled to announce that NVIDIA Dynamo has integrated LMCache as a KV caching layer. This is a big milestone: Dynamo gets a battle-tested caching solution, and LMCache becomes part of a data center-scale inference platform used by many developers worldwide to deploy AI at scale.
[Read More]
We're thrilled to share that LMCache has officially crossed 5,000 GitHub stars! This milestone is not just a number: it's a strong signal that KV cache technology has become a first-class citizen in the LLM inference stack, and that our community is leading the way.
[Read More]