LMCache | LMCache blog website

LMCache on Google Kubernetes Engine: Boosting LLM Inference Performance with KV Cache on Tiered Storage
By Danna Wang, Google
Posted on October 7, 2025

Overview of the Collaboration [Read More]
Implementing LMCache Plugin Framework & lmcache_frontend: Design Philosophy

A flexible plugin system for enhanced observability and management
By Baolong, Kobe
Posted on September 23, 2025

Abstract [Read More]
Tags:
- LMCache
- Plugin Framework
- Frontend
- vLLM
- Monitoring
NVIDIA Dynamo integrates LMCache, Accelerating LLM Inference
By NVIDIA Dynamo team, LMCache team
Posted on September 18, 2025

We’re thrilled to announce that Nvidia Dynamo has integrated LMCache as a KV caching layer solution. This is a big milestone: Dynamo gets a battle-tested caching solution, and LMCache becomes part of a data center-scale inference platform used by many developers worldwide to deploy AI at scale. [Read More]
Extending LMCache Backends: A Comprehensive Guide to Custom Backend Development

Learn how to build custom backends for LMCache using the external backend extension mechanism
By Baolong, Kobe
Posted on September 11, 2025

Abstract [Read More]
Tags:
- backend
- extension
- customization
- storage
- lmcache
🎉 LMCache Hits 5,000+ GitHub Stars — Thank You, Community!

A milestone that shows KV cache has become a first-class citizen in the LLM inference stack
By LMCache Team
Posted on August 28, 2025

We’re thrilled to share that LMCache has officially crossed 5,000 GitHub stars! 🚀 This milestone is not just a number — it’s a strong signal that KV cache technology has become a first-class citizen in the LLM inference stack, and that our community is leading the way. [Read More]
Tags:
- milestone
- community
- github
- stars

Older Posts

LMCache on Google Kubernetes Engine: Boosting LLM Inference Performance with KV Cache on Tiered Storage

Implementing LMCache Plugin Framework & lmcache_frontend: Design Philosophy

A flexible plugin system for enhanced observability and management

NVIDIA Dynamo integrates LMCache, Accelerating LLM Inference

Extending LMCache Backends: A Comprehensive Guide to Custom Backend Development

Learn how to build custom backends for LMCache using the external backend extension mechanism

🎉 LMCache Hits 5,000+ GitHub Stars — Thank You, Community!

A milestone that shows KV cache has become a first-class citizen in the LLM inference stack