Add hot-swap capability between count-based and size-based cache modes #6809

timl3136 · Apr 11, 2025

What changed?
This PR introduces a hot-swap capability between count-based and size-based cache modes in the LRU cache implementation. The key changes include:

Added shadow tracking of entry sizes even in count-based mode, allowing seamless transitions to size-based mode and vice versa
Enhanced cache eviction logic to handle both count and size-based constraints
Improved error handling for oversized entries and cache full scenarios

Why?
The current implementation requires a complete cache restart when switching between count-based and size-based modes, which:

Causes unnecessary cache evictions
Introduces latency spikes during mode transitions
Disrupts application performance during configuration changes
This change enables smooth transitions between modes while maintaining cache contents, improving system stability and performance.

How did you test it?
I tested the logic with unit test and local testing.

Potential risks
Since it's the base cache, it could break every part that used this cache if there was a bug hidden. We need to rollback immediately if such things happens.

Release notes

Documentation Changes

common/cache/lru.go

Shaddoll · Apr 11, 2025

common/cache/lru.go

-		}
-		cache.sizeByKey = make(map[interface{}]uint64, opts.InitialCapacity)
+	if opts.IsSizeBased == nil {
+		cache.isSizeBased = dynamicproperties.GetBoolPropertyFn(false)


I prefer to panic in this case. I'd like to have the behavior of cache to be completely managed by dynamic config. These implicit logic can make it hard to debug

I want to add the panic logic later once we add feature flags for all the cache usages. For now, existing usage will be default to count based.

I'd avoid panic'ing in library level and instead return an error for missing mandatory options

Thanks, I will add the error returning and make this into a mandatory flag once we add the feature flags for all other cache usage.

common/cache/lru.go

taylanisikdemir · Apr 14, 2025

common/cache/lru.go

-		}
-		cache.sizeByKey = make(map[interface{}]uint64, opts.InitialCapacity)
+	if opts.IsSizeBased == nil {
+		cache.isSizeBased = dynamicproperties.GetBoolPropertyFn(false)


I'd avoid panic'ing in library level and instead return an error for missing mandatory options

common/cache/lru.go

taylanisikdemir · Apr 14, 2025

common/cache/lru.go

+	if c.isSizeBased() {
+		if valueSize > uint64(c.maxSize()) {
+			// value is too big to be cached, we also don't want to evict everyone else
+			return nil, ErrEntryTooBig


This is a good idea but we may need to update callers to work without caching for big workflow executions. Otherwise they will be stuck because caching the workflow execution is mandatory to process the workflow.

Got it, this is similar to when the cache is full of pinned elements and can't evict or add new entries. One way I can think of is to avoid that is by setting a minimum cache size that fits any workflow execution—for example, 50MB if the limit is 30MB.
I'll add a comment to cover this edge case in the cache usage logic.

common/cache/lru.go

common/cache/lru_test.go

timl3136 added 3 commits April 10, 2025 18:17

Implement dual logic for count and size based cache

5319595

Improve logic

d88e214

Merge with upstream master, also fix unit test

b6f6e42

timl3136 requested review from Shaddoll, neil-xie, davidporter-id-au, Groxx, shijiesheng, jakobht, 3vilhamster, sankari165, dkrotx, taylanisikdemir and demirkayaender as code owners April 11, 2025 18:50

Shaddoll reviewed Apr 11, 2025

View reviewed changes

respond to comments and try to fix data race issue

33add38

taylanisikdemir reviewed Apr 14, 2025

View reviewed changes

timl3136 added 5 commits April 14, 2025 14:04

Add for loop to find unpinned elements (if any)

3691777

Merge remote-tracking branch 'upstream/master' into lru-dual-cache-logic

b5afd1f

fix cache with edge case of misconfigured maxSize

4cf6e43

lint

27ec252

comments

2cd39cf

taylanisikdemir reviewed Apr 16, 2025

View reviewed changes

common/cache/lru.go Outdated Show resolved Hide resolved

common/cache/lru.go Show resolved Hide resolved

common/cache/lru.go Show resolved Hide resolved

common/cache/lru.go Show resolved Hide resolved

add unit tests

1766480

taylanisikdemir reviewed Apr 16, 2025

View reviewed changes

common/cache/lru_test.go Outdated Show resolved Hide resolved

taylanisikdemir reviewed Apr 16, 2025

View reviewed changes

common/cache/lru_test.go Show resolved Hide resolved

taylanisikdemir reviewed Apr 16, 2025

View reviewed changes

common/cache/lru_test.go Show resolved Hide resolved

taylanisikdemir approved these changes Apr 16, 2025

View reviewed changes

lint

1c6a9a5

timl3136 enabled auto-merge (squash) April 16, 2025 21:29

timl3136 merged commit a653fa9 into cadence-workflow:master Apr 16, 2025
22 of 23 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add hot-swap capability between count-based and size-based cache modes #6809

Add hot-swap capability between count-based and size-based cache modes #6809

Uh oh!

timl3136 commented Apr 11, 2025

Uh oh!

Uh oh!

Shaddoll Apr 11, 2025

Uh oh!

timl3136 Apr 11, 2025

Uh oh!

taylanisikdemir Apr 14, 2025

Uh oh!

timl3136 Apr 15, 2025

Uh oh!

Uh oh!

Uh oh!

taylanisikdemir Apr 14, 2025

Uh oh!

Uh oh!

taylanisikdemir Apr 14, 2025

Uh oh!

timl3136 Apr 14, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Search code, repositories, users, issues, pull requests...

Add hot-swap capability between count-based and size-based cache modes #6809

Add hot-swap capability between count-based and size-based cache modes #6809

Uh oh!

Conversation

timl3136 commented Apr 11, 2025

Uh oh!

Uh oh!

Shaddoll Apr 11, 2025

Choose a reason for hiding this comment

Uh oh!

timl3136 Apr 11, 2025

Choose a reason for hiding this comment

Uh oh!

taylanisikdemir Apr 14, 2025

Choose a reason for hiding this comment

Uh oh!

timl3136 Apr 15, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

taylanisikdemir Apr 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

taylanisikdemir Apr 14, 2025

Choose a reason for hiding this comment

Uh oh!

timl3136 Apr 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!