RFC (OpenSearch 3.6.0): LRUCache lock-path hot-path optimization

fiatlux · March 6, 2026, 10:04pm

Versions (relevant - OpenSearch/Dashboard/Server OS/Browser):

Describe the issue:

LRUCache.get(...) already holds the cache lock, but on a hit it calls incRef(key), which re-enters the lock logic and repeats the key lookup for an entry that get has already resolved.
This adds avoidable overhead on a very hot path in file-cache-heavy workloads.
Proposed change:
- Add private incRefNode(node) for already-resolved entries.
- Use it in get, addNode, and replaceNode.
- Keep public incRef(key) behavior unchanged (delegates internally to the node helper after lookup).
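
To make the shape of the change concrete, here is a minimal sketch of the pattern. It is not the actual OpenSearch code: SketchLruCache, Node, put, decRef, refCount, evictableCount, and the activeCount field are hypothetical names used for illustration, and the sketch assumes a ReentrantLock guarding a map plus an LRU list of unreferenced entries. Only get, incRef(key), and incRefNode(node) correspond to names in the proposal.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical, simplified stand-in for the cache discussed above.
class SketchLruCache<K, V> {
    private static final class Node<K, V> {
        final K key;
        final V value;
        int refCount;
        Node(K key, V value) { this.key = key; this.value = value; }
    }

    private final ReentrantLock lock = new ReentrantLock();
    private final Map<K, Node<K, V>> data = new HashMap<>();
    // Entries with refCount == 0 in insertion order; only these are evictable.
    private final LinkedHashMap<K, Node<K, V>> lru = new LinkedHashMap<>();
    private long activeCount; // assumed stand-in for "active usage" accounting

    // Sketch only: inserts an unreferenced entry and ignores existing keys.
    public void put(K key, V value) {
        lock.lock();
        try {
            if (data.containsKey(key)) return;
            Node<K, V> node = new Node<>(key, value);
            data.put(key, node);
            lru.put(key, node);
        } finally {
            lock.unlock();
        }
    }

    // Hot path: one lock acquisition, one lookup, then the node helper.
    public V get(K key) {
        lock.lock();
        try {
            Node<K, V> node = data.get(key);
            if (node == null) return null;
            incRefNode(node); // no second lock entry, no second lookup
            return node.value;
        } finally {
            lock.unlock();
        }
    }

    // Public behavior unchanged: lock, look up, delegate to the helper.
    public void incRef(K key) {
        lock.lock();
        try {
            Node<K, V> node = data.get(key);
            if (node != null) incRefNode(node);
        } finally {
            lock.unlock();
        }
    }

    public void decRef(K key) {
        lock.lock();
        try {
            Node<K, V> node = data.get(key);
            if (node != null && node.refCount > 0 && --node.refCount == 0) {
                activeCount--;
                lru.put(key, node); // unreferenced again, so evictable
            }
        } finally {
            lock.unlock();
        }
    }

    // Caller must already hold the lock and have resolved the node.
    private void incRefNode(Node<K, V> node) {
        assert lock.isHeldByCurrentThread();
        if (node.refCount == 0) {
            activeCount++;
            lru.remove(node.key); // referenced entries leave the eviction list
        }
        node.refCount++;
    }

    public int refCount(K key) {
        lock.lock();
        try {
            Node<K, V> node = data.get(key);
            return node == null ? -1 : node.refCount;
        } finally {
            lock.unlock();
        }
    }

    public int evictableCount() {
        lock.lock();
        try {
            return lru.size();
        } finally {
            lock.unlock();
        }
    }
}
```

In this sketch the refcount and eviction-list transitions live only in incRefNode, so get and incRef(key) cannot drift apart; the saving in get is the skipped lock re-entry and the skipped second map lookup.
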
Safety:
- No API or wire/protocol changes.
- No behavior changes to refcount/LRU/stat semantics, only removal of redundant work.
Local microbenchmark result:
- FileCacheBenchmark.get : 8101.39 -> 13393.85 ops/ms (+65.33%).

Configuration:

Relevant Logs or Screenshots:

Feedback requested:

Do you see any concurrency or eviction edge cases this change could miss?
Any concerns that stats accounting could diverge from existing behavior?
Are there additional workloads/benchmarks you want to see before proposing upstream?
