Port SHM non-GPU transport to cpu_context_pickle branch and fix correctness gaps from #278 review by Copilot · Pull Request #280 · hlin99/LMCache

Copilot · 2026-05-20T06:32:32Z

Identify merge conflicts between copilot/sub-pr-278 and the base branch\n- [x] Resolve conflict in server.py commit_store (locked + None-check semantics)\n- [x] Resolve conflicts in test_non_cuda_context.py (keep both tests with correct semantics)\n- [x] Verify all 20 tests pass\n- [x] Commit the merge resolution

Agent-Logs-Url: https://github.com/hlin99/LMCache/sessions/661cbeee-d0d4-40ef-9312-4044e4696a51 Co-authored-by: hlin99 <73271530+hlin99@users.noreply.github.com>

Copilot · 2026-05-20T06:49:17Z

Just as a heads up, I was blocked by some firewall rules while working on your feedback. Expand below for details.

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

I tried to connect to the following addresses, but was blocked by firewall rules:

download-r2.pytorch.org
- Triggering command: /usr/bin/python python -m pip install numpy torch --index-url REDACTED (dns block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

…//github.com/hlin99/LMCache into copilot/sub-pr-278 # Conflicts: # lmcache/v1/multiprocess/server.py # tests/v1/multiprocess/test_non_cuda_context.py Co-authored-by: hlin99 <73271530+hlin99@users.noreply.github.com>

hlin99

Review: PR #278 + #280 Combined

整体质量不错，#280 修复了大部分 #278 review 中的问题。还有两个残留需要处理：

🟡 Issue 1: `prepare_store` 空 reserved 时仍存 entry 到 `_pending_shm_writes`，导致微量内存泄漏

当 reserve_write 返回空 dict（key 全部已存在），reserved_keys = []，slots = []。

当前行为链：

Server prepare_store → 存 _pending_shm_writes[transfer_key] = []，返回 {"slots": []}
Worker prepare_store → 检测到空 slots，返回 None（✅ #280 已修复）
Worker 不调用 commit_store
Server 侧 _pending_shm_writes 中的空 entry 永远不会被 pop

虽然每个泄漏 entry 很小，但长时间运行 + 大量重复 prompt 会持续累积。

建议修复（server.py prepare_store）：

# 在 "transfer_key = ..." 之前加：
if not reserved_keys:
    return PrepareStoreResponse(context={})
这样 worker 收到空 context（无 "slots" key），直接返回 None，server 也不存空 entry。同时避免了多余的 reserve_write → 空 dict → 遍历的开销。

🟡 Issue 2: MixedMemoryAllocator(shm_name=...) 参数兼容性未验证
memory_manager.py 中 create_memory_allocator 调用：

Python
return MixedMemoryAllocator(
    config.size_in_bytes,
    align_bytes=config.align_bytes,
    shm_name=shm_name,
)
如果 MixedMemoryAllocator.__init__ 没有 shm_name 参数，SHM 模式初始化时会直接 TypeError crash。请确认 MixedMemoryAllocator 是否已支持该参数，如果不支持需要在本 PR 中添加。

When reserve_write returns empty (all object keys already cached), return PrepareStoreResponse(context={}) immediately without storing an entry in _pending_shm_writes. This prevents leaked entries that would never be popped since the worker won't call commit_store. Agent-Logs-Url: https://github.com/hlin99/LMCache/sessions/182111d5-1737-49c0-be65-0287d5b9d6c5 Co-authored-by: hlin99 <73271530+hlin99@users.noreply.github.com>

Copilot · 2026-05-20T07:11:01Z

Review: PR #278 + #280 Combined

整体质量不错，#280 修复了大部分 #278 review 中的问题。还有两个残留需要处理：...

Fixed both issues in 338abb7:

Issue 1: Restored the early-return in prepare_store when reserved_keys is empty — now returns PrepareStoreResponse(context={}) without storing an entry in _pending_shm_writes. This prevents the memory leak since the worker (seeing no "slots" key) won't call commit_store, and no orphan entry accumulates.

Issue 2: Verified — MixedMemoryAllocator.__init__ already accepts shm_name via **kwargs (line 2092 of memory_management.py), so the call in memory_manager.py works correctly.

Initial plan

520ee5f

Copilot AI assigned Copilot and hlin99 May 20, 2026

Copilot AI mentioned this pull request May 20, 2026

Add SHM-based non-GPU KV transport without changing existing NonGpuContext interfaces #278

Closed

Copilot started work on behalf of hlin99 May 20, 2026 06:32 View session

Copilot AI and others added 2 commits May 20, 2026 06:43

Fix SHM non-GPU transport idempotency, locking, and cleanup issues

7c64229

Agent-Logs-Url: https://github.com/hlin99/LMCache/sessions/661cbeee-d0d4-40ef-9312-4044e4696a51 Co-authored-by: hlin99 <73271530+hlin99@users.noreply.github.com>

Polish SHM feedback fixes and align validation comments

8ae9bc4

Agent-Logs-Url: https://github.com/hlin99/LMCache/sessions/661cbeee-d0d4-40ef-9312-4044e4696a51 Co-authored-by: hlin99 <73271530+hlin99@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] [WIP] Address feedback on SHM-based non-GPU KV transport implementation~~ Port SHM non-GPU transport to cpu_context_pickle branch and fix correctness gaps from #278 review May 20, 2026

Copilot finished work on behalf of hlin99 May 20, 2026 06:49

Copilot AI requested a review from hlin99 May 20, 2026 06:49

Copilot started work on behalf of hlin99 May 20, 2026 06:51 View session

Copilot finished work on behalf of hlin99 May 20, 2026 07:01

hlin99 requested changes May 20, 2026

View reviewed changes

Copilot started work on behalf of hlin99 May 20, 2026 07:05 View session

Copilot finished work on behalf of hlin99 May 20, 2026 07:11

Copilot AI requested a review from hlin99 May 20, 2026 07:11

hlin99 marked this pull request as ready for review May 20, 2026 07:13

hlin99 merged commit 2a02acb into copilot/add-nongpucontext-shm-implementation May 20, 2026

hlin99 deleted the copilot/sub-pr-278 branch May 28, 2026 12:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Port SHM non-GPU transport to cpu_context_pickle branch and fix correctness gaps from #278 review#280

Port SHM non-GPU transport to cpu_context_pickle branch and fix correctness gaps from #278 review#280
hlin99 merged 5 commits into
copilot/add-nongpucontext-shm-implementationfrom
copilot/sub-pr-278

Copilot AI commented May 20, 2026 •

edited

Loading

Uh oh!

Copilot AI commented May 20, 2026

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

hlin99 left a comment

Uh oh!

Copilot AI commented May 20, 2026

Review: PR #278 + #280 Combined

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Copilot AI commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI commented May 20, 2026

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

hlin99 left a comment

Choose a reason for hiding this comment

Review: PR #278 + #280 Combined

🟡 Issue 1: prepare_store 空 reserved 时仍存 entry 到 _pending_shm_writes，导致微量内存泄漏

Uh oh!

Copilot AI commented May 20, 2026

Review: PR #278 + #280 Combined

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented May 20, 2026 •

edited

Loading

🟡 Issue 1: `prepare_store` 空 reserved 时仍存 entry 到 `_pending_shm_writes`，导致微量内存泄漏