You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat: introduce GPU Direct Storage and DiskKVCache support for improved performance
- Added `use_gds` parameter to enable GPU Direct Storage with `kvikio-cu12`, allowing layers to load directly from disk to GPU.
- Introduced `kv_cache_dir` option for offloading KV cache to SSD, supporting long contexts (50k+ tokens).
- Updated README and CHANGELOG to reflect new features and usage instructions.
- Modified Makefile and pyproject.toml to include new dependencies and installation options.
0 commit comments