Ethan Feng chfeng-cs

Ethan Feng

Infrastructure engineer focused on LLM inference systems.

M.S. Computer Science — Shanghai Jiao Tong University
B.S. Computer Science — Harbin Institute of Technology
2 years at Alibaba

Currently contributing to vllm-project/vllm — KV cache transfer, scheduler optimization, and hybrid KV cache management (HMA).

Focus Areas

LLM Inference — vLLM internals, KV cache transfer, prefill-decode disaggregation, PagedAttention
Kernel Development — CUDA, Triton (fused kernels, memory hierarchy optimization)
Distributed Systems — background in distributed databases (PolarDB/MySQL), now applying that experience to inference clusters

Open Source

Project	Area	Highlights
vllm-project/vllm	Scheduler / KV Cache	Bounded prefetch scheduling, HMA default behavior, metrics fixes

→ Full contribution list: vllm-contributions

Stack

Python, CUDA, Triton, C++, PyTorch, Linux

📫 ethan.fengch@gmail.com
