Skip to content
View chfeng-cs's full-sized avatar
💬
All In AI
💬
All In AI
  • Alibaba
  • Shanghai Jiao Tong University

Block or report chfeng-cs

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
chfeng-cs/README.md

Ethan Feng

Infrastructure engineer focused on LLM inference systems.

  • M.S. Computer Science — Shanghai Jiao Tong University
  • B.S. Computer Science — Harbin Institute of Technology
  • 2 yrs at Alibaba

Currently contributing to vllm-project/vllm — KV cache transfer, scheduler optimization, and hybrid KV cache management (HMA).


Focus Areas

LLM Inference — vLLM internals, KV cache transfer, prefill-decode disaggregation, PagedAttention
Kernel Development — CUDA, Triton (fused kernels, memory hierarchy optimization)
Distributed Systems — background in distributed databases (PolarDB/MySQL), now applying to inference clusters


Open Source

Project Area Highlights
vllm-project/vllm Scheduler / KV Cache Bounded prefetch scheduling, HMA default behavior, metrics fixes

→ Full contribution list: vllm-contributions


Stack

Python CUDA Triton C++ PyTorch Linux


📫 ethan.fengch@gmail.com

Pinned Loading

  1. sglang sglang Public

    Forked from sgl-project/sglang

    SGLang is a high-performance serving framework for large language models and multimodal models.

    Python

  2. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  3. flashinfer flashinfer Public

    Forked from flashinfer-ai/flashinfer

    FlashInfer: Kernel Library for LLM Serving

    Python

  4. vllm-contributions vllm-contributions Public

    Python