Releases: xPyD-hub/xPyD-sim
v0.5.0
v0.4.0
What's New
OpenAI API Spec Compliance
- Parameter range validation (temperature, top_p, frequency_penalty, presence_penalty, n, best_of)
- response_format support: json_object and json_schema for chat completions
- Embedding encoding_format=base64 support (little-endian)
- Stop sequences correctly skipped for JSON-formatted responses
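
For readers unfamiliar with the base64 embedding encoding mentioned above, here is a minimal sketch of how such a payload can be produced and decoded on the client side. It assumes the vector is packed as little-endian 32-bit floats, which matches the usual OpenAI convention; the helper names `encode_embedding` and `decode_embedding` are hypothetical and are not taken from xPyD-sim.

```python
import base64
import struct


def encode_embedding(values):
    """Pack floats as little-endian float32 and return a base64 string.

    Assumption: 4-byte floats, as in the usual OpenAI base64 embedding format.
    """
    # "<" selects little-endian byte order, "f" is a 4-byte float.
    raw = struct.pack(f"<{len(values)}f", *values)
    return base64.b64encode(raw).decode("ascii")


def decode_embedding(payload):
    """Reverse of encode_embedding: base64 string back to a list of floats."""
    raw = base64.b64decode(payload)
    count = len(raw) // 4  # 4 bytes per float32
    return list(struct.unpack(f"<{count}f", raw))


# Values chosen to be exactly representable in float32, so the round trip is lossless.
vector = [0.5, -1.0, 2.25]
encoded = encode_embedding(vector)
decoded = decode_embedding(encoded)
```

Values that are not exactly representable in float32 come back slightly rounded, so compare with a tolerance rather than strict equality when checking real embeddings.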
vLLM Backend Compatibility
- Accept all vLLM-specific sampling params (top_k, min_p, use_beam_search, etc.) without error
- Response fields: stop_reason, service_tier, kv_transfer_params
- best_of and echo on chat completions
Scheduling
- response_format works in both scheduled and non-scheduled paths
Tests
- 19 new API compliance tests (TC14-TC17)
- 285 total tests passing
v0.3.0
v0.2.0
v0.1.1
- render_dummy_text cycles tokens for any max_tokens value
- Streaming requests logged to JSONL
- max_model_len enforced in all code paths
- prefill/app.py: /ping and /metrics endpoints
- Warmup penalty works with scheduling
- /v1/completions routes through scheduler
- Scheduler sync-safe (no event loop dependency)
- CLI arg priority fixed (_TrackAction subparser bug)
- Batch metrics always present in /metrics
- /debug/batch returns zero-state when scheduling disabled
- Prefill batch FIFO ordering fixed
- PyPI publish workflow added
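
The first item in the v0.1.1 list says that render_dummy_text cycles tokens for any max_tokens value. The idea can be sketched as follows. This is an illustration only: the vocabulary `DUMMY_TOKENS` and the function `render_dummy_text_sketch` are hypothetical, and the actual implementation in xPyD-sim is not shown in these notes.

```python
from itertools import cycle, islice

# Hypothetical token pool; the real token source is an assumption here.
DUMMY_TOKENS = ["lorem", "ipsum", "dolor", "sit", "amet"]


def render_dummy_text_sketch(max_tokens):
    """Return exactly max_tokens tokens, wrapping around the pool as needed."""
    if max_tokens < 0:
        raise ValueError("max_tokens must be non-negative")
    # cycle() repeats the pool forever; islice() cuts it off at max_tokens,
    # so requests larger than the pool size still get a full-length result.
    return " ".join(islice(cycle(DUMMY_TOKENS), max_tokens))
```

Because the pool wraps around, the output length depends only on max_tokens and not on the size of the pool.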