Releases: xPyD-hub/xPyD-acc

v0.1.0

06 Apr 15:51
f26fe01

What's Changed

  • feat: add logprobs comparison tool (M2) by @hlin99 in #2
  • chore: mark M2 as complete in ROADMAP.md by @hlin99 in #3
  • feat: add KV cache comparison tool (M3) by @hlin99 in #5
  • chore: mark M3 as complete in ROADMAP.md by @hlin99 in #6
  • feat: add automated diagnostic pipeline (M4) by @hlin99 in #8
  • feat: add output comparison utilities (M5) by @hlin99 in #10
  • feat: add batch dataset comparison (M6) by @hlin99 in #13
  • feat: add xPyD ecosystem integration (M7) by @hlin99 in #15
  • docs: mark M7 as complete by @hlin99 in #16
  • feat: HTML report generation (M8) by @hlin99 in #18
  • docs: mark M8 complete in ROADMAP.md by @hlin99 in #19
  • feat: add TOML configuration file support by @hlin99 in #21
  • feat: add CLI config integration tests, mark M9 complete by @hlin99 in #23
  • feat: add --json export for batch-compare and --version flag by @hlin99 in #25
  • docs: mark M10 as complete by @hlin99 in #26
  • feat: HTTP retry with exponential backoff for endpoint requests by @hlin99 in #28
  • feat: add Rich progress bars for batch comparison by @hlin99 in #30
  • feat: add streaming output comparison (M13) by @hlin99 in #32
  • feat: add endpoint health check (M14) by @hlin99 in #34
  • feat: M15 - CSV Export with truncated prompt column by @hlin99 in #36
  • feat: Token Timing Analysis (M16) by @hlin99 in #38
  • docs: mark M16 complete, add M17 milestone by @hlin99 in #39
  • feat: Prompt Template Support (M17) by @hlin99 in #41
  • docs: mark M17 complete by @hlin99 in #42
  • feat: environment variable support for credentials and endpoints by @hlin99 in #44
  • feat: add Markdown report export for batch-compare (M19) by @hlin99 in #46
  • feat: add dry run mode for batch-compare by @hlin99 in #48
  • feat: add regression detection for batch comparison runs by @hlin99 in #50
  • feat: tolerance-based matching for batch comparison (M22) by @hlin99 in #52
  • feat: selective sample rerun for batch comparison (M23) by @hlin99 in #54
  • feat: M24 - Sampling Parameter Support (temperature, top_p, seed) by @hlin99 in #56
  • feat: M25 - Named Profiles (Presets) by @hlin99 in #58
  • feat: M26 - Multi-run aggregation for persistent vs flaky divergence detection by @hlin99 in #60
  • feat: configurable HTTP timeout for batch-compare by @hlin99 in #62
  • feat: add logging & verbosity control (--verbose / --quiet) by @hlin99 in #64
  • feat: Watch Mode — Continuous Divergence Monitoring by @hlin99 in #66
  • feat: snapshot baseline capture & replay (M30) by @hlin99 in #68
  • feat: CI-friendly fail threshold for batch-compare by @hlin99 in #70
  • feat: Multi-Target Comparison (M32) by @hlin99 in #72
  • feat: request ID tracking for API call correlation by @hlin99 in #74
  • feat: prompt deduplication for batch comparison by @hlin99 in #76
  • feat: wire --no-request-id CLI flag to run_batch enable_request_ids by @hlin99 in #78
  • feat: shell completion generation for bash/zsh/fish by @hlin99 in #80
  • feat: response caching for batch comparison by @hlin99 in #82
  • feat: M37 — Result History & Trend Tracking by @hlin99 in #84
  • feat(benchmark): add endpoint latency benchmarking (M38) by @hlin99 in #86
  • feat(config): Configuration Validation & Init Command (M39) by @hlin99 in #88
  • feat(dataset): CSV and JSON Array Format Support (M40) by @hlin99 in #90
  • feat: M41 — Webhook Notifications for Divergence Alerts by @hlin99 in #92
  • feat: M42 — Sample Filtering for Batch Reports by @hlin99 in #94
  • feat: M43 Report Diff — Side-by-Side Batch Report Comparison by @hlin99 in #96
  • feat: request rate limiting with token bucket algorithm (M44) by @hlin99 in #98
  • feat: custom output normalizers for batch comparison (M45) by @hlin99 in #100
  • feat: Top-K Logprob Distribution Analysis (M46) by @hlin99 in #102
  • feat: auto-bisect divergence by context length (M47) by @hlin99 in #104
  • feat(summary): add compact summary command (M48) by @hlin99 in #106
  • feat: divergence pattern clustering for batch reports (M49) by @hlin99 in #108
  • feat: JUnit XML export for batch-compare (M50) by @hlin99 in #110
  • feat: add history purge command (M51) by @hlin99 in #112
  • feat: Sample Deep-Dive Explain Command (M52) by @hlin99 in #114
  • feat: confidence intervals for divergence rate (M53) by @hlin99 in #116
  • feat: add sample annotation for batch reports (M54) by @hlin99 in #118
  • feat: Model Fingerprinting via Deterministic Probes (M55) by @hlin99 in #120
  • feat(batch_compare): report schema versioning and load_report() by @hlin99 in #122
  • feat: M57 — Endpoint Response Validation (Schema Check) by @hlin99 in #124
  • feat: M58 — Dataset Statistics Command by @hlin99 in #126
  • feat: M59 — Cost Estimation from API Usage by @hlin99 in #128
  • feat: M60 — Cost Tracking Integration into Batch Compare by @hlin99 in #130
  • feat: add output truncation detection (M61) by @hlin99 in #132
  • feat: add reproducibility score for multi-run consistency measurement (M62) by @hlin99 in #134
  • feat(env): add XPYD_ACC_MAX_TOKENS and XPYD_ACC_CONCURRENCY env vars by @hlin99 in #136
  • feat: M64 — Checkpoint Resume for Batch Comparison by @hlin99 in #138
  • feat: M65 — Checkpoint Integration into Batch Compare by @hlin99 in #140
  • feat(retry): add retry statistics reporting (M66) by @hlin99 in #142
  • feat(headers): add custom HTTP headers for API requests (M67) by @hlin99 in #144
  • feat: M68 — Endpoint A/B Testing with Statistical Significance by @hlin99 in #146
  • feat: M69 — Concurrency Scaling Analysis by @hlin99 in #148
  • feat: M70 — Output Entropy Analysis by @hlin99 in #150
  • feat: M71 — Output Length Bias Detection by @hlin99 in #152
  • feat: add prompt sensitivity analysis (M72) by @hlin99 in #154
  • docs: add deliverables — guide, iteration record, diagnostic script by @hlin99 in #156
  • setup: add review policy and auto-assign reviewers by @hlin99 in #157
  • feat(M73): Multi-Model Comparison in Single Batch Run by @hlin99 in #159
  • refactor: split cli.py into cli/ package (#155) by @hlin99 in #160
  • feat(M74): divergence root cause heuristics (#161) by @hlin99 in #162
  • feat(M75): side-by-side token diff viewer (#163) by @hlin99 in #164
  • feat(M76): report dashboard server (#165) by @hlin99 in #166
  • feat(M77): Prometheus metrics export (#167) by @hlin99 in #168
  • feat(M78): Grafana Dashboard Template Export (#169) by @hlin99 in #170
  • feat(M79): Parallel Multi-Dataset Batch Run (#171) by @hlin99 in https://github.com/xPy...