Releases · xPyD-hub/xPyD-acc

What's Changed

feat: add logprobs comparison tool (M2) by @hlin99 in #2
chore: mark M2 as complete in ROADMAP.md by @hlin99 in #3
feat: add KV cache comparison tool (M3) by @hlin99 in #5
chore: mark M3 as complete in ROADMAP.md by @hlin99 in #6
feat: add automated diagnostic pipeline (M4) by @hlin99 in #8
feat: add output comparison utilities (M5) by @hlin99 in #10
feat: add batch dataset comparison (M6) by @hlin99 in #13
feat: add xPyD ecosystem integration (M7) by @hlin99 in #15
docs: mark M7 as complete by @hlin99 in #16
feat: HTML report generation (M8) by @hlin99 in #18
docs: mark M8 complete in ROADMAP.md by @hlin99 in #19
feat: add TOML configuration file support by @hlin99 in #21
feat: add CLI config integration tests, mark M9 complete by @hlin99 in #23
feat: add --json export for batch-compare and --version flag by @hlin99 in #25
docs: mark M10 as complete by @hlin99 in #26
feat: HTTP retry with exponential backoff for endpoint requests by @hlin99 in #28
feat: add Rich progress bars for batch comparison by @hlin99 in #30
feat: add streaming output comparison (M13) by @hlin99 in #32
feat: add endpoint health check (M14) by @hlin99 in #34
feat: M15 - CSV Export with truncated prompt column by @hlin99 in #36
feat: Token Timing Analysis (M16) by @hlin99 in #38
docs: mark M16 complete, add M17 milestone by @hlin99 in #39
feat: Prompt Template Support (M17) by @hlin99 in #41
docs: mark M17 complete by @hlin99 in #42
feat: environment variable support for credentials and endpoints by @hlin99 in #44
feat: add Markdown report export for batch-compare (M19) by @hlin99 in #46
feat: add dry run mode for batch-compare by @hlin99 in #48
feat: add regression detection for batch comparison runs by @hlin99 in #50
feat: tolerance-based matching for batch comparison (M22) by @hlin99 in #52
feat: selective sample rerun for batch comparison (M23) by @hlin99 in #54
feat: M24 - Sampling Parameter Support (temperature, top_p, seed) by @hlin99 in #56
feat: M25 - Named Profiles (Presets) by @hlin99 in #58
feat: M26 - Multi-run aggregation for persistent vs flaky divergence detection by @hlin99 in #60
feat: configurable HTTP timeout for batch-compare by @hlin99 in #62
feat: add logging & verbosity control (--verbose / --quiet) by @hlin99 in #64
feat: Watch Mode — Continuous Divergence Monitoring by @hlin99 in #66
feat: snapshot baseline capture & replay (M30) by @hlin99 in #68
feat: CI-friendly fail threshold for batch-compare by @hlin99 in #70
feat: Multi-Target Comparison (M32) by @hlin99 in #72
feat: request ID tracking for API call correlation by @hlin99 in #74
feat: prompt deduplication for batch comparison by @hlin99 in #76
feat: wire --no-request-id CLI flag to run_batch enable_request_ids by @hlin99 in #78
feat: shell completion generation for bash/zsh/fish by @hlin99 in #80
feat: response caching for batch comparison by @hlin99 in #82
feat: M37 — Result History & Trend Tracking by @hlin99 in #84
feat(benchmark): add endpoint latency benchmarking (M38) by @hlin99 in #86
feat(config): Configuration Validation & Init Command (M39) by @hlin99 in #88
feat(dataset): CSV and JSON Array Format Support (M40) by @hlin99 in #90
feat: M41 — Webhook Notifications for Divergence Alerts by @hlin99 in #92
feat: M42 — Sample Filtering for Batch Reports by @hlin99 in #94
feat: M43 Report Diff — Side-by-Side Batch Report Comparison by @hlin99 in #96
feat: request rate limiting with token bucket algorithm (M44) by @hlin99 in #98
feat: custom output normalizers for batch comparison (M45) by @hlin99 in #100
feat: Top-K Logprob Distribution Analysis (M46) by @hlin99 in #102
feat: auto-bisect divergence by context length (M47) by @hlin99 in #104
feat(summary): add compact summary command (M48) by @hlin99 in #106
feat: divergence pattern clustering for batch reports (M49) by @hlin99 in #108
feat: JUnit XML export for batch-compare (M50) by @hlin99 in #110
feat: add history purge command (M51) by @hlin99 in #112
feat: Sample Deep-Dive Explain Command (M52) by @hlin99 in #114
feat: confidence intervals for divergence rate (M53) by @hlin99 in #116
feat: add sample annotation for batch reports (M54) by @hlin99 in #118
feat: Model Fingerprinting via Deterministic Probes (M55) by @hlin99 in #120
feat(batch_compare): report schema versioning and load_report() by @hlin99 in #122
feat: M57 — Endpoint Response Validation (Schema Check) by @hlin99 in #124
feat: M58 — Dataset Statistics Command by @hlin99 in #126
feat: M59 — Cost Estimation from API Usage by @hlin99 in #128
feat: M60 — Cost Tracking Integration into Batch Compare by @hlin99 in #130
feat: add output truncation detection (M61) by @hlin99 in #132
feat: add reproducibility score for multi-run consistency measurement (M62) by @hlin99 in #134
feat(env): add XPYD_ACC_MAX_TOKENS and XPYD_ACC_CONCURRENCY env vars by @hlin99 in #136
feat: M64 — Checkpoint Resume for Batch Comparison by @hlin99 in #138
feat: M65 — Checkpoint Integration into Batch Compare by @hlin99 in #140
feat(retry): add retry statistics reporting (M66) by @hlin99 in #142
feat(headers): add custom HTTP headers for API requests (M67) by @hlin99 in #144
feat: M68 — Endpoint A/B Testing with Statistical Significance by @hlin99 in #146
feat: M69 — Concurrency Scaling Analysis by @hlin99 in #148
feat: M70 — Output Entropy Analysis by @hlin99 in #150
feat: M71 — Output Length Bias Detection by @hlin99 in #152
feat: add prompt sensitivity analysis (M72) by @hlin99 in #154
docs: add deliverables — guide, iteration record, diagnostic script by @hlin99 in #156
setup: add review policy and auto-assign reviewers by @hlin99 in #157
feat(M73): Multi-Model Comparison in Single Batch Run by @hlin99 in #159
refactor: split cli.py into cli/ package (#155) by @hlin99 in #160
feat(M74): divergence root cause heuristics (#161) by @hlin99 in #162
feat(M75): side-by-side token diff viewer (#163) by @hlin99 in #164
feat(M76): report dashboard server (#165) by @hlin99 in #166
feat(M77): Prometheus metrics export (#167) by @hlin99 in #168
feat(M78): Grafana Dashboard Template Export (#169) by @hlin99 in #170
feat(M79): Parallel Multi-Dataset Batch Run (#171) by @hlin99 in https://github.com/xPy...

v0.1.0
