Open benchmark harness for evaluating the latest major AI models on prediction-market forecasting, calibration, market-microstructure, and trading-risk tasks.
benchmark calibration forecasting prediction-markets market-microstructure kalshi llm-evaluation polymarket trading-risk
Updated May 5, 2026 - Python