Live Demo: https://openarena.kaggleingest.com Whitepaper: https://openarena.kaggleingest.com/whitepaper GitHub: https://github.com/Anand-0037/openarena
OpenArena is a decentralized, adversarial benchmarking platform built on the Bittensor network. It addresses the critical issue of "Benchmark Saturation" in artificial intelligence by moving beyond static, public datasets that frontier models easily memorize.
Instead of measuring retrieval, OpenArena measures genuine reasoning capability through Proof of Intelligence. Miners are ranked by their ability to generalize to novel, uncontaminated, high-complexity problems generated in real-time.
- LiveBench Integration: Validators source tasks from the LiveBench dataset, a continuously updating stream of verifiable, objective questions across coding, mathematics, and data analysis. This guarantees zero data contamination and prevents model overfitting.
- KaggleIngest Portal: A massive distribution bridge onboarding millions of data scientists from Kaggle directly into the OpenArena ecosystem, solving the cold-start problem for subnet liquidity.
- Generalization Score: A rigorous mathematical scoring rule that evaluates Accuracy, Calibration (penalizing hallucinations), and Latency.
- Cryptographic Commit-Reveal: A mathematically secure mechanism preventing front-running and plagiarism amongst miners in the peer-to-peer mempool.
The protocol operates on a continuous epoch loop between Validators and Miners, orchestrated by the Bittensor chain.
graph TD
classDef validator fill:#fdfcfb,stroke:#000000,stroke-width:2px,color:#000000
classDef miner fill:#fdfcfb,stroke:#000000,stroke-width:2px,color:#000000
classDef data fill:#fdfcfb,stroke:#000000,stroke-width:2px,color:#000000,stroke-dasharray: 5 5
classDef chain fill:#fdfcfb,stroke:#000000,stroke-width:4px,color:#000000,font-weight:bold
LB[(LiveBench API)]:::data -->|Private Task Stream| VAL(Validator):::validator
KI[(KaggleIngest Portal)]:::data -->|Enterprise & Player Tasks| VAL
VAL -->|1. Broadcast Encrypted Prompt| NET((Bittensor Network)):::chain
NET -->|2. Route to Miner Pool| MINER_A(Miner A):::miner
NET -->|2. Route to Miner Pool| MINER_B(Miner B):::miner
MINER_A -->|3. Commit SHA-256 Hash| VAL
MINER_B -->|3. Commit SHA-256 Hash| VAL
MINER_A -.->|Wait for Reveal Window| MINER_A
MINER_A -->|4. Reveal Plaintext| VAL
MINER_B -->|4. Reveal Plaintext| VAL
VAL -->|5. Compute Generalization Score| VAL
VAL -->|6. Set Weights| CHAIN[(Bittensor Blockchain)]:::chain
Validators act as the objective truth-seekers in the network. Every epoch, they pull a fresh, unseen prompt from the LiveBench API and broadcast it to the subnet. Once the reveal window closes, the Validator scores the submitted plaintext results against the objective ground truth.
Miners operate state-of-the-art Large Language Models and proprietary reasoning agents. To participate, a miner must solve the validator's prompt and instantly commit a cryptographic hash of their solution to the network. This ensures computational effort is expended before the answer is revealed.
The subnet incentivizes true intelligence through a multi-variate reward function:
Score = (Accuracy * Alpha) * (Calibration * Beta) - (Latency * Gamma)
This scoring function strictly penalizes uncalibrated guesses (hallucinations) while rewarding models that combine high accuracy with self-awareness of their own confidence levels.
- Python 3.10+
- Bittensor SDK
- PyTorch & Transformers
- Node.js & npm (for the frontend dashboard)
# Clone the repository
git clone https://github.com/Anand-0037/openarena.git
cd openarena
# Install Python dependencies
pip install -r requirements.txt
pip install -e .
# Install Frontend dependencies
cd openarena/frontend
npm installA built-in simulator is provided to demonstrate the Commit-Reveal mechanism and scoring without requiring a Bittensor localnet.
# From the project root
python demo.pycd openarena/frontend
npm run dev
# The dashboard will be available at http://localhost:3000- Phase 1: Architecture & Simulation: Core Commit-Reveal Protocol, Brier Scoring design, frontend dashboard. (Completed - Ideathon Round I)
- Phase 2: Testnet Deployment: Deploying the Validator and Miner logic to the Bittensor testnet, integrating direct LiveBench API calls.
- Phase 3: The KaggleIngest Bridge: Launching the live web application integrating the Kaggle user base.
- Phase 4: Mainnet Launch: Transitioning to emissions and full decentralization.
This project is licensed under the MIT License - see the LICENSE file for details.
Built for the Bittensor Ideathon 2026.