🧠 LLM Inference Leaderboard & Analytics

This is a project I’m working on with Vaughn DiMarco, who’s building a startup that wants to bring a BECK-style system (like what’s used in crypto) into the LLM space. The core idea is to eventually let people track, evaluate, and even trade LLM model tokens — kind of like a performance marketplace for AI models.

We’re starting by collecting and analyzing real-time inference data from platforms like OpenRouter.ai and Shoots.ai, then building a leaderboard to make that data public and trustable.

🚀 What We’re Trying to Do

We’re looking to make LLM usage and performance transparent. Specifically, we want to track:

How many inferences each model/provider is doing
How many tokens are being generated
How much those inferences are costing
Who’s providing what
Which models are most efficient or popular

Eventually, the goal is to publish this data in a public leaderboard that people can filter and explore.

📊 Key Features (In Progress)

Total number of inferences
- Hourly, daily, etc. (based on what the API provides)
Tokens generated over time
Real-time cost per token
- Updated minute by minute
Filterable data:
- ⏱️ Time
- 💰 Price
- 🔢 Token usage
- 🧠 Model name
- 🏢 Provider name

🛠️ How I’m Getting the Data

Originally tried using the OpenRouter API (and even used an LLM to parse the responses), but the results weren’t great — some data was missing or unstructured.

So I pivoted to scraping the site using BeautifulSoup + Selenium to get more consistent results. Right now it’s pulling info like:

Provider names
Associated LLM models
Inference cost & token-related data (still improving this)

Also exploring Shoots.ai as a potential secondary data source.

✅ What’s Done So Far

Scraper built for OpenRouter using BeautifulSoup + Selenium
Extracted provider/model pairs
Tested early attempts at getting cost and token info
Project structured for future automation

🔜 What’s Coming Next

Improve scraping to capture live token + pricing data
Store hourly/daily snapshots of inference usage
Build a basic leaderboard frontend (thinking Streamlit or lightweight web app)
Add full filtering (by time, model, tokens, provider, etc.)
Pull in additional sources like Shoots.ai if useful

📁 Project Structure

llm-inference-project/
├── scrapers/       # Web scraping scripts (OpenRouter, Shoots.ai)
├── data/           # Stored results (JSON, CSV, etc.)
├── notebooks/      # Exploratory analysis / quick data viz
├── scripts/        # Utilities for parsing, formatting, etc.
└── README.md       # This file

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
AWS_Integration		AWS_Integration
Check-Ins		Check-Ins
Model_Provider_csv_output		Model_Provider_csv_output
Pipeline_Items		Pipeline_Items
Testing_APIs_Diff_Methods		Testing_APIs_Diff_Methods
__pycache__		__pycache__
top_models_month_and_week		top_models_month_and_week
.DS_Store		.DS_Store
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 LLM Inference Leaderboard & Analytics

🚀 What We’re Trying to Do

📊 Key Features (In Progress)

🛠️ How I’m Getting the Data

✅ What’s Done So Far

🔜 What’s Coming Next

📁 Project Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🧠 LLM Inference Leaderboard & Analytics

🚀 What We’re Trying to Do

📊 Key Features (In Progress)

🛠️ How I’m Getting the Data

✅ What’s Done So Far

🔜 What’s Coming Next

📁 Project Structure

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages