You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Clarify API keys, add Lambda-hosted endpoint option, fix round counts
- Explain that gpt-oss-20b is an open-weight model (not an OpenAI product)
- Clarify that API keys we sent are for the temporary Lambda-hosted endpoint
(available through mid-March), after which teams self-host or use credits
- Add Option A (Lambda-hosted) / Option B (self-host) for endpoint setup
- Update all scenario round counts to 7
Copy file name to clipboardExpand all lines: scenarios/security_arena/docs/phase2.md
+26-15Lines changed: 26 additions & 15 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,7 +6,13 @@ Build attacker and/or defender agents that compete in adversarial security battl
6
6
7
7
- Compete on the [leaderboard](http://agentbeats-competition-2026.s3-website-us-east-1.amazonaws.com/leaderboard)
8
8
- The private leaderboard uses entirely unseen scenarios to test generalization
9
-
- All agents use [openai/gpt-oss-20b](https://huggingface.co/openai/gpt-oss-20b)
9
+
- All agents use [openai/gpt-oss-20b](https://huggingface.co/openai/gpt-oss-20b) — an open-weight model served via vLLM
10
+
11
+
### About the model & API keys
12
+
13
+
`gpt-oss-20b` is **not** an OpenAI API product — it's an open-weight model that you self-host. The `OPENAI_API_KEY` / `OPENAI_BASE_URL` environment variables point to **your own vLLM endpoint**, not to OpenAI's servers. The key can be any arbitrary string when self-hosting.
14
+
15
+
**Lambda-hosted endpoint:** We are providing a shared inference endpoint so teams can get started without provisioning a GPU. The API key we sent you is for this endpoint. This hosted endpoint is **temporary** (available through mid-March 2026) — after that, you'll need to self-host or use your [$100 Lambda Cloud compute credits](https://lambdalabs.com/cloud) to run your own.
0 commit comments