Skip to content

Commit b031e6e

Browse files
committed
move blog images
1 parent 746fbc1 commit b031e6e

1 file changed

Lines changed: 5 additions & 3 deletions

File tree

docs/blog/posts/toxic_berry_test.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -40,6 +40,8 @@ As expected, this simple agent performs well until the rules change. The MLflow
4040

4141
At step 1000, the `average_agent_health` plummets, leading to a sharp drop in `active_agents`. The `causal_understanding_score` flatlines near zero, proving the agent failed to learn the new rule.
4242

43+
![Baseline Agents](../assets/berry_sim_baseline.png)
44+
4345
## Phase 2: The Causal Agent - Learning to See
4446

4547
Our experimental group is the Causal-QLearning-Agent. It uses a sophisticated Q-learning model to make decisions. Crucially, we gave this agent "senses" by equipping it with a `PerceptionComponent` and a more advanced state encoder.
@@ -71,12 +73,12 @@ class BerryStateEncoder(StateEncoderInterface):
7173
return np.array(agent_state_vector + perception_vector)
7274
```
7375

74-
## The A/B Test: A Clear Winner
75-
76-
![Baseline Agents](../assets/berry_sim_baseline.png)
76+
At step 1000, the `average_agent_health` falls then recovers and is not enough to kill off as many agents `active_agents`. The `causal_understanding_score` spikes then remains higher-roughly 2X higher than our baseline agents-proving the agent failed to learn the new rule.
7777

7878
![Causal Agents](../assets/berry_sim_causal.png)
7979

80+
## The A/B Test: A Clear Winner
81+
8082
The results are conclusive.
8183

8284
```

0 commit comments

Comments
 (0)