lowest PI GPT-5-mini · highest PI Grok 4.1 Fast · 53 tests · 1116 variants
Exploratory v1 result: 3 peer-model judges per target, 1 pass each, reliability below target.
| model | PI | class |
|---|
| GPT-5-mini | 0.1019 | Mutualistic |
| Gemini 3.1 Flash-Lite | 0.1674 | Commensal |
| Claude Haiku 4.5 | 0.1745 | Commensal |
| Grok 4.1 Fast | 0.2548 | Mildly Parasitic |
53 matching tests
No prompt-response variants matched this test.