Files
agent-evals/LEADERBOARD.md
2026-02-06 19:40:02 -05:00

35 lines
894 B
Markdown

# Leaderboard
1. **[pi - Kimi K2.5](../../../src/branch/eval/pi-kimi-k2.5/)**
- UI: 9/10
- Fixes Needed: Files List, Editor Scrolling, Duplicate `.md` Name
<details>
<summary>Screenshot</summary>
![pi-kimi-k2.5](../../../raw/branch/eval/pi-kimi-k2.5/screenshot.png)
</details>
2. **[pi - Qwen3 Coder Next (80B)](../../../src/branch/eval/pi-qwen3-coder-next-80b/)**
- UI: 8/10
- Fixes Needed: New File, Markdown Styling
<details>
<summary>Screenshot</summary>
![pi-qwen3-coder-next-80b](../../../raw/branch/eval/pi-qwen3-coder-next-80b/screenshot.png)
</details>
3. **[pi - GLM4.7](../../../src/branch/eval/pi-glm4.7/)**
- UI: 7/10
- Fixes Needed: Files List, Markdown Styling, Delete Failure
<details>
<summary>Screenshot</summary>
![pi-glm4.7](../../../raw/branch/eval/pi-glm4.7/screenshot.png)
</details>