chore: update results

2026-02-06 12:53:40 -05:00
parent cd6eea2a7b
commit 304622aafc
2 changed files with 45 additions and 14 deletions

RESULTS.md Normal file

@@ -0,0 +1,45 @@
## Description
## Agentic Tools
- [pi](https://github.com/badlogic/pi-mono/tree/main/packages/coding-agent)
- [opencode](https://github.com/anomalyco/opencode)
- [claude-code](https://github.com/anthropics/claude-code)
## Grading
Purely opinion-based, judged on:
- One-shot performance
- Follow-up fixes
- UI design
## Rankings
### Overall
1. eval/pi-kimi-k2.5
2. eval/pi-qwen3-coder-next-80b
3. eval/pi-glm4.7
### Parameters: < 100B (Local)
1. eval/pi-qwen3-coder-next-80b
    - UI: 8/10
    - Fixes:
        - New File
        - Markdown Styling
### Parameters: > 100B (Hosted)
1. eval/pi-kimi-k2.5
    - UI: 9/10
    - Fixes:
        - Files List
        - Editor Scrolling
        - Duplicate `.md` Name
2. eval/pi-glm4.7
    - UI: 7/10
    - Fixes:
        - Files List
        - Markdown Styling
        - Delete Failure