- pi-qwen-coder-next-80b: Near one-shot success with UI advantages, minor routing issues - pi-glm4.7-flash: Dependency issues, legacy packages, average UI - pi-devstral-small-2: Runtime issues
405 B
405 B
Rankings
- eval/pi-qwen-coder-next-80b
- Almost one shot
- Nicest UI
- Invalid route (first fix commit) - very simple fix
- Not displaying content (second fix commit) - simple fix
- eval/pi-glm4.7-flash
- FE wouldnt run on first try
- Had to downgrade a bunch of deps so it would run
- Used legacy packages
- UI is meh
- eval/pi-devstral-small-2
- Wouldnt run