- Increase context window from 80k to 202,752 tokens - Add repeat penalty parameter (1.0) - Enable CUDA device for GPU acceleration
14 KiB
14 KiB
- Increase context window from 80k to 202,752 tokens - Add repeat penalty parameter (1.0) - Enable CUDA device for GPU acceleration