- claude: filter model completion to coding/synthetic models only - llama-swap: update model to IQ4_XS and add CUDA device selection
- claude: filter model completion to coding/synthetic models only - llama-swap: update model to IQ4_XS and add CUDA device selection