Sync all three vLLM model configs from club-3090 master (ae4846f).
Update to Genesis v7.65 full PROD env set with new patches.
Update docker image to nightly-7a1eb8ac. Add torch_compile and
triton cache dirs. Add agent setup guide (AGENTS.md).
Add 'evan' API key to llama-swap sops secrets.
Allow one CUDA0 and one CUDA1 model to run simultaneously. Dual-GPU
models (using -ts splits) are excluded from the matrix so they evict
everything when loaded. vLLM docker models get evict_cost=50 to
discourage eviction due to slow cold starts.
Extract a shared hasType helper for model filtering and add
vision (text + image) input capability to compatible models.
Also tag two llama-swap models with the vision type.
- Add modules/home/security/pass-keyring with GPG agent, pass, and
python keyring backend config for headless credential storage
- Enable pass-keyring for lin-va-mbp-work-vm
- Update bash PATH from ~/.bin to ~/.local/bin
- Rotate age key for mac-va-mbp-work in .sops.yaml
- Re-encrypt secrets/common/evanreichard.yaml with new key
- Remove opencode, docker, and nunc from work mac config
Co-locate update-package-hashes.sh helper script to wrap nurl and
show recent releases. Simplify SKILL.md documentation. Bump
pi-coding-agent from 0.70.0 to 0.70.2 with updated hashes.
Allow LSP servers to be enabled on-demand via a buffer-local command
instead of auto-starting on matching filetypes. The command name is
auto-derived from the server name (e.g. 'GolangciLint'). Switch
golangci-lint to manual mode as it's resource-heavy and not always needed.