3117a3569f
chore(packages): bump pi-coding-agent to 0.72.1
2026-05-02 17:17:17 -04:00
3cc4649979
feat(lin-va-terminal): add sops secret for rke2 kubeconfig
2026-05-02 17:04:52 -04:00
7c1519881a
refactor(llama-swap): generate sops secrets from apiKeys list
2026-05-02 15:48:15 -04:00
f00edb620c
feat(lin-va-desktop): add systemd services for NVIDIA power limit
2026-05-02 14:00:49 -04:00
8d45977154
chore(llama-swap): update Genesis to v7.69, add cliff 2 optimizations
...
- Bump Genesis pin from fc89395 to 2db18df (v7.69)
- Add PN32 GDN chunked prefill and PN34 workspace lock relax env vars
- Replace patch_workspace_lock_disable with patch_inputs_embeds_optional
- Remove setup-time PN25/PN30 patches (folded into v7.69 natively)
- Switch patch base URL to v7.69-cliff2-test branch
- Lower GPU memory utilization to 0.93 for long-text variant
- Remove python3 from preflight check prerequisites
- Add printing service to lin-va-thinkpad
2026-05-02 13:48:41 -04:00
40114f438f
feat(llama-swap): sync vLLM configs from club-3090, add evan API key
...
Sync all three vLLM model configs from club-3090 master (ae4846f).
Update to Genesis v7.65 full PROD env set with new patches.
Update docker image to nightly-7a1eb8ac. Add torch_compile and
triton cache dirs. Add agent setup guide (AGENTS.md).
Add 'evan' API key to llama-swap sops secrets.
2026-05-02 08:27:47 -04:00
ba30222962
feat(pi): add skills and improve AGENTS.md reading guidelines
...
- Add 'create-skill' skill for scaffolding new skill directories
- Add 'planning' skill for structured implementation workflows
- Add search-then-read pattern guidance to AGENTS.md
2026-05-01 23:26:53 -04:00
e4d40d89d9
feat: add api keys to llama-swap
2026-05-01 22:12:51 -04:00
43a1d66e6b
add 9b vision
2026-05-01 21:51:06 -04:00
1283b7cdef
add fim 4b + 9b
2026-05-01 21:42:50 -04:00
09fdff4908
refactor(llama-swap): reorganize models by GPU hardware section
2026-05-01 21:08:53 -04:00
88308602c8
feat(llama-swap): add concurrent model matrix for CUDA0/CUDA1
...
Allow one CUDA0 and one CUDA1 model to run simultaneously. Dual-GPU
models (using -ts splits) are excluded from the matrix so they evict
everything when loaded. vLLM docker models get evict_cost=50 to
discourage eviction due to slow cold starts.
2026-05-01 16:50:28 -04:00
1812d2ea03
feat(pi): add vision model support
...
Extract a shared hasType helper for model filtering and add
vision (text + image) input capability to compatible models.
Also tag two llama-swap models with the vision type.
2026-05-01 15:03:26 -04:00
ab63211a75
wip
2026-05-01 14:36:36 -04:00
561f10d2a7
fix: timing & vllm
2026-05-01 13:09:28 -04:00
a3b2efa5bb
feat: vllm timings patch
2026-05-01 10:57:59 -04:00
74ff71803b
feat: vllm yay
2026-05-01 10:38:43 -04:00
75eba8703f
add: vllm base 3.6 27b
2026-04-30 21:47:29 -04:00
3d55b6e675
chore: add lfs
2026-04-30 20:14:41 -04:00
990b6a4392
feat: vllm
2026-04-30 20:04:58 -04:00
bcba8f6b60
feat(address-gh-review): add thread resolution with reactions and comments
2026-04-30 14:20:01 -04:00
93e2247a30
chore(nixos/llama-swap): remove synthetic peer and tune local model args
2026-04-30 11:43:04 -04:00
31363f5f8d
docs(pi): add _scratch guidance for ephemeral artifacts
2026-04-30 08:32:03 -04:00
976edab339
config(llama-swap): enable preserve_thinking in chat template kwargs
2026-04-30 07:45:57 -04:00
d1d3f3c1a3
build(pi-coding-agent): bump to 0.70.6
2026-04-29 00:08:57 -04:00
a242461139
build(llama-cpp): update to 8964
2026-04-28 21:49:16 -04:00
ca8d2a38ed
build(llama-swap): update to 208
2026-04-28 21:48:51 -04:00
eef4d78cb3
feat(home): add pass-backed keyring module and enable for work VM
...
- Add modules/home/security/pass-keyring with GPG agent, pass, and
python keyring backend config for headless credential storage
- Enable pass-keyring for lin-va-mbp-work-vm
- Update bash PATH from ~/.bin to ~/.local/bin
2026-04-27 23:02:22 -04:00
50719469da
chore: update vm sops
2026-04-27 14:17:55 -04:00
f349b24b5d
chore: rotate mac-va-mbp-work sops key and clean up unused packages
...
- Rotate age key for mac-va-mbp-work in .sops.yaml
- Re-encrypt secrets/common/evanreichard.yaml with new key
- Remove opencode, docker, and nunc from work mac config
2026-04-27 12:44:52 -04:00
f110a9743a
chore: add work vm sops key
2026-04-27 11:02:53 -04:00
b85b01bcaa
feat: add web-glimpse skill for headless browser tasks
2026-04-27 10:45:44 -04:00
5a5aeb592e
build(glimpse): update package snapshot
2026-04-27 10:30:45 -04:00
04296e282c
feat: add glimpse module
2026-04-27 10:25:20 -04:00
005ba2244b
feat(pi): add glimpse browser automation CLI
2026-04-27 08:12:39 -04:00
412b503c7a
chore(home): enable pi and set hyprland menu mod on thinkpad
2026-04-27 07:52:22 -04:00
a38a725bb9
fix(pi-coding-agent): wrap binary with nodejs_22 in PATH
2026-04-26 09:25:42 -04:00
ad11dbdf2a
chore(secrets): add lin-va-terminal to sops key rotation
2026-04-26 09:16:13 -04:00
a39a314674
feat(pi): manage pi extension packages via nix module
2026-04-26 08:59:00 -04:00
e8bc4e4da7
fix(pi): guard prompt replacement to anthropic-only and preserve pi-coding-agent
2026-04-26 08:58:58 -04:00
8e11fe06de
feat: add update-package-hashes helper script and bump pi-coding-agent
...
Co-locate update-package-hashes.sh helper script to wrap nurl and
show recent releases. Simplify SKILL.md documentation. Bump
pi-coding-agent from 0.70.0 to 0.70.2 with updated hashes.
2026-04-24 13:00:38 -04:00
db60b41d03
chore(llama-cpp): bump 8815 → 8914
2026-04-24 07:27:58 -04:00
6c700ea0ba
chore(pi-coding-agent): bump 0.67.5 → 0.70.0
2026-04-24 07:25:39 -04:00
8a3c40c268
chore(skill): add auto-lookup latest version to update-package-hashes skill
2026-04-24 07:25:36 -04:00
fc1f2404d0
refactor(nixos): move supportedFilesystems nfs to common boot module
2026-04-24 07:25:06 -04:00
1070642635
feat(llama-swap): add qwen3.6-27b-thinking model
2026-04-22 13:01:38 -04:00
c3d433ddaf
feat(nvim): add manual mode for LSP servers
...
Allow LSP servers to be enabled on-demand via a buffer-local command
instead of auto-starting on matching filetypes. The command name is
auto-derived from the server name (e.g. 'GolangciLint'). Switch
golangci-lint to manual mode as it's resource-heavy and not always needed.
2026-04-22 13:01:32 -04:00
b9f2bfdeae
chore: move to oxlint from eslint
2026-04-20 14:08:15 -04:00
0173034295
chore(git): ignore .agents directory globally
2026-04-17 12:09:28 -04:00
bf6921e90a
chore: move to agnostic agents
2026-04-17 11:35:29 -04:00