Commit Graph

313 Commits

Author SHA1 Message Date
d1f17a18b4 docs(web-glimpse): simplify and streamline skill documentation
Condense the web-glimpse SKILL.md from verbose multi-section format to a
compact quick-reference style. Key changes:
- Consolidate usage patterns into a single quick reference block
- Replace separate sections per command with a concise command table
- Simplify workflow guidance and error handling into scannable tables
- Update timeout values from milliseconds to seconds
- Document new --no-reader and --format options
- Remove redundant answering guidelines
2026-05-02 20:32:36 -04:00
193b431681 chore(packages): bump llama-cpp to b9009 2026-05-02 17:19:19 -04:00
3117a3569f chore(packages): bump pi-coding-agent to 0.72.1 2026-05-02 17:17:17 -04:00
3cc4649979 feat(lin-va-terminal): add sops secret for rke2 kubeconfig 2026-05-02 17:04:52 -04:00
7c1519881a refactor(llama-swap): generate sops secrets from apiKeys list 2026-05-02 15:48:15 -04:00
f00edb620c feat(lin-va-desktop): add systemd services for NVIDIA power limit 2026-05-02 14:00:49 -04:00
8d45977154 chore(llama-swap): update Genesis to v7.69, add cliff 2 optimizations
- Bump Genesis pin from fc89395 to 2db18df (v7.69)
- Add PN32 GDN chunked prefill and PN34 workspace lock relax env vars
- Replace patch_workspace_lock_disable with patch_inputs_embeds_optional
- Remove setup-time PN25/PN30 patches (folded into v7.69 natively)
- Switch patch base URL to v7.69-cliff2-test branch
- Lower GPU memory utilization to 0.93 for long-text variant
- Remove python3 from preflight check prerequisites
- Add printing service to lin-va-thinkpad
2026-05-02 13:48:41 -04:00
40114f438f feat(llama-swap): sync vLLM configs from club-3090, add evan API key
Sync all three vLLM model configs from club-3090 master (ae4846f).
Update to Genesis v7.65 full PROD env set with new patches.
Update docker image to nightly-7a1eb8ac. Add torch_compile and
triton cache dirs. Add agent setup guide (AGENTS.md).

Add 'evan' API key to llama-swap sops secrets.
2026-05-02 08:27:47 -04:00
ba30222962 feat(pi): add skills and improve AGENTS.md reading guidelines
- Add 'create-skill' skill for scaffolding new skill directories
- Add 'planning' skill for structured implementation workflows
- Add search-then-read pattern guidance to AGENTS.md
2026-05-01 23:26:53 -04:00
e4d40d89d9 feat: add api keys to llama-swap 2026-05-01 22:12:51 -04:00
43a1d66e6b add 9b vision 2026-05-01 21:51:06 -04:00
1283b7cdef add fim 4b + 9b 2026-05-01 21:42:50 -04:00
09fdff4908 refactor(llama-swap): reorganize models by GPU hardware section 2026-05-01 21:08:53 -04:00
88308602c8 feat(llama-swap): add concurrent model matrix for CUDA0/CUDA1
Allow one CUDA0 and one CUDA1 model to run simultaneously. Dual-GPU
models (using -ts splits) are excluded from the matrix so they evict
everything when loaded. vLLM docker models get evict_cost=50 to
discourage eviction due to slow cold starts.
2026-05-01 16:50:28 -04:00
1812d2ea03 feat(pi): add vision model support
Extract a shared hasType helper for model filtering and add
vision (text + image) input capability to compatible models.
Also tag two llama-swap models with the vision type.
2026-05-01 15:03:26 -04:00
ab63211a75 wip 2026-05-01 14:36:36 -04:00
561f10d2a7 fix: timing & vllm 2026-05-01 13:09:28 -04:00
a3b2efa5bb feat: vllm timings patch 2026-05-01 10:57:59 -04:00
74ff71803b feat: vllm yay 2026-05-01 10:38:43 -04:00
75eba8703f add: vllm base 3.6 27b 2026-04-30 21:47:29 -04:00
3d55b6e675 chore: add lfs 2026-04-30 20:14:41 -04:00
990b6a4392 feat: vllm 2026-04-30 20:04:58 -04:00
bcba8f6b60 feat(address-gh-review): add thread resolution with reactions and comments 2026-04-30 14:20:01 -04:00
93e2247a30 chore(nixos/llama-swap): remove synthetic peer and tune local model args 2026-04-30 11:43:04 -04:00
31363f5f8d docs(pi): add _scratch guidance for ephemeral artifacts 2026-04-30 08:32:03 -04:00
976edab339 config(llama-swap): enable preserve_thinking in chat template kwargs 2026-04-30 07:45:57 -04:00
d1d3f3c1a3 build(pi-coding-agent): bump to 0.70.6 2026-04-29 00:08:57 -04:00
a242461139 build(llama-cpp): update to 8964 2026-04-28 21:49:16 -04:00
ca8d2a38ed build(llama-swap): update to 208 2026-04-28 21:48:51 -04:00
eef4d78cb3 feat(home): add pass-backed keyring module and enable for work VM
- Add modules/home/security/pass-keyring with GPG agent, pass, and
  python keyring backend config for headless credential storage
- Enable pass-keyring for lin-va-mbp-work-vm
- Update bash PATH from ~/.bin to ~/.local/bin
2026-04-27 23:02:22 -04:00
50719469da chore: update vm sops 2026-04-27 14:17:55 -04:00
f349b24b5d chore: rotate mac-va-mbp-work sops key and clean up unused packages
- Rotate age key for mac-va-mbp-work in .sops.yaml
- Re-encrypt secrets/common/evanreichard.yaml with new key
- Remove opencode, docker, and nunc from work mac config
2026-04-27 12:44:52 -04:00
f110a9743a chore: add work vm sops key 2026-04-27 11:02:53 -04:00
b85b01bcaa feat: add web-glimpse skill for headless browser tasks 2026-04-27 10:45:44 -04:00
5a5aeb592e build(glimpse): update package snapshot 2026-04-27 10:30:45 -04:00
04296e282c feat: add glimpse module 2026-04-27 10:25:20 -04:00
005ba2244b feat(pi): add glimpse browser automation CLI 2026-04-27 08:12:39 -04:00
412b503c7a chore(home): enable pi and set hyprland menu mod on thinkpad 2026-04-27 07:52:22 -04:00
a38a725bb9 fix(pi-coding-agent): wrap binary with nodejs_22 in PATH 2026-04-26 09:25:42 -04:00
ad11dbdf2a chore(secrets): add lin-va-terminal to sops key rotation 2026-04-26 09:16:13 -04:00
a39a314674 feat(pi): manage pi extension packages via nix module 2026-04-26 08:59:00 -04:00
e8bc4e4da7 fix(pi): guard prompt replacement to anthropic-only and preserve pi-coding-agent 2026-04-26 08:58:58 -04:00
8e11fe06de feat: add update-package-hashes helper script and bump pi-coding-agent
Co-locate update-package-hashes.sh helper script to wrap nurl and
show recent releases. Simplify SKILL.md documentation. Bump
pi-coding-agent from 0.70.0 to 0.70.2 with updated hashes.
2026-04-24 13:00:38 -04:00
db60b41d03 chore(llama-cpp): bump 8815 → 8914 2026-04-24 07:27:58 -04:00
6c700ea0ba chore(pi-coding-agent): bump 0.67.5 → 0.70.0 2026-04-24 07:25:39 -04:00
8a3c40c268 chore(skill): add auto-lookup latest version to update-package-hashes skill 2026-04-24 07:25:36 -04:00
fc1f2404d0 refactor(nixos): move supportedFilesystems nfs to common boot module 2026-04-24 07:25:06 -04:00
1070642635 feat(llama-swap): add qwen3.6-27b-thinking model 2026-04-22 13:01:38 -04:00
c3d433ddaf feat(nvim): add manual mode for LSP servers
Allow LSP servers to be enabled on-demand via a buffer-local command
instead of auto-starting on matching filetypes. The command name is
auto-derived from the server name (e.g. 'GolangciLint'). Switch
golangci-lint to manual mode as it's resource-heavy and not always needed.
2026-04-22 13:01:32 -04:00
b9f2bfdeae chore: move to oxlint from eslint 2026-04-20 14:08:15 -04:00