Learn to Prompt — Master the art of prompting AI

practical guides for using ai coding agents securely and effectively. setup with bondage, optional envchain-xtra, nono sandboxing, and workflow patterns that hold up in real use.

be specific. define format, length, tone, audience, constraints.

give context. background, examples, and goals lift output quality.

iterate. refine, adjust, build on previous responses.

think in systems. chain prompts. break complex tasks into steps.

architecture

the hard part is not only installing tools. it is deciding where trust should live, how screenshots should work, and what a real escape hatch looks like.

agent-stack— bondage 0.2.7, envchain-xtra 1.3.1, nono 0.61.1, version guards, and non-doxing checks.Updated June 3, 2026arch
analytics-privacy— block non-essential analytics, telemetry, experiments, and error reporting.Updated May 9, 2026arch
vendor-independence— keep instructions, plugins, and security layers portable across clients.Updated Apr 28, 2026arch
claude-to-codex-plugins— turn a Claude-first plugin into a Codex first-class target without forking the workflow.Updated Apr 28, 2026arch
visual-inspection— accessibility tree first, localhost screenshot service when pixels matter.Updated Apr 27, 2026arch
sandbox-profiles— shape tiers and keep active profile policy read-only from normal agents.Updated May 12, 2026arch

agents

run ai coding agents with the preferred stack: bondage for launch policy, optional envchain-xtra for secret release, nono for kernel sandboxing.

claude-code— anthropic's cli launched through bondage with kernel sandbox.Updated May 4, 2026cloud
opencode— multi-provider agent with bondage + envchain secret injection.Updated May 4, 2026cloud
codex— OpenAI CLI with bondage, nono 0.61 profile packs, and draft profile repair.Updated June 3, 2026cloud
pi— minimal ts agent with pinned node, package-tree verification, and local ds4 side profile.Updated May 9, 2026cloud

local inference

set up local model runtimes and machines that can back coding agents without turning benchmark data into setup instructions.

llama.cpp— local llm inference with metal gpu and model-path isolation.Updated May 4, 2026local
ollama— local model manager with daemon sandboxing and optional remote keys.Updated Apr 27, 2026local
qwen3-tts mlx— local Apple Silicon voice generation with MLX-Audio, reference clips, Markdown chunking, inspection, and repair.Updated May 26, 2026local
ds4 mbp m5-128— run deepseek v4 flash locally with ds4-agent experiments, claude, codex, and pi profiles.Updated May 21, 2026local
ds4-agent setup— build native ds4-agent, pin a DeepSeek V4 Flash model path, run disposable repo tests, and keep experiments separate from default agents.Updated May 21, 2026local
ds4 wiki profile— run native ds4-agent as an opt-in LLM Wiki profile with portable paths, explicit wiki instructions, disposable fixture tests, and public-safe local defaults.Updated June 15, 2026local
ds4 dgx spark— run DeepSeek V4 Flash on NVIDIA Spark with ds4 CUDA, q2-imatrix, MTP, localhost serving, agent profile tests, and benchmark caveats.Updated May 21, 2026local
dgx-spark— set up a local nvidia box for ollama, qwen3-coder, and side profiles.Updated May 7, 2026local

evaluation

compare local model progress and cloud side profiles with dated runs, repeatable parameters, and hardware sizing tools.

benchmarks— dated local ai runs plus Spark ds4, ds4-agent, Codex frontier, and Gemini comparison lanes with hardware, runtime, context, pass rate, and scripts.Updated May 21, 2026eval
ds4-agent vs codex— public benchmark reality check for native ds4-agent, DeepSeek V4 Flash, and Codex frontier.Updated May 21, 2026eval
llm hardware calculator— model memory, kv cache, hardware fit, and single-user decode estimates.Updated Apr 28, 2026calc

workflow

get more from coding agents with better prompts, context management, and automation.

claude-md— instruction budget, progressive disclosure, agents.md cross-tool standard.Updated Apr 24, 2026flow
mcp-servers— connect agents to external tools via model context protocol.Updated Apr 24, 2026flow
prompting— explore-plan-implement-commit, stepwise prompting, verification.Updated Apr 24, 2026flow
hooks— 26 lifecycle events, 5 handler types, skills, ci/cd.Updated May 2, 2026flow
context— compaction, session management, subagents, worktree isolation.Updated Apr 24, 2026flow
agentnoise— White Noise phone control for local agents, media/wiki ingest, fake-phone tests, and stable vs Dark Matter alpha installs.Updated June 3, 2026remote
llm-wiki— append-only knowledge bases compiled by llm agents, with topic guides, session memory, and audit workflows.Updated July 7, 2026flow
llm-health— install the private, own-risk health concierge: homebrew, hub setup, fuzzy @health input, plugins, and health-v2 checks.Updated June 7, 2026flow

featured

tool

Updated Apr 28, 2026

llm hardware calculator

pick any text-generation model on huggingface — see what hardware can run it, how much memory it needs, and a single-user decode tok/s estimate. moe-aware: active params drive speed, full params drive memory. covers apple silicon, dgx spark (1×–8×), rtx, amd strix halo. open the calculator →

fetches model specs directly from huggingface (no backend)
memory breakdown: weights + kv cache + overhead
multi-spark factors capture the 3× pp=3 slowdown
shareable urls (e.g. /calculator/?model=Qwen/Qwen3-30B-A3B&ctx=131072)

open the calculator →

benchmarks

Updated May 21, 2026

local ai benchmark registry

follow measured local model progress across m5 max, dgx spark, ds4, ds4-agent, ollama, mlx, llama.cpp, sglang, and vllm, with Codex frontier and Gemini tracked separately as cloud comparison lanes. the current spark default and the Spark ds4 side profile are tracked with smoke, code, question, and wiki suites plus public reproduction scripts. open benchmarks →

daily-agent and practical-suite pass rates, not just token speed
model/runtime settings preserved with every row
cloud profiles labeled separately from local hardware runs
clear defaults and rejected profiles
downloadable scripts with privacy redaction defaults

open benchmarks →

tool

Updated Apr 29, 2026

llm-wiki

turn any ai agent into a research engine. builds append-only markdown knowledge bases through parallel multi-agent investigation, source ingestion, topic guides, session memory, and cross-referenced article compilation — zero runtime dependencies. read the guide →

parallel multi-agent research from academic, technical, and contrarian angles
thesis-driven investigation with for/against evidence scoring
reports, study guides, slide decks, implementation plans
claude code, openai codex, any llm agent

explore llm-wiki →

tool

Updated June 7, 2026

llm-health

install the local-first, own-risk health concierge with a private hub, fuzzy @health input, agent plugins, and the repackaged health-v2 timeline/dashboard tools. read the install guide →

homebrew install plus explicit own-risk agreement gate
alias-only hub setup for labs, wearables, records, and self-reported context
claude code, codex, opencode, pi, and portable agents.md surfaces
health-v2 doctor, sync, chart, and export checks

install llm-health →