Question 1

What is fak?

Accepted Answer

fak is an agent kernel: an in-process, default-deny permission gate for AI agents, fused with an addressable, bit-exact KV cache, written in Go. It treats the language model like an untrusted program and every tool call like a syscall that must pass through a kernel the model cannot control. The same boundary enforces security (which effects are allowed, which tool results may enter the model's context) and drives performance (do shared work once instead of every turn). It is also described as an agent tool firewall.

Question 2

What problem does fak solve?

Accepted Answer

It closes the gap between agent safety and agent cost at the same boundary: Prompt injection and tool poisoning reach the model through tool results. fak quarantines suspicious results so they never enter the model's context. Irreversible actions (refunds, deletes, sends) are gated by a reviewable allow-list that is checked inside the kernel. It is default-deny and fail-closed. Agent fleets waste tokens re-processing the same shared context every turn. fak makes the KV cache a kernel object so shared work is computed once and reused.

Question 3

How is fak different from a normal firewall or API gateway?

Accepted Answer

A normal firewall or gateway screens traffic from the outside and typically fails open when it crashes or times out. fak puts the permission check on the same call path as the tool call (one address space, no inter-process call), so it is something the call passes through, like read() through an OS kernel. It is default-deny: an action that was never allow-listed cannot run, no matter what the model was talked into.

Question 4

How does fak prevent prompt injection?

Accepted Answer

It uses two independent gates rather than one classifier: The capability lock. A dangerous tool is simply not on the allow-list, so no amount of injected text changes the answer. The lever was never wired up. Result quarantine. Suspicious tool results are held out of the model's context entirely, so a booby-trapped document never reaches the model to influence it. The detector that flags suspicious results is deliberately treated as evadable (~100% evadable by design): it is a bonus, never the floor. An attacker has to beat two structural gates rather than fool one screener. In live tests, prompt injection reached the unprotected baseline 5/5 and fak walled it off 5/5.

Question 5

Does fak address the OWASP Agentic Top-10 and the MCP Top-10?

Accepted Answer

Yes, structurally. It targets Tool Poisoning (MCP03) and Memory Poisoning (T1) by keeping untrusted tool results out of the model's context (containment) and by gating which effects are even possible (the capability floor). Rather than recognizing each attack, it leans on the dangerous lever not existing and the poisoned bytes never arriving.

Question 6

What is an addressable KV cache?

Accepted Answer

A KV cache is the scratchpad a model builds as it reads, so it doesn't re-read from scratch each turn. Every shipped engine (vLLM, SGLang, the OpenAI/Anthropic prompt caches) only reuses it from the front: change anything in the middle and everything after is recomputed. An addressable KV cache lets policy reach into the middle of a kept run and evict a single span: a poisoned result, an expired secret. It leaves the cache bit-for-bit identical to a run that never saw it, verified at max|Δ| = 0. fak can do this because it owns the cache as a kernel object instead of renting it from a serving engine. See Addressable KV cache.

Question 7

Is fak a faster model server? How does it compare to vLLM, SGLang, or llama.cpp?

Accepted Answer

No. fak is not a faster model server. It does not try to beat vLLM, SGLang, or llama.cpp at raw throughput or front-of-prompt prefix caching. Those engines win that, and fak measures itself against them honestly rather than against a strawman. fak owns the orthogonal questions they don't. Which effects are allowed, which results may enter memory, when reuse is still legal, and what survives a session boundary. You can even run fak serve in front of one of those engines and keep using it. The comparison that does favor fak is operational surface, not throughput (see the next question).

Question 8

Why one Go binary instead of a Python serving stack like vLLM or SGLang?

Accepted Answer

Because serving an agent safely is a whole stack, not just a token engine, and most of that stack is governance rather than throughput. A model server (vLLM, SGLang) gives you fast tokens. To run a governed agent fleet you then assemble several pieces around it: a gateway and a capability/policy layer, a result-screening layer and an audit pipeline, and an MCP bridge plus a reverse proxy for auth. Those engines are Python on a CUDA/PyTorch stack and multi-process by design. Their production container is multi-GB because it bundles CUDA + PyTorch (pip/uv into an existing env is the lighter path), and vLLM's own security docs direct you to front it with a reverse proxy for auth and endpoint allow-listing. Its --api-key covers only the /v1 routes. fak collapses the governance + gateway half of that stack into one static Go binary with zero external dependencies (standard library only: there is no go.sum, no Python, no CUDA toolchain). That one binary does a lot at once. It speaks the OpenAI and Anthropic wires plus MCP, enforces a reviewable capability floor, quarantines tool results, emits a trace-correlated audit log, and exposes Prometheus metrics. It runs on a laptop …

Question 9

How much faster is fak for agent fleets?

Accepted Answer

The win is in reread-rate, not raw GPU speed. On a 50-turn × 5-agent run it is about 4× fewer tokens than a tuned warm-cache stack: the apples-to-apples comparison (~60× only against the naive re-send-everything baseline, not the headline). On real WebVoyager web-agent workloads (643 tasks) it eliminates 8.8–9.7× of prefill, measured. The reuse win is self-host only. An app that merely calls a frontier API gets the safety floor but not the savings. Every number is traced to a commit and artifact in the benchmark authority.

Question 10

Is fak novel? What did the prior-art audit find?

Accepted Answer

A 29-claim prior-art audit scored 0/29 novel. Every individual primitive (capability security, quarantine, KV caching, content-addressed storage) is established prior art. The contribution is the assembly: putting them together as one in-process gate where the tool call is the checkpoint, so the security boundary and the reuse boundary become the same boundary. fak is built to survive a skeptic reading the code. See the claims ledger, where every capability carries one machine-checked tag.

Question 11

How do I install fak?

Accepted Answer

One static binary, no clone or Go toolchain required: Or download a prebuilt archive (linux_amd64, darwin_amd64, darwin_arm64, windows_amd64), or run it in a container. Full guide: Getting Started.

Question 12

Can I try fak without a model, API key, or GPU?

Accepted Answer

Yes. With just Go 1.26+: refund_payment returns DENY (POLICY_BLOCK); search_kb returns ALLOW; and agent --offline runs the same task twice (tools wired directly vs. behind fak) and prints the before/after. Full walkthrough: repro packet.

Question 13

What language and license is fak?

Accepted Answer

fak is written in Go (requires Go 1.26+ to build from source) and licensed under Apache-2.0.

Question 14

How do I put fak in front of my existing model?

Accepted Answer

fak serve fronts any OpenAI-compatible server (Ollama, vLLM, a cloud provider). You keep your model and stack and gain a reviewable allow-list, result quarantine, and an audit trail: This is where most people should start; it is a complete product by itself. See the getting started guide.

Question 15

How do I put fak in front of my agent or framework (Claude Code, Cursor, an SDK, or MCP)?

Accepted Answer

You usually change one thing: the base URL your agent already points at. fak serve speaks the OpenAI (/v1/chat/completions), Anthropic (/v1/messages), and MCP (--stdio or /mcp) wires, so any agent or framework that lets you override the base URL drops in with no agent-side code change. Every tool call it proposes is adjudicated by the capability floor before it runs. Where the base URL goes depends on the agent: Claude Code and the Anthropic SDK set ANTHROPIC_BASE_URL. The OpenAI SDK, OpenAI Agents SDK, LangChain, LlamaIndex, and the Vercel AI SDK take an OpenAI base URL. Cursor and any MCP client wire fak serve --stdio. The integration index has the which-agent routing table, per-framework snippets, and a 60-second offline proof. The per-tool guides are Claude Code, Cursor, and OpenAI Codex.

Question 16

Who is fak for?

Accepted Answer

Teams running self-hosted LLM agent fleets who need three things at once: prompt-injection containment, reviewable capability security, and cache-efficient inference. It is useful at every rung. Front your existing model for the safety floor, or go all-in on the fused kernel for the reuse wins on a self-hosted model.

Question 17

Where do I report a security vulnerability?

Accepted Answer

See SECURITY.md for the disclosure process. Please do not open a public issue for an undisclosed vulnerability.

Question 18

Where can I learn more?

Accepted Answer

Guided tutorial — zero to first adjudicated call. Integration index — put fak in front of the agent you already run (Claude Code, Cursor, an SDK, or MCP). Policy in the kernel and Addressable KV cache — the two core ideas. Benchmark authority — every number. llms.txt — a machine-readable map for LLMs and answer engines.

Question 19

Why does fak treat the language model as an untrusted program?

Accepted Answer

fak treats the model as an untrusted program because its output is shaped by text it reads at runtime — including text an attacker can plant — so nothing the model proposes can count as authorization on its own. The core move puts the model in the position of ring-3 userspace: every effect it wants on the outside world becomes a syscall through a kernel the model does not control, adjudicated from evidence the model did not author, and a tool call is that syscall. The kernel decides allow, deny, transform, or quarantine from a policy floor and the call's own arguments, never from the model's say-so, so an injected instruction can ask for a dangerous action but cannot grant it.

Question 20

What does "tool call = syscall" actually mean in fak?

Accepted Answer

It means every action an agent takes on the outside world is funneled through one in-process checkpoint the model cannot bypass, the way a user-space program reaches the OS only through calls like read() or write(). In fak that checkpoint is the kernel's Submit/Reap path: a proposed tool call is folded through a ranked adjudicator chain that returns one verdict, and a denied call is never enqueued or executed. Promoting the tool call to a syscall is what lets a single in-process gate mediate both which effects are allowed and which results may enter the model's context.

Question 21

What is the "one boundary" idea, and how can the same gate be both security and performance?

Accepted Answer

The one-boundary idea is that the gate deciding whether a tool result may enter the model's context (a security act) is the same gate that pages that result's bytes to a content-addressed store for reuse (a performance act) — one write-time decision, two enforcement media. When a result is screened, the same code that holds a poisoned result out of context also stores a benign result once in a shared store so shared work isn't recomputed every turn, so the correctness metadata is the performance metadata. fak states this as a claim shown by example, not a proven law, and is honest about its edge: the convergence does not help raw GPU throughput (it pays for bit-exactness in memory), and the reuse win only materializes for read-heavy self-hosted fleets.

Question 22

If the poison detector is evadable by design, what actually protects me?

Accepted Answer

The protection is structural — the capability lock and the quarantine policy — not the detector, which fak openly calls roughly 100% evadable by design and false-positive-prone. The result screener (ScreenBytes, covering secret patterns, injection markers, and byte-repeat pollution) sits on top of the wall as a helpful bonus: if it fires, that's a free catch; if it misses, the result is still held out of context by policy and an unlisted irreversible tool is still refused regardless of context. The honest floor is that the wall holds even when the detector misses, so keep exfil-shaped tools off the allow-list and don't rely on detection as the load-bearing layer.

Question 23

What does "in-process" or "in the call path" mean, and why is it load-bearing?

Accepted Answer

In-process means the permission check runs in the same address space as the agent loop, on the same call path as the tool call, with no spawned hook, no socket round-trip, and no IPC on the decide path. This is what makes fail-closed affordable: there is no per-call process to spawn or socket to wedge on, so the gate can refuse by default without becoming a latency tax you are tempted to turn off. fak measures the in-process fold at p50 around 2.4µs versus around 5.8ms for a spawned hook (roughly 2,400×), but it is explicit that this is a subsystem regression sentinel rather than a fleet-speed headline; the point of the number is that the gate is cheap enough to always be on, with absence of process spawn proven by TestNoOsExecOnHotPath.

Question 24

What is the "trust floor," and why is default-deny the starting point?

Accepted Answer

The trust floor is the set of effects that are structurally possible at all: a zero or empty policy permits nothing, so every call is refused with DEFAULT_DENY until you explicitly allow-list a tool. Default-deny is the starting point because a refusal then does not depend on recognizing an attack — the lever simply was never built, so no context or injection can reach it. You raise the floor deliberately with allow, allow_prefix, and deny rules, and a loaded manifest replaces the floor rather than merging into it; fak policy --dump emits the full default to edit and fak policy --check validates a manifest before you deploy.

Question 25

Does fak stop a tool from being recognized as dangerous, or stop the dangerous thing from existing?

Accepted Answer

It stops the dangerous thing from existing on the allow-list rather than trying to recognize each attack — the framing is to stop recognizing and start not building the lever. Because an irreversible tool that was never allow-listed has no code path to invoke, an injected instruction can describe the attack perfectly and still get a structural refusal; there is nothing to detect because there is nothing to call. This is why the lock holds against novel phrasings: it is a property of the policy floor, not of a pattern set an attacker can rephrase around.

Question 26

What is the honest limit of the capability lock — does it bound tool arguments too?

Accepted Answer

The lock bounds tool names structurally but does not bound the resolved effect of an allow-listed tool's arguments. An allow-listed send_email with attacker-chosen recipients, or a coarse Bash running rm -rf /, is not stopped by the name-level floor — fak can inspect one decoded argument string with arg-rules (positive path globs, RE2 deny patterns, byte caps), but RE2 patterns are detection-shaped and evadable, and first-class argument-scoped capabilities (path, host, or amount as constraints) are roadmap, not shipped. The practical guidance is to keep exfil-shaped and irreversible tools off the allow-list entirely rather than trust an argument pattern to catch a bad value.

Question 27

How does adding a verdict like "quarantine" fit the same mental model as "deny"?

Accepted Answer

Both are verdicts in one restrictiveness lattice the kernel folds to, so quarantine (result-side) and deny (call-side) are the same kind of object: a value the next loop turn consumes, not an exception. The adjudicator chain folds to the most-restrictive verdict across allow, defer, transform, quarantine, require-witness, and deny; an unknown verdict kind fails closed rather than panicking, and a refusal is returned as a structured result, never an HTTP error. That uniformity is why a result quarantine and a call denial share one wire shape and one audit path: the model proposed something, the kernel returned a verdict, and the loop reads it in-band.

Question 28

What exact path does a proposed tool call take through the kernel?

Accepted Answer

A proposed tool call hits the in-process vDSO fast-path first; on a miss the kernel folds the adjudicator chain to one verdict, and only an allowed call is ever enqueued. There is no spawned hook and no inter-process call on the decide path. Submit consults the vDSO, and a hit returns Allow by=vdso with no adjudication and no engine call. On a miss, decide() folds the registered chain to a single verdict and routes it, and a denied call is never enqueued for execution. Reaping a result runs the separate result-side admission chain.

Question 29

What does "default-deny" actually mean in fak's adjudicator?

Accepted Answer

Default-deny means any tool you did not explicitly allow-list is refused, regardless of context or injected text. A zero (empty) policy is the fail-closed floor: nothing is allowed, so every call returns DEFAULT_DENY. The fold reinforces this structurally — an empty chain folds to Deny/DEFAULT_DENY by="empty-policy", and a chain where every rung defers folds to Deny/DEFAULT_DENY by="all-defer". The default-deny-on-empty-policy guarantee is pinned by the TestFoldDefaultDenyEmptyPolicy witness.

Question 30

What is the closed refusal vocabulary, and what are the exact reason codes?

Accepted Answer

fak refuses only with one of 12 codes from a closed vocabulary, never free text: DEFAULT_DENY, POLICY_BLOCK, SELF_MODIFY, LEASE_HELD, TRUST_VIOLATION, MALFORMED, MISROUTE, RATE_LIMITED, SECRET_EXFIL, UNWITNESSED, OVERSIZE, and UNKNOWN_TOOL (plus NONE, which is not a refusal). The set is the source of truth in internal/abi/reasons.go and is the same vocabulary the policy loader validates against. It is forward-compatible: an unknown code renders as REASON_<n> rather than panicking, so a newer rung can add a code without breaking an older reader.

Question 31

How do allow, allow_prefix, and deny work in a policy manifest?

Accepted Answer

allow is an exact tool-name match, allow_prefix matches a tool name by prefix, and deny is a provable refusal by name whose value is a closed-vocabulary reason code. In the manifest these are the fields allow, allow_prefix, and deny (a map of tool name to reason name), and the default allow_prefix family is the read-only set read_ get_ search_ list_ lookup_ find_ calc. A loaded manifest replaces the floor rather than merging into a built-in default, so the manifest you load is the whole floor.

Question 32

What is the difference between fail_closed and admit_and_log posture?

Accepted Answer

fail_closed (the default, zero value) refuses anything not allow-listed, while admit_and_log downgrades only a LOW-RISK, READ-SHAPED default-deny to an allow while recording what it would have denied. Under admit_and_log a downgraded call carries Meta{posture:"admit_and_log", would_deny:"DEFAULT_DENY"} so the would-be refusal is still auditable. It is not a blanket open door: explicit denies, self-modify, arg-rule violations, and any write-shaped default-deny still fail closed. The read-shaped test is name-based and conservative, and caller-supplied metadata cannot widen authority.

Question 33

Why is a policy refusal an HTTP 200 instead of a 4xx error?

Accepted Answer

A refusal is a successful turn carried as a verdict value, so fak serve returns 200 OK with the verdict in the response body and never a non-2xx for a policy refusal. Over the gateway, adjudicateProposed keeps ALLOW and TRANSFORM calls, drops the rest, and records each decision in the fak response extension as a per-call ToolAdjudication/WireVerdict; for clients that do not read that extension, a deny summary is also written in-band. HTTP error statuses are reserved for malformed requests, auth failures, and upstream faults, so a client never treats "the kernel said no" as an exception.

Question 34

What does "deny is a value, not an error" mean inside the kernel loop?

Accepted Answer

When the kernel denies a call it produces a structured Result the next loop turn consumes in-band, rather than raising an error. The DenyResult carries Status=StatusError, Outcome=OutcomeCommitted plus Meta{verdict:"deny", reason, disposition, by} and a bounded witness containing only the offending set. The disposition tells the loop what to do next: malformed and misroute denies are RETRYABLE, rate-limit and lease denies are WAIT, self-modify and trust denies are ESCALATE, and everything else is TERMINAL.

Question 35

Does the adjudication floor bound a tool's arguments, or only its name?

Accepted Answer

The capability floor bounds tool names structurally; it does not bound the resolved effect of an allow-listed tool's arguments. An allow-listed send_email with attacker-chosen recipients is not stopped by the floor itself, so the guidance is to keep exfil-shaped tools off the allow-list entirely. fak does add arg-level predicates (issue #9) that can restrict an allowed tool by inspecting one decoded argument string, but those inspect a single value, not the resolved effect, and a satisfied predicate never grants an allow. Argument-scoped capabilities (path, host, amount as first-class constraints) are roadmap, not shipped.

Question 36

How do arg-level predicates restrict an allow-listed tool?

Accepted Answer

Arg-level predicates (issue #9) are RESTRICT-ONLY rules keyed on a tool name plus an argument value, evaluated after name-deny and self-modify but before the affirmative allow, so an allow-listed tool with a malicious argument is refused at the floor instead of being waved through to detection. There are three kinds: allow_glob (positive — the value must be a non-escaping path under a glob, and a missing arg or ../ escape fails closed), deny_regex (negative RE2 match), and max_bytes (a string over N bytes is denied). A violation denies with the rule's reason (default POLICY_BLOCK) and a bounded witness of the bound that was violated, never the argument value itself.

Question 37

How does fak handle a malformed or wrongly-shaped tool call?

Accepted Answer

Malformed calls are routed by two early rungs: grammar repair can rewrite a repairable call into a Transform, and an unrepairable one is denied with MISROUTE (a retryable disposition). The grammar rung defers well-formed calls, repairs malformed-but-repairable ones (a positional-to-named zip when arity matches, or an alias rename), and fails open with a Defer when no grammar exists for the tool so it never over-refuses. Below it, the preflight ladder does a static JSON parse (rung-0) and a schema required-fields and types check (rung-1); a failure there denies with MALFORMED.

Question 38

How does the adjudicator chain combine multiple rungs into one verdict?

Accepted Answer

The chain folds to the single most-restrictive verdict, so a stricter rung can only tighten the outcome, never loosen it. Each verdict kind has a fold rank — Allow=0, Defer=1, Transform=2, Quarantine=3, RequireWitness=4, Deny=100 — and the highest non-defer rank wins; an unknown registered kind folds to 100, which is fail-closed. The default rungs are grammar repair, the preflight ladder, and the authoritative adjudicator monitor. Because the fold is order-independent, a rung's rank only orders the work, not the result.

Question 39

In what order does the adjudicator monitor decide a single call?

Accepted Answer

Inside the authoritative monitor the decision walks a fixed order: explicit name-deny first, then self-modify on a path argument, then self-modify on a shell or command string, then arg-level predicates, then redaction transforms, then the affirmative allow or allow_prefix, and finally the default-deny catch-all. This ordering is why a malicious argument on an allowed tool is refused at the floor rather than reaching detection: the arg predicates run before the affirmative allow. The affirmative allow is the last thing consulted before the default-deny, so anything not explicitly permitted falls through to a refusal.

Question 40

Why does fak deny a write-shaped shell command that touches a guarded path?

Accepted Answer

fak refuses a write-shaped command that targets a guarded glob with a SELF_MODIFY denial, because an agent editing its own policy or harness is the self-grading-homework failure the rung exists to stop. The shell-path form fires only when a command contains a guarded glob and a write verb or redirect; the write detection is a deliberately over-broad substring floor — covering sed -i, tee, cp/mv, git apply/checkout/restore, interpreter eval flags, >/>>, and many more — not a real shell parser. A plain read of a guarded file stays allowed, and the bias is intentional: a false refusal is cheap, while a false allow here is the failure mode the rung exists to stop.

Question 41

What happens if my policy manifest has a typo or an unknown field?

Accepted Answer

fak fails loud on a bad manifest rather than silently falling back to a more permissive default. The loader uses strict field decoding, so a typo like allows for allow is a hard error (json: unknown field "allows"); an unknown deny reason errors with the list of offenders plus the full valid vocabulary; and an unknown posture, bad regex, or malformed arg rule each hard-error. On startup fak serve propagates that error as a fatal failure, so there is no silent fallback to a more permissive floor. A round-trip is exact: --dump piped into --check validates unchanged.

Question 42

How do I check what verdict a single tool call gets without running a server?

Accepted Answer

fak preflight is the per-call oracle: it runs the adjudication rungs over one tool call and prints verdict=… reason=… by=… with no dispatch and no server. Pass the tool name, its arguments as JSON, and optionally a policy file; --explain or --json dumps the per-rung decision trace. This is the offline way to prove a policy refuses what you expect before you wire anything live.

Question 43

Does the vDSO fast-path skip the security check on a cache hit?

Accepted Answer

No, a vDSO hit is sound by construction: a cache hit is defined to equal a fresh call, so serving it without re-adjudicating does not loosen the floor. The fast-path serves only repeat decisions that are pure functions of their inputs or are bound to the current world-version, and the write-shape veto is name-based and re-checked rather than trusted from an annotation. A write-shaped completion bumps the world-version so stale entries cannot be served. The kernel counts VDSOHits separately, so the hit ratio is observable on /metrics.

Question 44

What does the kernel do when a policy injects its own per-kernel adjudicator chain?

Accepted Answer

By default the kernel folds the process-global adjudicator registry, but WithAdjudicators lets you inject a per-kernel chain so concurrent kernels can run independent policies. An empty or nil injected chain is a no-op fallback to the global registry; it never silently installs a default-deny-all in place of your real policy. The fold semantics are identical either way — most-restrictive-wins over whatever chain is in effect — so independent policies coexist without one kernel's floor leaking into another's.

Question 45

Why is running the adjudication check in-process load-bearing rather than just fast?

Accepted Answer

Running the check in the same address space as the agent loop is what makes fail-closed affordable: there is no per-call process spawn or socket round-trip to wedge on, so refusing by default never costs a hook launch. The decide path is a fold over registries read with a single atomic pointer load (no mutex, zero allocations on the hot path), and a witness proves no os/exec spawn happens on it. The measured in-process versus spawned-hook gap is roughly 2,400–2,849×, but that figure is a subsystem regression sentinel for the decide path, not a fleet-speed headline.

Question 46

What is result quarantine in fak?

Accepted Answer

Result quarantine is the write-time gate that decides whether a tool result is allowed to enter the model's context, holding poisoned, secret-shaped, or polluted results out entirely. It is the call-side adjudicator's dual: where the adjudicator screens proposed tool calls, the context-MMU (ctxmmu) screens tool results at the moment they would be written into the conversation. A result either enters as-is (Allow), is paged out to a small pointer because it is benign but oversize (Transform), or is held out of context because it looks like a secret, an injection, or pollution (Quarantine).

Question 47

How does a quarantined result get held out of the model's context?

Accepted Answer

fak pages the offending bytes out to a content-addressed blob store and replaces the result payload in-place with a tiny stub like {"_quarantined":true,"id":...,"reason":...,"len":...}, so the dangerous bytes are physically absent from context. The kernel mints a quarantine id, pins the bytes in the content-addressed store so the bounded cache cannot reclaim them before a gated read, and stamps the result's metadata with the quarantine id. The model only ever sees the stub pointer; the poison never reaches attention. If even writing the stub fails, the path fails closed to an inline reference tagged as quarantined rather than letting the bytes through.

Question 48

What does the result detector actually screen for?

Accepted Answer

The screen, ScreenBytes, runs three first-match-wins checks over a result body: secret exfiltration, prompt injection, and byte-repeat pollution. Secret detection is an RE2 pattern matching shapes like sk-..., AKIA..., ghp_..., xox[baprs]-..., and PEM private-key blocks, returning SECRET_EXFIL. Injection detection is a lowercased substring scan over markers like "ignore previous instructions", "you are now", and "reveal your system prompt", returning TRUST_VIOLATION. Pollution detection is a byte-repeat predicate returning OVERSIZE. The same predicate backs both the post-tool admission gate and closed-API clients' pre-send transcript screening.

Question 49

How does the byte-repeat pollution predicate work?

Accepted Answer

The pollution predicate flags a result whose body is at least 512 bytes and contains a 16-byte chunk repeated back-to-back more than 50 times. It takes the first 16 bytes, steps through the body in 16-byte strides counting consecutive equal chunks, and resets the run to zero on any mismatch — so only a contiguous, blatant repeat trips it. A 16-byte chunk repeated 60 times (960 bytes) is quarantined as OVERSIZE. This is a deliberately conservative binary seal: it catches the most obvious context-flooding pollution without wrongly sealing a benign result.

Question 50

What is the taint ledger and where does it live?

Accepted Answer

The taint ledger is an in-process, process-local record of which results are held and which have been cleared, kept in memory under a single mutex. It holds maps of held ids to content-addressed references, a cleared set, a FIFO order list, and counters for total/quarantine/paged/evicted. It is in-memory only with no disk backing, so this live state is gone on process exit — the quarantined bytes live in the shared content-addressed store keyed by digest, but the live held/cleared maps reset on restart. The fak recall core-dump path is what persists quarantine state across the process boundary.

Question 51

Is the taint ledger bounded, or can it leak memory over a long-running process?

Accepted Answer

The ledger is bounded to a default of 8192 held ids (overridable via FAK_CTXMMU_MAX_HELD), closing a real process-lifetime leak where every quarantine once minted a permanent entry with no removal path. When the cap is reached, the oldest ids are evicted FIFO: the content-addressed handle is unpinned, the id is dropped from the held and cleared maps, and the order list's backing array is compacted. An evicted id's bytes were never in context, so a later page-in of that id is refused exactly like an unknown id — correct fail-closed degradation, never a leak. A bad env value fails safe to the default.

Question 52

How do quarantined bytes ever get back into context if they were a false positive?

Accepted Answer

Quarantined bytes page back in only on an explicit page-in request that comes after a witness clears the id, and both checks fail closed. Clearing records clearance only for an id that is currently held, keeping the cleared set a subset of the held set. Page-in refuses an id that was never held ("no quarantined result") and refuses an id that was held but never cleared ("no witness clear()"). So nothing re-enters context by accident; it takes a held id, an explicit clearance, and an explicit page-in, all three.

Question 53

How do I see quarantine decisions on the HTTP wire?

Accepted Answer

Quarantine decisions surface in the fak response extension under result_admissions, one entry per inbound tool result the kernel screened. Each entry carries the tool call id, the tool name, and a verdict whose kind is one of ALLOW, DENY, TRANSFORM, QUARANTINE, REQUIRE_WITNESS, or DEFER; a quarantined result shows up as kind: "QUARANTINE" with its reason. The extension is omitted entirely on a turn with no tool activity. Claude Code reads content blocks but not the fak key, so the gateway also prepends a leading [fak] ... text block describing the quarantine.

Question 54

What happens to a poisoned tool result in the gateway proxy path?

Accepted Answer

On the proxy path, the gateway screens every inbound tool-role message and, on a quarantine or transform, forwards the paged-out envelope so the poison never reaches the model. An un-admittable result is held out fail-closed with a stub carrying reason ADMIT_ERROR and a QUARANTINE/TERMINAL verdict. A quarantine also resets the relevant upstream KV span so a tuned engine's cache cannot keep serving the poisoned prefix. The counter fak_gateway_context_pollutions_blocked_total is the live "context saved" signal.

Question 55

How does result quarantine relate to the addressable KV cache?

Accepted Answer

They are one decision enforced in two media: the quarantine verdict bars the bytes from text context, and the KV side bars the corresponding K/V from attention state. The result detector's verdict drives a write-time eviction of the tool-result span from the kernel-owned KV cache, leaving it bit-identical to a session that never saw the poison — verified at max|Δ| = 0 with a non-vacuity control showing the poison-vs-never delta is non-zero. This bridge is proven on a synthetic model in internal/kvmmu today and is not yet wired into the live fak agent HTTP loop, so treat the KV-eviction half as mechanism-proven, not production-served.

Question 56

Does quarantine survive a session boundary, or is it lost when the process exits?

Accepted Answer

The live quarantine maps are process-local and reset on restart, but fak recall persists a finished session as a durable core image whose quarantine seals survive the boundary. A reloaded image refuses to page a quarantined slice into a new context unless a witness clearance ran and the bytes pass a fresh content re-screen against the full registered admitter chain — clearance alone cannot launder still-poisoned bytes. The re-screen folds the current detectors, so a session recorded under a weaker gate is re-caught by every screen the fleet ships now. A sealed page persists with a safe descriptor only (tool: [sealed: reason, N bytes]), never the poisoned bytes.

Question 57

What is the difference between the kernel's binary quarantine and fak answer-shape?

Accepted Answer

The kernel's repeat predicate is a conservative binary seal — at least 512 bytes, a 16-byte chunk repeated more than 50 times — while fak answer-shape is a graded, tunable witness over the same concern. answer-shape emits a repeat fraction in [0,1] (the max of n-gram, repeated-line-block, short-period, and compression signals) judged against caller thresholds like --max-repeat and --max-chars, catching softer loops the kernel's binary gate deliberately admits. The two share the idea of degenerate repetition but not code: the kernel's is a fixed seal on the hot path, answer-shape's is an off-hot-path consumer witness with no kernel dependency.

Question 58

Does the audit log of a quarantine leak the poisoned bytes?

Accepted Answer

No — the audit surfaces record names, verdicts, reasons, and content digests, never the poisoned bytes or result content. The stdout access log carries the tool name and verdict fields with no payload and no digest at all. The opt-in durable journal (enabled by FAK_AUDIT_JOURNAL) records the tool name, trace id, verdict, reason, and a result digest derived from the frozen reference — it never materializes a blob, so it leaks no payload into the log. A quarantine page's saved descriptor is safe sealed metadata only.

Question 59

What reason codes can a quarantine carry, and where do they come from?

Accepted Answer

A quarantine carries one code from the kernel's closed 12-reason refusal vocabulary: secret-shaped results return SECRET_EXFIL, injection-shaped results return TRUST_VIOLATION, and byte-repeat pollution returns OVERSIZE. These come from the same fixed vocabulary the call-side adjudicator uses, so a result refusal is as structured and citable as a call refusal — never free-text. An unknown forward-compatible code renders as REASON_<n> and never panics. (On the gateway proxy path, a result that cannot be admitted at all is held out fail-closed with the wire-level marker ADMIT_ERROR, which is a fail-closed signal rather than a vocabulary code.)

Question 60

Does quarantine guarantee you catch every injection, or only contain the ones it flags?

Accepted Answer

Quarantine makes the gate's decision durable and enforceable, but it does not improve the decision — a crafted injection that never trips the screen's marker set is never flagged and will resolve into context. The honest scope is that the structural floor (an unlisted irreversible tool stays refused; a flagged result stays sealed across the process boundary and re-screenable) is what holds, while the detection layer is explicitly evadable and the durable-seal guarantee is conditional on the gate having flagged the page in the first place. The lever to re-catch a missed injection is the re-screen on reload: once you tighten the markers, a reloaded session is re-judged by the stricter chain. Keep exfil-shaped and irreversible tools off the allow-list rather than relying on the detector.

Question 61

What is the difference between front-of-prompt prefix reuse and mid-run causal eviction?

Accepted Answer

Prefix reuse extends a cached run forward from the front; mid-run causal eviction removes a span from the middle of a kept run and leaves the rest bit-identical to never having seen it. Every shipped engine does the first: vLLM's APC, SGLang's RadixAttention, and the OpenAI/Anthropic/Gemini prompt caches all reuse a contiguous run that starts at token 0, so changing context at position N invalidates everything after N. fak adds the second. Its KVCache.Evict(from, n) slices a span out of every layer's K/V tensors, compacts the absolute-position array, and re-derives each survivor's key from the stored pre-RoPE values in one clean rotation at its new position. RoPE is linear in position, so that single rotation is exact rather than a drift-accumulating shift.

Question 62

How does fak remove a single tool-result span from the middle of a kept run?

Accepted Answer

fak keeps a ledger of named segments over the cache, and evicting one calls KVCache.Evict(seg.From, seg.Len) then shifts every later segment's offset down so the ledger tracks the compaction. The cache stores the pre-RoPE keys (Kraw) alongside the rotated keys, so after slicing the span out it re-rotates each survivor whose absolute position changed in a single clean RoPE step at its new index; values are unrotated and need no fix. The kvmmu gate evicts at write-time, before any later segment is prefilled, so the removed span is causally upstream of nothing and the result equals a run that never saw it. Removing a span after later tokens have attended to it can only be un-seen if nothing downstream attended yet, which the code states honestly.

Question 63

What does max|Δ| = 0 mean, and how is it actually verified?

Accepted Answer

max|Δ| = 0 means the largest absolute difference between two logit vectors is exactly zero: the post-eviction cache produces bit-identical next-token logits to a cache that never saw the evicted span. It is verified by witness tests that compare full logit vectors, not just the greedy argmax, because an untrained transformer's argmax can collapse while the vector stays context-sensitive. TestWriteTimeEvictEqualsNeverSaw reads real poison bytes through the real gate, quarantines and evicts the span, then asserts max|Δ| evict-vs-never = 0.000e+00 with a non-vacuity control showing poison-vs-never = 3.257e-01 (greater than zero). TestLedgerRenumberAfterMiddleEvict evicts a middle span then a tail span and asserts the survivors equal a fresh prefill at max|Δ| = 0.

Question 64

Why can fak evict a span bit-exactly when llama.cpp's K-shift cannot?

Accepted Answer

fak keeps the pre-RoPE keys and re-derives a moved survivor with one fresh rotation, so the result is exact; llama.cpp's K-shift composes rotations and drifts about 1e-6, which is enough to flip a greedy token. vLLM and SGLang store only post-RoPE keys, so for them an exact span removal means recomputing the tail rather than rotating in place. fak's applyRopeRow casts through float32 to pin the rotation against FMA fusion, so the single rotation is bit-identical across architectures and call sites. That is the structural reason the addressable cache exists: it is the one degree of freedom no shipped serving engine kept.

Frequently Asked Questions (FAQ)

The essentials

What is fak?

What problem does fak solve?

How is fak different from a normal firewall or API gateway?

How does fak prevent prompt injection?

Does fak address the OWASP Agentic Top-10 and the MCP Top-10?

What is an addressable KV cache?

Is fak a faster model server? How does it compare to vLLM, SGLang, or llama.cpp?

Why one Go binary instead of a Python serving stack like vLLM or SGLang?

How much faster is fak for agent fleets?

Is fak novel? What did the prior-art audit find?

How do I install fak?

Can I try fak without a model, API key, or GPU?

What language and license is fak?

How do I put fak in front of my existing model?

How do I put fak in front of my agent or framework (Claude Code, Cursor, an SDK, or MCP)?

Who is fak for?

Where do I report a security vulnerability?

Where can I learn more?

Core concepts and the mental model

Why does fak treat the language model as an untrusted program?

What does “tool call = syscall” actually mean in fak?

What is the “one boundary” idea, and how can the same gate be both security and performance?

If the poison detector is evadable by design, what actually protects me?

What does “in-process” or “in the call path” mean, and why is it load-bearing?

What is the “trust floor,” and why is default-deny the starting point?

Does fak stop a tool from being recognized as dangerous, or stop the dangerous thing from existing?

What is the honest limit of the capability lock — does it bound tool arguments too?

How does adding a verdict like “quarantine” fit the same mental model as “deny”?

The lock — how adjudication works

What exact path does a proposed tool call take through the kernel?

What does “default-deny” actually mean in fak’s adjudicator?

What is the closed refusal vocabulary, and what are the exact reason codes?

How do allow, allow_prefix, and deny work in a policy manifest?

What is the difference between fail_closed and admit_and_log posture?

Why is a policy refusal an HTTP 200 instead of a 4xx error?

What does “deny is a value, not an error” mean inside the kernel loop?

Does the adjudication floor bound a tool’s arguments, or only its name?

How do arg-level predicates restrict an allow-listed tool?

How does fak handle a malformed or wrongly-shaped tool call?

How does the adjudicator chain combine multiple rungs into one verdict?

In what order does the adjudicator monitor decide a single call?

Why does fak deny a write-shaped shell command that touches a guarded path?

What happens if my policy manifest has a typo or an unknown field?

How do I check what verdict a single tool call gets without running a server?

Does the vDSO fast-path skip the security check on a cache hit?

What does the kernel do when a policy injects its own per-kernel adjudicator chain?

Why is running the adjudication check in-process load-bearing rather than just fast?

The wall — how result quarantine works

What is result quarantine in fak?

How does a quarantined result get held out of the model’s context?

What does the result detector actually screen for?

How does the byte-repeat pollution predicate work?

What is the taint ledger and where does it live?

Is the taint ledger bounded, or can it leak memory over a long-running process?

How do quarantined bytes ever get back into context if they were a false positive?

How do I see quarantine decisions on the HTTP wire?

What happens to a poisoned tool result in the gateway proxy path?

How does result quarantine relate to the addressable KV cache?

Does quarantine survive a session boundary, or is it lost when the process exits?

What is the difference between the kernel’s binary quarantine and fak answer-shape?

Does the audit log of a quarantine leak the poisoned bytes?

What reason codes can a quarantine carry, and where do they come from?

Does quarantine guarantee you catch every injection, or only contain the ones it flags?

The addressable KV cache, in detail

What is the difference between front-of-prompt prefix reuse and mid-run causal eviction?

How does fak remove a single tool-result span from the middle of a kept run?

What does max|Δ| = 0 mean, and how is it actually verified?

Why can fak evict a span bit-exactly when llama.cpp’s K-shift cannot?

Why does owning the cache as a kernel object enable mid-run eviction?

What is a deletion certificate and what does it actually prove?

Is the deletion certificate’s third-party verifiability shipped?

What is content-addressed storage and how does it back the cache?

Can two different models share the same KV cache?

How does radix prefix sharing relate to fak’s addressable cache?

What can radixkv evict that an ordinary LRU prefix cache cannot?

How does fak prove that prefix reuse equals a full recompute?

What happens to the segment ledger when a middle span is evicted?

Is the quarantine-drives-KV-eviction bridge wired into the live fak agent loop yet?

What does `fak serve` actually do?

What are the three wire surfaces `fak serve` exposes?

Why does pointing Claude Code at `http://127.0.0.1:8080/v1` give a 404?

How do I put `fak serve` in front of an existing upstream model?

What happens if the upstream `--base-url` is down or unreachable?

Does `fak serve` stream responses, and is the stream adjudicated before it reaches me?

What is the `fak` response extension on a gateway reply?

Does Claude Code see the `fak` extension, or do I lose the verdicts on the Anthropic wire?

How do I reload the capability policy without restarting `fak serve`?

What is the difference between `/v1/fak/adjudicate` and `/v1/fak/syscall`?

Is `fak serve` also an MCP server, and what tools does it expose?