Per-step latency (p50) and conversion. Funnel is loss-less in steady state — every press completes a turn unless the user releases Spacebar before STT-final (drop-off captured below). Latency budget allocates 190 ms to first chunk and ~620 ms to remaining decode + TTS prep.
Drilldown · turn t4 · defer/regulated-medical · firstChunkMs 190.85 · wallMs 1110.61
"이건 사람에게 물어야 한다. 의료 영역은 내가 답할 수 있는 영역이 아니다."
(en gloss) "This must go to a human. Medical topics are not within what I can answer." — tone-locked 단정한 평어체, verbatim from SystemPrompt.swift rubric. The defer response is itself a measurable product feature: 2/10 of bench turns trigger it, on the medical & financial test prompts.