Chain of thought
Description
An explicit reasoning trace is exposed as part of the output, not just consumed internally. The agent “thinks out loud” — produces intermediate steps that lead to the conclusion — and the trace is itself part of the deliverable, not just scaffolding. The structural property: making reasoning visible changes the reasoning itself, by forcing structure on what might otherwise be quick-pattern-matching. The diagnostic question — “is the trace doing work, or is it decoration?” — separates real chain-of-thought from “showing the right answer with intermediate steps that don’t actually constrain it.” Real CoT has the property that following the trace produces the conclusion; decoration has steps that could be replaced or removed without changing the answer.Triggers
User-initiated: User asks for reasoning steps, “show your work,” “think step by step,” or describes opacity in agent outputs. Vocabulary cues: “chain of thought,” “CoT,” “think step by step,” “show your work,” “reasoning trace.” Agent-initiated: Agent recognizes that exposing intermediate reasoning will improve its own output or audit-ability. Candidate inference: “this task is reasoning-heavy; produce a chain-of-thought before concluding.” Situation-shape signals: Tasks where the agent could pattern-match to a wrong answer; multi-step reasoning where one intermediate error invalidates the whole conclusion; outputs that need to be auditable; debugging conversations where the path-to-error matters more than the error itself.Exclusions
- Trivial tasks — asking for chain-of-thought on “what’s 2+2” produces ritual without value.
- Time-critical decisions — when latency budgets don’t allow extended reasoning, CoT’s iteration cost exceeds its quality benefit.
- Pattern-matchable tasks where intuition is reliable — some tasks are better solved by direct pattern recognition; forcing CoT can degrade performance.
- Adversarial settings — exposing reasoning trace can leak information (e.g., revealing how a security system makes decisions).
- Decorative CoT — when steps don’t actually constrain the answer, CoT becomes performance not substance; the agent could replace the trace with random plausible steps and still produce the same conclusion.
Structure
Relationships
- reflection — chain-of-thought + reflection = think-then-reflect-on-thinking; the trace becomes the object of self-critique.
- loop-completion — CoT makes gaps in reasoning visible; without the trace, missing steps are hard to catch.
- doctrine — many CoT prompts are explicit doctrines (“when answering math problems: show work; check answer; verify against constraints”).
- evaluator-optimizer — CoT makes the generator’s output more critique-able for the evaluator; the trace gives the evaluator something to assess beyond the final answer.
- load-bearing — diagnostic: which steps in the trace are load-bearing? Removing decorative steps tightens the chain.
Examples
LLM prompting (Wei et al. 2022) · computer-science
LLM prompting (Wei et al. 2022) · computer-science
Mathematical proofs · mathematics
Mathematical proofs · mathematics
Atul Gawande (2009), *The Checklist Manifesto* — checklists as institutionalized chain-of-thought. · medicine-and-health
Atul Gawande (2009), *The Checklist Manifesto* — checklists as institutionalized chain-of-thought. · medicine-and-health
Code review discussions · computer-science
Code review discussions · computer-science
Engineering design docs · computer-science
Engineering design docs · computer-science
Kojima, T., Gu, S. S., Reid, M., Matsuo, Y., & Iwasawa, Y. (2022). "Large Language Models are Zero-Shot Reasoners." *Advances in Neural Information Processing Systems* (NeurIPS) 35 — zero-shot chain-of-thought via "Let's think step by step." · computer-science
Kojima, T., Gu, S. S., Reid, M., Matsuo, Y., & Iwasawa, Y. (2022). "Large Language Models are Zero-Shot Reasoners." *Advances in Neural Information Processing Systems* (NeurIPS) 35 — zero-shot chain-of-thought via "Let's think step by step." · computer-science
Legal briefs and judicial opinions · law
Legal briefs and judicial opinions · law
Scientific papers · philosophy
Scientific papers · philosophy
Software engineering: design docs (Amazon's narrative-memo culture; Google's design-doc tradition). · computer-science
Software engineering: design docs (Amazon's narrative-memo culture; Google's design-doc tradition). · computer-science
Surgical checklists explained · medicine-and-health
Surgical checklists explained · medicine-and-health
Wei et al. (2022), "Chain-of-Thought Prompting Elicits Reasoning in Large Language Models" — https://arxiv.org/abs/2201.11903; Kojima et al. (2022), "Large Language Models are Zero-Shot Reasoners"; broader lineage in mathematical proofs, legal briefs, and scientific papers. · computer-science
Wei et al. (2022), "Chain-of-Thought Prompting Elicits Reasoning in Large Language Models" — https://arxiv.org/abs/2201.11903; Kojima et al. (2022), "Large Language Models are Zero-Shot Reasoners"; broader lineage in mathematical proofs, legal briefs, and scientific papers. · computer-science