Chunking
Description
The active grouping of items into larger units to compress representation. Miller’s classic finding — human working memory holds roughly 7±2 chunks — established that the chunk, not the atomic item, is the cognitive unit of capacity. A skilled chess player remembers board states as chunks (familiar configurations) rather than 32 piece-positions; a skilled reader processes words rather than individual letters. The concept generalizes far beyond cognition: any system that hits a per-unit overhead cost can win by chunking — networking (packet headers amortize over the payload), databases (batch inserts amortize transaction cost), distributed systems (paged memory amortizes cache-miss cost), ML training (mini-batch gradients amortize per-step overhead). The chunking criterion (what goes together) is the design choice; chunk size is the dial. Distinct from grain: grain is the chosen level of resolution — the “we’re operating at word-level, not letter-level” decision; chunking is the active move that produces operation at that level. Grain is the property; chunking is the action.Triggers
User-initiated: User describes grouping items together for efficiency, or asks about batch size / granularity tradeoffs. Vocabulary cues: “chunk,” “batch,” “group,” “7±2,” “working memory,” “page,” “packet,” “batch size.” Agent-initiated: Agent notices a per-unit overhead cost in a system processing many small items, and considers whether chunking would amortize the overhead. Candidate inference: “what’s the per-unit cost; would chunking N items together amortize it; what’s the right chunk size?” Situation-shape signals: Per-unit overhead costs that scale linearly with item count. Working-memory or cache-capacity constraints. Streaming vs. batching design questions. Pagination, packetization, batch-size decisions.Exclusions
- Per-item processing has zero overhead — when there’s no fixed cost per processing step, chunking doesn’t amortize anything and adds boundary-handling complexity for no gain.
- Items must be handled with low latency individually — real-time control, high-frequency trading, single-key-press feedback; chunking adds latency that violates the requirement.
- Chunk boundaries cut natural units awkwardly — chunking by arbitrary rule rather than meaningful criterion produces fragmentation costs that exceed amortization gains.
- Chunk size already at optimum — over-chunking past the diminishing-returns point (e.g., batch size already at memory capacity) doesn’t help and can hurt.
Structure
Relationships
- grain — chunking produces a grain-shift; the chunk size is the new grain. Co-occurs in any discussion of “what level are we operating at.”
- uniformity-dividend — uniform chunk size enables downstream uniformity; chunked-but-variable-size loses the dividend. The MTU / page-size / batch-size standardization is the uniformity-dividend lens on chunking.
- flow — chunking discretizes a flow; chunked flows have different throughput, latency, and back-pressure characteristics than continuous flows.
- cadence — chunking interacts with cadence; the chunk-arrival rhythm is a temporal shape that downstream processes must accommodate.
- seam — chunk boundaries are seams; the question “what goes together vs. apart” is the seam-placement question.
Examples
Phone-number memorization · psychology
Phone-number memorization · psychology
Network packet structure · computer-science
Network packet structure · computer-science
Chase, W. G., & Simon, H. A. (1973). "Perception in chess." Cognitive Psychology, 4(1), 55-81. · psychology
Chase, W. G., & Simon, H. A. (1973). "Perception in chess." Cognitive Psychology, 4(1), 55-81. · psychology
Chess masters seeing board states · psychology
Chess masters seeing board states · psychology
Cowan, N. (2001). "The magical number 4 in short-term memory" — refinement of Miller's bound. · psychology
Cowan, N. (2001). "The magical number 4 in short-term memory" — refinement of Miller's bound. · psychology
Database batch inserts · computer-science
Database batch inserts · computer-science
Language phonology · linguistics
Language phonology · linguistics
Miller, G. A. (1956). "The Magical Number Seven, Plus or Minus Two: Some Limits on Our Capacity for Processing Information." Psychological Review, 63(2), 81-97. · psychology
Miller, G. A. (1956). "The Magical Number Seven, Plus or Minus Two: Some Limits on Our Capacity for Processing Information." Psychological Review, 63(2), 81-97. · psychology
ML mini-batch training · computer-science
ML mini-batch training · computer-science
Music phrasing · performing-arts
Music phrasing · performing-arts
Paged memory · computer-science
Paged memory · computer-science
Tanenbaum, A. S., & Bos, H. *Modern Operating Systems* (4th ed., Pearson, 2014), §3.5.3 "Page Size" — paged memory and the page-size design tradeoff. · computer-science
Tanenbaum, A. S., & Bos, H. *Modern Operating Systems* (4th ed., Pearson, 2014), §3.5.3 "Page Size" — paged memory and the page-size design tradeoff. · computer-science