Web & Dev

Computer Architecture Diagram

Modern CPU architecture overview — cores, caches, pipelines, memory controller, and interconnect.

Major blocks

BlockRole
CoreExecutes instructions — fetch, decode, execute, retire
L1 instruction / data cacheSmall, per-core, 1–2 cycle access
L2 cacheLarger per-core or shared pair, ~10 cycles
L3 / last-level cache (LLC)Shared across cores on socket, tens of cycles
Memory controllerInterfaces to DDR DRAM (2–4 channels)
PCIe root complexConnects to GPU, NVMe, network
Chipset / IO hubUSB, SATA, slower IO
Coherence fabricRing, mesh, or point-to-point between cores
Power / clock managementDVFS, C-states, P-states

Pipeline stages (simplified)

FetchGet instructions from I-cache; branch prediction
DecodeConvert to micro-ops (µops)
RenameMap architectural → physical registers
DispatchIssue to reservation stations
ExecuteInteger / FP / load-store units
WritebackStore result in physical register
RetireCommit to architectural state (in program order)

Modern CPU features

  • Superscalar: multiple instructions issue per cycle.
  • Out-of-order: executes ready ops first; retires in order.
  • SMT / hyperthreading: 2 threads share one core's front end.
  • SIMD (AVX, NEON): operate on many data elements per instruction.
  • Branch prediction: modern predictors reach >95% accuracy.
  • Speculative execution: guess branches; rollback if wrong (Spectre/Meltdown mitigations).
  • NUMA: multiple sockets with local memory — pin workloads to local CPU.
  • Chiplets / tiles: AMD / Intel split cores and IO across dies.
Was this article helpful?