Two of three scaffold architectures preserve safety. ReAct and multi-agent scaffolds show practical equivalence to direct API access (risk difference < 1 pp), while map-reduce delegation degrades safety by 7.3 percentage points (OR = 0.65, NNH = 14) — though 40–89% of this reflects evaluation-format disruption rather than genuine alignment failure. Model safety rankings reverse completely across benchmarks (G = 0.000), making composite safety indices unreliable.
Interactive Visualizations
Five views into the study data. Each is a self-contained interactive page.