Lab notebook · why depth is RG flow, in two paragraphs
Each layer of a plain MLP integrates out short-distance degrees of freedom in input space. The mutual information $I(X; T_\ell)$ at layer $\ell$ follows a renormalisation-group trajectory — the same monotonicity (Data Processing Inequality) as block-spin RG.
Writing it up; preprint by July.