Fig. 3

From: Reversible designs for extreme memory cost reduction of CNN training

Illustration of the RevNet architecture and its memory consumption. Modules contributing to the peak memory consumption are shown in red. The peak memory consumption happens during the backward pass through the first reversible block. At this step of the computations, all hidden activations within the reversible block are stored in memory simultaneously.
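
As a rough illustration of why only one block's activations need to be held at a time, the sketch below implements an additive-coupling reversible block in the RevNet style and checks that its inputs can be recomputed from its outputs. The functions `f` and `g` are hypothetical stand-ins for the block's residual sub-networks, and the names `rev_forward` and `rev_inverse` are illustrative assumptions, not code from the article.

```python
import math

def f(x):
    # Hypothetical stand-in for the first residual function F.
    return [math.tanh(v) for v in x]

def g(x):
    # Hypothetical stand-in for the second residual function G.
    return [0.5 * v * v for v in x]

def rev_forward(x1, x2):
    # Additive coupling: y1 = x1 + F(x2), y2 = x2 + G(y1).
    y1 = [a + b for a, b in zip(x1, f(x2))]
    y2 = [a + b for a, b in zip(x2, g(y1))]
    return y1, y2

def rev_inverse(y1, y2):
    # Recover the inputs from the outputs: x2 = y2 - G(y1), x1 = y1 - F(x2).
    x2 = [a - b for a, b in zip(y2, g(y1))]
    x1 = [a - b for a, b in zip(y1, f(x2))]
    return x1, x2

x1, x2 = [0.1, -0.4, 2.0], [1.5, 0.0, -0.7]
y1, y2 = rev_forward(x1, x2)
r1, r2 = rev_inverse(y1, y2)

# Largest reconstruction error over both input halves.
max_err = max(abs(a - b) for a, b in zip(x1 + x2, r1 + r2))
print(max_err < 1e-9)
```

Because the inputs are recoverable, the forward pass does not have to keep them. During the backward pass through a block, however, the reconstructed inputs and the intermediate activations inside `f` and `g` are all live together, which is the peak the caption describes.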