In many applications we only need the last layer and letting go of references to intermediate layers can save some memory during inference.