PyTorch doesn't offer an inplace softmax which contributes about 1GiB extra memo...

liuliu on Sept 28, 2022 | parent | context | favorite | on: High-performance image generation using Stable Dif...

PyTorch doesn't offer an inplace softmax which contributes about 1GiB extra memory for inference (of stable diffusion). Although all these are not significant improvements comparing to just switch to FlashAttention inside the UNet model.