at 5:58 " ... the variance comes from the network" This is not right. In DDPM, the authors made it constant and then in later studies people started to make those learnable as well.
@@moeinshariatnia59 In that formulation it is. Later in video I mentioned that it’s not necessary and model has to only predict the noise sampled from zero mean unit variance.