MAISI scaling factor #1994
Unanswered
nordinbelkacemi
asked this question in
Q&A
Replies: 0 comments
I'm having trouble understanding why a scaling factor is needed when training the diffusion model (and the ControlNet). The autoencoder is supposed to produce latents that are approximately standard normal (mean≈0, std≈1), and the scaling factor appears to be calculated from the first batch only, which makes its purpose unclear to me.
All batches are then scaled by this single number during training. Appendix B of the paper says the VAE's latents are ensured to have a standard deviation between 0.9 and 1.1, so maybe the scaling factor is meant to correct for that residual deviation from 1. But in that case, shouldn't the scaling factor be computed over the entire dataset rather than a single batch?
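To make the comparison concrete, here is a minimal sketch of the two options, not MAISI's actual code. It assumes the scaling factor is defined as 1 / std of the encoded latents; that definition, the function names, and the synthetic stand-in latents (drawn with std 0.95, inside the 0.9 to 1.1 range mentioned above) are all assumptions made for illustration.

```python
import numpy as np


def scale_factor_first_batch(batches):
    """Assumed definition: 1 / std, estimated from the first batch only."""
    first = next(iter(batches))
    return 1.0 / float(np.std(first))


def scale_factor_full_dataset(batches):
    """Same definition, but std is accumulated over every batch in one pass."""
    n = 0
    total = 0.0
    total_sq = 0.0
    for z in batches:
        z = np.asarray(z, dtype=np.float64)
        n += z.size
        total += z.sum()
        total_sq += np.square(z).sum()
    mean = total / n
    var = total_sq / n - mean**2  # population variance, E[z^2] - E[z]^2
    return 1.0 / float(np.sqrt(var))


rng = np.random.default_rng(0)
# Synthetic stand-in for VAE latents: 50 batches of shape (B, C, D, H, W).
batches = [rng.normal(0.0, 0.95, size=(2, 4, 8, 8, 8)) for _ in range(50)]

sf_first = scale_factor_first_batch(batches)
sf_full = scale_factor_full_dataset(batches)
print(f"first batch: {sf_first:.4f}, full dataset: {sf_full:.4f}")
```

On data like this the two estimates should differ only by the sampling noise of a single batch, so the question is really whether real latents vary enough from batch to batch for the difference to matter.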