I try to train tfd.TransformedDistribution with stack of tfb.ffjord as bijectors. Basically, more bijectors make model deeper and having more param
tfd.TransformedDistribution
tfb.ffjord