I have the following code.
x = keras.layers.Input(batch_shape = (None, 4096)) hidden = keras.layers.Dense(512, activation = \'relu\')(x) hidden = keras.layers.Ba
These 2048 parameters are in fact [gamma weights, beta weights, moving_mean(non-trainable), moving_variance(non-trainable)], each having 512 elements (the size of the input layer).
[gamma weights, beta weights, moving_mean(non-trainable), moving_variance(non-trainable)]