After reading this and this, I understand that the .mp3 encoder pads the audio with zeros at both the start and the end.
With this approach, the encoder c