I'm trying to understand how text is converted to Mel spectrograms.
I'm having difficulty understanding how the text is mapped to the Mel s