STFT understanding using librosa
Question

I have an audio sample of about 14 seconds at an 8 kHz sample rate. I'm using librosa to extract some features from this audio file.

    import numpy as np
    import librosa
    import librosa.display

    n_fft = 2048  # librosa default

    y, sr = librosa.load(file_name)
    stft = np.abs(librosa.stft(y, n_fft=n_fft))

    # file_length = 14.650022675736961  # sec
    # defaults:
    # hop_length = 512  # win_length / 4 = n_fft / 4 = 512 (win_length = n_fft by default)
    # window_time = n_fft * Ts  # Ts = 1 / sr

    stft.shape
    # (1025, 631)

Specshow:

    librosa.display.specshow(stft, x_axis='time', y_axis='log')

[![stft sr = 22050