ttslearn.dsp.logmelspectrogram

ttslearn.dsp.logmelspectrogram(y, sr, n_fft=None, hop_length=None, win_length=None, n_mels=80, fmin=None, fmax=None, clip=0.001)[source]

Compute log-melspectrogram.

Parameters
  • y (ndarray) – Waveform.

  • sr (int) – Sampling rate.

  • n_fft (int, optional) – FFT size.

  • hop_length (int, optional) – Hop length. Defaults to 12.5ms.

  • win_length (int, optional) – Window length. Defaults to 50 ms.

  • n_mels (int, optional) – Number of mel bins. Defaults to 80.

  • fmin (int, optional) – Minimum frequency. Defaults to 0.

  • fmax (int, optional) – Maximum frequency. Defaults to sr / 2.

  • clip (float, optional) – Clip the magnitude. Defaults to 0.001.

Returns

Log-melspectrogram.

Return type

numpy.ndarray