functional_sliding_window_cmn {torchaudio} | R Documentation |
sliding-window Cepstral Mean Normalization (functional)
Description
Apply sliding-window cepstral mean (and optionally variance) normalization per utterance.
Usage
functional_sliding_window_cmn(
waveform,
cmn_window = 600,
min_cmn_window = 100,
center = FALSE,
norm_vars = FALSE
)
Arguments
waveform |
(Tensor): Tensor of audio of dimension (..., freq, time) |
cmn_window |
(int, optional): Window in frames for running average CMN computation (int, default = 600) |
min_cmn_window |
(int, optional): Minimum CMN window used at start of decoding (adds latency only at start).
Only applicable if center == |
center |
(bool, optional): If |
norm_vars |
(bool, optional): If |
Value
tensor
: Tensor of freq of dimension (..., frame)
[Package torchaudio version 0.3.1 Index]