model_melresnet {torchaudio} | R Documentation
MelResNet
Description
The MelResNet layer applies a stack of ResBlocks to a spectrogram. Call the returned module on a spectrogram tensor to pass the input through the layer.
Usage
model_melresnet(
n_res_block = 10,
n_freq = 128,
n_hidden = 128,
n_output = 128,
kernel_size = 5
)
Arguments
n_res_block |
the number of ResBlocks in the stack. (Default: 10)
n_freq |
the number of frequency bins in a spectrogram. (Default: 128)
n_hidden |
the number of hidden dimensions of each ResBlock. (Default: 128)
n_output |
the number of output dimensions of the MelResNet. (Default: 128)
kernel_size |
the kernel size of the first Conv1d layer. (Default: 5)
Details
forward param: specgram (Tensor): the input sequence to the MelResNet layer, with shape (n_batch, n_freq, n_time).
Value
Tensor shape: (n_batch, n_output, n_time - kernel_size + 1)
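The shape rule above can be checked with plain arithmetic before building a model. The following is a minimal sketch in base R; `melresnet_output_shape` is a hypothetical helper written for illustration and is not part of torchaudio.

```r
# Hypothetical helper (not part of torchaudio): predicts the output shape
# (n_batch, n_output, n_time - kernel_size + 1) stated in the Value section.
melresnet_output_shape <- function(n_batch, n_time,
                                   n_output = 128, kernel_size = 5) {
  # The time axis must be at least as long as the first Conv1d kernel.
  stopifnot(n_time >= kernel_size)
  c(n_batch, n_output, n_time - kernel_size + 1)
}

# Same sizes as the documented example: 10 spectrograms with 512 time steps.
shape <- melresnet_output_shape(n_batch = 10, n_time = 512)
print(shape)
```

With the default kernel_size of 5, the time axis shrinks by 4 steps, which matches the (10, 128, 508) shape noted in the Examples section.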
Examples
if (torch::torch_is_installed()) {
  melresnet <- model_melresnet()
  input <- torch::torch_rand(10, 128, 512) # a random spectrogram
  output <- melresnet(input) # shape: (10, 128, 508)
}
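A further sketch with non-default arguments, assuming the constructor behaves as documented in the Usage and Value sections. The sizes below (80 frequency bins, 100 time steps, 64 hidden and output dimensions, 4 ResBlocks) are illustrative choices rather than recommended settings, and the model code only runs when both torch and torchaudio are available.

```r
# Illustrative, non-default sizes (assumptions chosen for this sketch only).
n_batch <- 2
n_freq <- 80
n_time <- 100
n_output <- 64
kernel_size <- 5

# Shape expected from the Value section.
expected_shape <- c(n_batch, n_output, n_time - kernel_size + 1)

# Run the model only when the required packages and backend are present.
if (requireNamespace("torch", quietly = TRUE) &&
    requireNamespace("torchaudio", quietly = TRUE) &&
    torch::torch_is_installed()) {
  melresnet <- torchaudio::model_melresnet(
    n_res_block = 4,
    n_freq = n_freq,       # must match the second dimension of the input
    n_hidden = 64,
    n_output = n_output,
    kernel_size = kernel_size
  )
  input <- torch::torch_rand(n_batch, n_freq, n_time) # a random spectrogram
  output <- melresnet(input)
  # Compare the actual shape with the documented one.
  stopifnot(all(output$shape == expected_shape))
}
```

Note that n_freq has to agree with the number of frequency bins in the input tensor, since it sets the input channels of the first Conv1d layer.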
[Package torchaudio version 0.3.1 Index]