espnet2.s2st.synthesizer.translatotron2.GaussianUpsampling
class espnet2.s2st.synthesizer.translatotron2.GaussianUpsampling
Bases: Module
Gaussian Upsample.
Reference: Non-Attentive Tacotron, https://arxiv.org/abs/2010.04301
This source code follows the ExpressiveTacotron implementation by BridgetteSong: https://github.com/BridgetteSong/ExpressiveTacotron/
Initializes internal Module state, shared by both nn.Module and ScriptModule.
forward(encoder_outputs, durations, vars, input_lengths=None)
Gaussian upsampling.
- Parameters:
- encoder_outputs – encoder outputs [batch_size, hidden_length, dim]
- durations – phoneme durations [batch_size, hidden_length]
- vars – phoneme attended ranges [batch_size, hidden_length]
- input_lengths – length of each input sequence [batch_size] (optional, defaults to None)
- Returns: encoder_upsampling_outputs – upsampled encoder outputs [batch_size, frame_length, dim]
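To make the shapes above concrete, here is a minimal pure-Python sketch of Gaussian upsampling as described in the Non-Attentive Tacotron paper, written for a single unbatched sequence. It is not the ESPnet implementation: the function name `gaussian_upsample` is hypothetical, the `vars` argument is assumed to be a variance (rather than a standard deviation), and frame indices are assumed to start at 0.

```python
import math


def gaussian_upsample(encoder_outputs, durations, variances):
    """Upsample one sequence from [hidden_length][dim] to [frame_length][dim].

    Hypothetical helper for illustration only; assumptions are noted inline.
    """
    # frame_length is the total duration of the sequence.
    frame_length = int(sum(durations))

    # Centre of each token: cumulative duration minus half its own duration.
    centers = []
    elapsed = 0.0
    for d in durations:
        elapsed += d
        centers.append(elapsed - 0.5 * d)

    dim = len(encoder_outputs[0])
    frames = []
    for t in range(frame_length):  # assumption: frame indices start at 0
        # Log-density of frame index t under each token's Gaussian
        # (assumption: `variances` holds variances, not standard deviations).
        logits = [
            -0.5 * (math.log(2.0 * math.pi * var) + (t - c) ** 2 / var)
            for c, var in zip(centers, variances)
        ]
        # Softmax over tokens; subtract the max for numerical stability.
        peak = max(logits)
        weights = [math.exp(x - peak) for x in logits]
        norm = sum(weights)
        weights = [w / norm for w in weights]
        # Each output frame is a weighted sum of the token vectors.
        frames.append(
            [
                sum(w * vec[k] for w, vec in zip(weights, encoder_outputs))
                for k in range(dim)
            ]
        )
    return frames


# Two tokens with durations 2 and 3 produce 5 output frames.
frames = gaussian_upsample(
    encoder_outputs=[[1.0, 0.0], [0.0, 1.0]],
    durations=[2, 3],
    variances=[1.0, 1.0],
)
```

Because the weights for each frame are normalised over tokens, every output frame is a convex combination of the encoder outputs, with early frames dominated by the first token and late frames by the last.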
get_mask_from_lengths(lengths, max_len=None)
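This page gives no description of this helper. Assuming, as the name and signature suggest, that it builds a boolean mask from per-sequence lengths, the idea can be sketched in pure Python as follows. The function name `mask_from_lengths` is hypothetical, and whether the real method marks valid or padded positions as True is not stated here; the sketch marks valid positions as True.

```python
def mask_from_lengths(lengths, max_len=None):
    """Return a [batch_size][max_len] boolean mask.

    Hypothetical sketch: True marks positions inside each sequence,
    False marks padding. The real method may use the opposite convention.
    """
    if max_len is None:
        # Default to the longest sequence in the batch.
        max_len = max(lengths)
    return [[i < n for i in range(max_len)] for n in lengths]


mask = mask_from_lengths([1, 3])
padded_mask = mask_from_lengths([2], max_len=4)
```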