class espnet2.gan_codec.shared.discriminator.stft_discriminator.ComplexSTFTDiscriminator(*, in_channels: int = 1, channels: int = 32, strides: Any = [[1, 2], [2, 2], [1, 2], [2, 2], [1, 2], [2, 2]], chan_mults: List[int] = [1, 2, 4, 4, 8, 8], n_fft: int = 1024, hop_length: int = 256, win_length: int = 1024, stft_normalized: bool = False, logits_abs: bool = True)
Bases: Module
Complex STFT Discriminator used in SoundStream.
Initialize Complex STFT Discriminator used in SoundStream.
Adapted from
- Parameters:
- in_channels (int) – Number of input channels.
- channels (int) – Output channel.
- strides (List[List(int, int)]) – Detailed strides in the conv2d modules.
- chan_mults (List[int]) – Channel multipliers.
- n_fft (int) – n_fft in the STFT.
- hop_length (int) – hop_length in the STFT.
- win_length (int) – win_length in the STFT.
- stft_normalized (bool) – Whether to normalize the STFT output.
- logits_abs (bool) – Whether to use the absolute value of the output logits.
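The arithmetic implied by the default arguments can be sketched in plain Python. This is not the ESPnet implementation: it assumes each entry of `chan_mults` scales the base `channels` value, and that the STFT is one-sided with centered frames (the `torch.stft` defaults). The helper `stft_shape` is a hypothetical name used only for illustration.

```python
# Default constructor arguments, copied from the class signature.
channels = 32
chan_mults = [1, 2, 4, 4, 8, 8]
strides = [[1, 2], [2, 2], [1, 2], [2, 2], [1, 2], [2, 2]]
n_fft = 1024
hop_length = 256

# Assumption: each multiplier scales the base channel count per layer.
layer_channels = [channels * m for m in chan_mults]

# Cumulative stride along each of the two conv2d axes.
total_stride = [1, 1]
for s0, s1 in strides:
    total_stride[0] *= s0
    total_stride[1] *= s1


def stft_shape(num_samples):
    """Hypothetical helper: (frequency bins, frames) of the STFT.

    Assumes a one-sided STFT with centered frames.
    """
    n_freq = n_fft // 2 + 1
    n_frames = 1 + num_samples // hop_length
    return n_freq, n_frames


print(layer_channels)     # per-layer channel widths
print(total_stride)       # overall downsampling per axis
print(stft_shape(16000))  # STFT size for a 16000-sample input
```

Under these assumptions, the six strided layers reduce one axis by a factor of 8 and the other by a factor of 64. Which of the two corresponds to time and which to frequency depends on the tensor layout used inside the module.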
forward(x: Tensor)
Calculate forward propagation.
- Parameters: x (Tensor) – Input signal (B, 1, T).
- Returns: List of lists of discriminator outputs.
- Return type: List[List[Tensor]]
Reference: : Paper: Implementation: