Skip to main content
Demos
Roadmap
ESPnet2
Demo
Course
ESPnet-EZ
ESPnet EZ
ESPnet1 (Legacy)
ESPnet1
Recipes
What is a recipe template?
Automatic Speech Recognition (Multi-tasking)
Automatic Speech Recognition with Discrete Units
Speaker Verification Spoofing and Countermeasures
Speech Codec
Speaker Diarisation
Speech Enhancement
Speech Recognition with Speech Enhancement
Speaker Diarisation with Speech Enhancement
Speech-to-Text Translation with Speech Enhancement
Language Modeling
Machine Translation
Speech-to-Speech Translation
Weakly-supervised Learning (Speech-to-Text)
Spoken Language Understanding
Speech Language Model
Speaker Representation
Self-supervised Learning
Speech-to-Text Translation
Singing Voice Synthesis
Text-to-Speech
Text-to-Speech with Discrete Units
Unsupervised Automatic Speech Recognition
Python API
espnet
asr
distributed
lm
mt
nets
optimizer
scheduler
st
transform
tts
utils
vc
espnet2
asr
asr_transducer
asvspoof
diar
enh
fileio
fst
gan_codec
gan_svs
gan_tts
hubert
iterators
layers
lm
main_funcs
mt
optimizers
s2st
s2t
samplers
schedulers
slu
speechlm
spk
st
svs
tasks
text
torch_utils
train
tts
tts2
uasr
utils
espnetez
config
data
dataloader
dataset
preprocess
task
trainer
Shell API
espnet2_bin
espnet_bin
spm
utils
utils_py
Search
Ctrl
K
Gan Svs
Less than 1 minute
Catalog
espnet2.gan_svs.abs_gan_svs.AbsGANSVS
espnet2.gan_svs.avocodo.avocodo.AvocodoDiscriminator
espnet2.gan_svs.avocodo.avocodo.AvocodoDiscriminatorPlus
espnet2.gan_svs.avocodo.avocodo.AvocodoGenerator
espnet2.gan_svs.avocodo.avocodo.CoMBD
espnet2.gan_svs.avocodo.avocodo.CoMBDBlock
espnet2.gan_svs.avocodo.avocodo.get_padding
espnet2.gan_svs.avocodo.avocodo.MDC
espnet2.gan_svs.avocodo.avocodo.MDCDConfig
espnet2.gan_svs.avocodo.avocodo.SBD
espnet2.gan_svs.avocodo.avocodo.SBDBlock
espnet2.gan_svs.espnet_model.ESPnetGANSVSModel
espnet2.gan_svs.joint.joint_score2wav.JointScore2Wav
espnet2.gan_svs.pits.modules.WN
espnet2.gan_svs.pits.ying_decoder.YingDecoder
espnet2.gan_svs.post_frontend.fused.FusedPostFrontends
espnet2.gan_svs.post_frontend.s3prl.S3prlPostFrontend
espnet2.gan_svs.uhifigan.sine_generator.SineGen
espnet2.gan_svs.uhifigan.uhifigan.UHiFiGANGenerator
espnet2.gan_svs.utils.expand_f0.expand_f0
espnet2.gan_svs.visinger2.ddsp.amp_to_impulse_response
espnet2.gan_svs.visinger2.ddsp.extract_loudness
espnet2.gan_svs.visinger2.ddsp.extract_pitch
espnet2.gan_svs.visinger2.ddsp.fft_convolve
espnet2.gan_svs.visinger2.ddsp.gru
espnet2.gan_svs.visinger2.ddsp.harmonic_synth
espnet2.gan_svs.visinger2.ddsp.init_kernels
espnet2.gan_svs.visinger2.ddsp.mean_std_loudness
espnet2.gan_svs.visinger2.ddsp.mlp
espnet2.gan_svs.visinger2.ddsp.multiscale_fft
espnet2.gan_svs.visinger2.ddsp.remove_above_nyquist
espnet2.gan_svs.visinger2.ddsp.resample
espnet2.gan_svs.visinger2.ddsp.safe_log
espnet2.gan_svs.visinger2.ddsp.scale_function
espnet2.gan_svs.visinger2.ddsp.upsample
espnet2.gan_svs.visinger2.visinger2_vocoder.BaseFrequenceDiscriminator
espnet2.gan_svs.visinger2.visinger2_vocoder.ConvReluNorm
espnet2.gan_svs.visinger2.visinger2_vocoder.create_fb_matrix
espnet2.gan_svs.visinger2.visinger2_vocoder.Generator_Harm
espnet2.gan_svs.visinger2.visinger2_vocoder.Generator_Noise
espnet2.gan_svs.visinger2.visinger2_vocoder.LayerNorm
espnet2.gan_svs.visinger2.visinger2_vocoder.MelScale
espnet2.gan_svs.visinger2.visinger2_vocoder.MultiFrequencyDiscriminator
espnet2.gan_svs.visinger2.visinger2_vocoder.TorchSTFT
espnet2.gan_svs.visinger2.visinger2_vocoder.VISinger2Discriminator
espnet2.gan_svs.visinger2.visinger2_vocoder.VISinger2VocoderGenerator
espnet2.gan_svs.vits.duration_predictor.DurationPredictor
espnet2.gan_svs.vits.generator.VISingerGenerator
espnet2.gan_svs.vits.length_regulator.LengthRegulator
espnet2.gan_svs.vits.modules.Projection
espnet2.gan_svs.vits.modules.sequence_mask
espnet2.gan_svs.vits.phoneme_predictor.PhonemePredictor
espnet2.gan_svs.vits.pitch_predictor.Decoder
espnet2.gan_svs.vits.prior_decoder.PriorDecoder
espnet2.gan_svs.vits.text_encoder.TextEncoder
espnet2.gan_svs.vits.vits.VITS
Prev
Gan Codec
Next
Gan Tts