Skip to main content
Tutorials
Full ESPnet installation
ESPnet2
ESPnet1
Training configurations
Recipe tips
Audio formatting
Task class and data input system
Docker
Job scheduling system
Distributed training
Document Generation
Demos
Roadmap
ESPnet2
Demo
Course
ESPnet-EZ
ESPnet EZ
ESPnet1 (Legacy)
ESPnet1
Recipes
What is a recipe template?
Automatic Speech Recognition (Multi-tasking)
Automatic Speech Recognition with Discrete Units
Speaker Verification Spoofing and Countermeasures
Classification
Speech Codec
Speaker Diarisation
Speech Enhancement
Speech Recognition with Speech Enhancement
Speaker Diarisation with Speech Enhancement
Speech-to-Text Translation with Speech Enhancement
Self-supervised Learning
Language Modeling
Machine Translation
Speech-to-Speech Translation
Weakly-supervised Learning (Speech-to-Text)
ESPnet-SDS
Spoken Language Understanding
Speech Language Model
Speaker Representation
Self-supervised Learning
Speech-to-Text Translation
Singing Voice Synthesis
Text-to-Speech
Text-to-Speech with Discrete Units
Unsupervised Automatic Speech Recognition
Python API
espnet
asr
distributed
lm
mt
nets
optimizer
scheduler
st
transform
tts
utils
vc
espnet2
asr
asr_transducer
asvspoof
cls
diar
enh
fileio
fst
gan_codec
gan_svs
gan_tts
hubert
iterators
layers
lm
main_funcs
mt
optimizers
s2st
s2t
samplers
schedulers
sds
slu
speechlm
spk
ssl
st
svs
tasks
text
torch_utils
train
tts
tts2
uasr
utils
espnetez
config
data
dataloader
dataset
preprocess
task
trainer
Shell API
espnet2_bin
espnet_bin
spm
utils
utils_py
Search
Ctrl
K
Gan Svs
Less than 1 minute
Catalog
espnet2.gan_svs.abs_gan_svs.AbsGANSVS
espnet2.gan_svs.avocodo.avocodo.AvocodoDiscriminator
espnet2.gan_svs.avocodo.avocodo.AvocodoDiscriminatorPlus
espnet2.gan_svs.avocodo.avocodo.AvocodoGenerator
espnet2.gan_svs.avocodo.avocodo.CoMBD
espnet2.gan_svs.avocodo.avocodo.CoMBDBlock
espnet2.gan_svs.avocodo.avocodo.get_padding
espnet2.gan_svs.avocodo.avocodo.MDC
espnet2.gan_svs.avocodo.avocodo.MDCDConfig
espnet2.gan_svs.avocodo.avocodo.SBD
espnet2.gan_svs.avocodo.avocodo.SBDBlock
espnet2.gan_svs.espnet_model.ESPnetGANSVSModel
espnet2.gan_svs.joint.joint_score2wav.JointScore2Wav
espnet2.gan_svs.pits.modules.WN
espnet2.gan_svs.pits.ying_decoder.YingDecoder
espnet2.gan_svs.post_frontend.fused.FusedPostFrontends
espnet2.gan_svs.post_frontend.s3prl.S3prlPostFrontend
espnet2.gan_svs.uhifigan.sine_generator.SineGen
espnet2.gan_svs.uhifigan.uhifigan.UHiFiGANGenerator
espnet2.gan_svs.utils.expand_f0.expand_f0
espnet2.gan_svs.visinger2.ddsp.amp_to_impulse_response
espnet2.gan_svs.visinger2.ddsp.extract_loudness
espnet2.gan_svs.visinger2.ddsp.extract_pitch
espnet2.gan_svs.visinger2.ddsp.fft_convolve
espnet2.gan_svs.visinger2.ddsp.gru
espnet2.gan_svs.visinger2.ddsp.harmonic_synth
espnet2.gan_svs.visinger2.ddsp.init_kernels
espnet2.gan_svs.visinger2.ddsp.mean_std_loudness
espnet2.gan_svs.visinger2.ddsp.mlp
espnet2.gan_svs.visinger2.ddsp.multiscale_fft
espnet2.gan_svs.visinger2.ddsp.remove_above_nyquist
espnet2.gan_svs.visinger2.ddsp.resample
espnet2.gan_svs.visinger2.ddsp.safe_log
espnet2.gan_svs.visinger2.ddsp.scale_function
espnet2.gan_svs.visinger2.ddsp.upsample
espnet2.gan_svs.visinger2.visinger2_vocoder.BaseFrequenceDiscriminator
espnet2.gan_svs.visinger2.visinger2_vocoder.ConvReluNorm
espnet2.gan_svs.visinger2.visinger2_vocoder.create_fb_matrix
espnet2.gan_svs.visinger2.visinger2_vocoder.Generator_Harm
espnet2.gan_svs.visinger2.visinger2_vocoder.Generator_Noise
espnet2.gan_svs.visinger2.visinger2_vocoder.LayerNorm
espnet2.gan_svs.visinger2.visinger2_vocoder.MelScale
espnet2.gan_svs.visinger2.visinger2_vocoder.MultiFrequencyDiscriminator
espnet2.gan_svs.visinger2.visinger2_vocoder.TorchSTFT
espnet2.gan_svs.visinger2.visinger2_vocoder.VISinger2Discriminator
espnet2.gan_svs.visinger2.visinger2_vocoder.VISinger2VocoderGenerator
espnet2.gan_svs.vits.duration_predictor.DurationPredictor
espnet2.gan_svs.vits.generator.VISingerGenerator
espnet2.gan_svs.vits.length_regulator.LengthRegulator
espnet2.gan_svs.vits.modules.Projection
espnet2.gan_svs.vits.modules.sequence_mask
espnet2.gan_svs.vits.phoneme_predictor.PhonemePredictor
espnet2.gan_svs.vits.pitch_predictor.Decoder
espnet2.gan_svs.vits.prior_decoder.PriorDecoder
espnet2.gan_svs.vits.text_encoder.TextEncoder
espnet2.gan_svs.vits.vits.VITS
Prev
Gan Codec
Next
Gan Tts