Tts
Catalog
espnet2.tts.abs_tts.AbsTTS
espnet2.tts.espnet_model.ESPnetTTSModel
espnet2.tts.fastspeech.fastspeech.FastSpeech
espnet2.tts.fastspeech2.fastspeech2.FastSpeech2
espnet2.tts.fastspeech2.loss.FastSpeech2Loss
espnet2.tts.fastspeech2.variance_predictor.VariancePredictor
espnet2.tts.feats_extract.abs_feats_extract.AbsFeatsExtract
espnet2.tts.feats_extract.dio.Dio
espnet2.tts.feats_extract.energy.Energy
espnet2.tts.feats_extract.linear_spectrogram.LinearSpectrogram
espnet2.tts.feats_extract.log_mel_fbank.LogMelFbank
espnet2.tts.feats_extract.log_spectrogram.LogSpectrogram
espnet2.tts.feats_extract.yin.cumulativeMeanNormalizedDifferenceFunction
espnet2.tts.feats_extract.yin.cumulativeMeanNormalizedDifferenceFunctionTorch
espnet2.tts.feats_extract.yin.differenceFunction
espnet2.tts.feats_extract.yin.differenceFunction_np
espnet2.tts.feats_extract.yin.differenceFunctionTorch
espnet2.tts.feats_extract.ying.Ying
espnet2.tts.gst.style_encoder.MultiHeadedAttention
espnet2.tts.gst.style_encoder.ReferenceEncoder
espnet2.tts.gst.style_encoder.StyleEncoder
espnet2.tts.gst.style_encoder.StyleTokenLayer
espnet2.tts.prodiff.denoiser.Mish
espnet2.tts.prodiff.denoiser.noise_scheduler
espnet2.tts.prodiff.denoiser.ResidualBlock
espnet2.tts.prodiff.denoiser.SpectogramDenoiser
espnet2.tts.prodiff.loss.gaussian
espnet2.tts.prodiff.loss.ProDiffLoss
espnet2.tts.prodiff.loss.SSimLoss
espnet2.tts.prodiff.prodiff.ProDiff
espnet2.tts.tacotron2.tacotron2.Tacotron2
espnet2.tts.transformer.transformer.Transformer
espnet2.tts.utils.duration_calculator.DurationCalculator
espnet2.tts.utils.parallel_wavegan_pretrained_vocoder.ParallelWaveGANPretrainedVocoder