Skip to main content
Tutorials
Full ESPnet installation
ESPnet2
ESPnet1
Training configurations
Recipe tips
Audio formatting
Task class and data input system
Docker
Job scheduling system
Distributed training
Document Generation
Demos
Roadmap
ESPnet2
/notebook/ESPnet2/Demo/
/notebook/ESPnet2/Course/
ESPnet-EZ
/notebook/ESPnetEZ/
ESPnet1 (Legacy)
/notebook/ESPnet1/
Recipes
What is a recipe template?
Automatic Speech Recognition (Multi-tasking)
Automatic Speech Recognition with Discrete Units
Speaker Verification Spoofing and Countermeasures
Classification
Speech Codec
Speaker Diarisation
Speech Enhancement
Speech Recognition with Speech Enhancement
Speaker Diarisation with Speech Enhancement
Speech-to-Text Translation with Speech Enhancement
Self-supervised Learning
Language Identification
Language Modeling
Machine Translation
Speech-to-Speech Translation
Weakly-supervised Learning (Speech-to-Text)
ESPnet-SDS
Spoken Language Understanding
Speech Language Model
Speaker Representation
Self-supervised Learning
Speech-to-Text Translation
Singing Voice Synthesis
ESPnet2 SVS2 Recipe TEMPLATE
Text-to-Speech
Text-to-Speech with Discrete Units
Unsupervised Automatic Speech Recognition
Python API
espnet2
asr
asr_transducer
asvspoof
beats
cls
diar
enh
fileio
fst
gan_codec
gan_svs
gan_tts
hubert
iterators
layers
legacy
lid
lm
main_funcs
mt
optimizers
ps2st
s2st
s2t
samplers
schedulers
sds
slu
speechlm
spk
ssl
st
svs
tasks
text
torch_utils
train
tts
tts2
uasr
utils
Shell API
espnet2_bin
spm
utils
utils_py
Search
Ctrl
K
Beats
Less than 1 minute
Catalog
espnet2.beats.audio_tokenizer.AudioTokenizer
espnet2.beats.audio_tokenizer.load_beats_config
espnet2.beats.encoder.BeatsConfig
espnet2.beats.encoder.BeatsEncoder
espnet2.beats.encoder.BeatsPretrainingPredictor
espnet2.beats.encoder.gelu
espnet2.beats.encoder.gelu_accurate
espnet2.beats.encoder.get_activation_fn
espnet2.beats.encoder.GLU_Linear
espnet2.beats.encoder.GradMultiply
espnet2.beats.encoder.init_bert_params
espnet2.beats.encoder.MultiheadAttention
espnet2.beats.encoder.quant_noise
espnet2.beats.encoder.SamePad
espnet2.beats.encoder.Swish
espnet2.beats.encoder.TransformerEncoder
espnet2.beats.encoder.TransformerSentenceEncoderLayer
espnet2.beats.random_tokenizer.RandomProjectionQuantizer
espnet2.beats.tokenizer.BeatsRandomTokenizer
espnet2.beats.tokenizer.BeatsTokenizer
espnet2.beats.tokenizer.BeatsTokenizerConfig
espnet2.beats.tokenizer.BeatsTokenizerPretrainingPredictor
espnet2.beats.tokenizer.EmbeddingEMA
espnet2.beats.tokenizer.NormEMAVectorQuantizer
espnet2.beats.utils.beats_frontend
espnet2.beats.utils.ema_inplace
espnet2.beats.utils.forward_padding_mask_conv
espnet2.beats.utils.freeze_conv_module
espnet2.beats.utils.kmeans
espnet2.beats.utils.l2norm
espnet2.beats.utils.make_pad_mask
espnet2.beats.utils.norm_ema_inplace
espnet2.beats.utils.sample_vectors
Prev
Asvspoof
Next
Cls