Skip to main content
Tutorials
Full ESPnet installation
ESPnet2
ESPnet1
Training configurations
Recipe tips
Audio formatting
Task class and data input system
Docker
Job scheduling system
Distributed training
Document Generation
Demos
Roadmap
ESPnet2
Demo
Course
ESPnet-EZ
ESPnet EZ
ESPnet1 (Legacy)
ESPnet1
Recipes
What is a recipe template?
Automatic Speech Recognition (Multi-tasking)
Automatic Speech Recognition with Discrete Units
Speaker Verification Spoofing and Countermeasures
Classification
Speech Codec
Speaker Diarisation
Speech Enhancement
Speech Recognition with Speech Enhancement
Speaker Diarisation with Speech Enhancement
Speech-to-Text Translation with Speech Enhancement
Self-supervised Learning
Language Modeling
Machine Translation
Speech-to-Speech Translation
Weakly-supervised Learning (Speech-to-Text)
ESPnet-SDS
Spoken Language Understanding
Speech Language Model
Speaker Representation
Self-supervised Learning
Speech-to-Text Translation
Singing Voice Synthesis
Text-to-Speech
Text-to-Speech with Discrete Units
Unsupervised Automatic Speech Recognition
Python API
espnet
asr
distributed
lm
mt
nets
optimizer
scheduler
st
transform
tts
utils
vc
espnet2
asr
asr_transducer
asvspoof
cls
diar
enh
fileio
fst
gan_codec
gan_svs
gan_tts
hubert
iterators
layers
lm
main_funcs
mt
optimizers
s2st
s2t
samplers
schedulers
sds
slu
speechlm
spk
ssl
st
svs
tasks
text
torch_utils
train
tts
tts2
uasr
utils
espnetez
config
data
dataloader
dataset
preprocess
task
trainer
Shell API
espnet2_bin
espnet_bin
spm
utils
utils_py
Search
Ctrl
K
Text
Less than 1 minute
Catalog
espnet2.text.abs_tokenizer.AbsTokenizer
espnet2.text.build_tokenizer.build_tokenizer
espnet2.text.char_tokenizer.CharTokenizer
espnet2.text.cleaner.TextCleaner
espnet2.text.hugging_face_token_id_converter.HuggingFaceTokenIDConverter
espnet2.text.hugging_face_tokenizer.HuggingFaceTokenizer
espnet2.text.korean_cleaner.KoreanCleaner
espnet2.text.phoneme_tokenizer.G2p_en
espnet2.text.phoneme_tokenizer.G2pk
espnet2.text.phoneme_tokenizer.IsG2p
espnet2.text.phoneme_tokenizer.Jaso
espnet2.text.phoneme_tokenizer.PhonemeTokenizer
espnet2.text.phoneme_tokenizer.Phonemizer
espnet2.text.phoneme_tokenizer.pyopenjtalk_g2p
espnet2.text.phoneme_tokenizer.pyopenjtalk_g2p_accent
espnet2.text.phoneme_tokenizer.pyopenjtalk_g2p_accent_with_pause
espnet2.text.phoneme_tokenizer.pyopenjtalk_g2p_kana
espnet2.text.phoneme_tokenizer.pyopenjtalk_g2p_prosody
espnet2.text.phoneme_tokenizer.pypinyin_g2p
espnet2.text.phoneme_tokenizer.pypinyin_g2p_phone
espnet2.text.phoneme_tokenizer.pypinyin_g2p_phone_without_prosody
espnet2.text.phoneme_tokenizer.split_by_space
espnet2.text.sentencepiece_tokenizer.SentencepiecesTokenizer
espnet2.text.token_id_converter.TokenIDConverter
espnet2.text.whisper_token_id_converter.OpenAIWhisperTokenIDConverter
espnet2.text.whisper_tokenizer.OpenAIWhisperTokenizer
espnet2.text.word_tokenizer.WordTokenizer
Prev
Tasks
Next
Torch Utils