Skip to main content
Demos
Roadmap
ESPnet2
Demo
Course
ESPnet-EZ
ESPnet EZ
ESPnet1 (Legacy)
ESPnet1
Recipes
What is a recipe template?
Automatic Speech Recognition (Multi-tasking)
Automatic Speech Recognition with Discrete Units
Speaker Verification Spoofing and Countermeasures
Speech Codec
Speaker Diarisation
Speech Enhancement
Speech Recognition with Speech Enhancement
Speaker Diarisation with Speech Enhancement
Speech-to-Text Translation with Speech Enhancement
Language Modeling
Machine Translation
Speech-to-Speech Translation
Weakly-supervised Learning (Speech-to-Text)
Spoken Language Understanding
Speech Language Model
Speaker Representation
Self-supervised Learning
Speech-to-Text Translation
Singing Voice Synthesis
Text-to-Speech
Text-to-Speech with Discrete Units
Unsupervised Automatic Speech Recognition
Python API
espnet
asr
distributed
lm
mt
nets
optimizer
scheduler
st
transform
tts
utils
vc
espnet2
asr
asr_transducer
asvspoof
diar
enh
fileio
fst
gan_codec
gan_svs
gan_tts
hubert
iterators
layers
lm
main_funcs
mt
optimizers
s2st
s2t
samplers
schedulers
slu
speechlm
spk
st
svs
tasks
text
torch_utils
train
tts
tts2
uasr
utils
espnetez
config
data
dataloader
dataset
preprocess
task
trainer
Shell API
espnet2_bin
espnet_bin
spm
utils
utils_py
Search
Ctrl
K
Utils
Less than 1 minute
Catalog
asr_align_wav.sh
clean_corpus.sh
convert_fbank.sh
data2json.sh
divide_lang.sh
download_from_google_drive.sh
dump_pcm.sh
dump.sh
eval_source_separation.sh
feat_to_shape.sh
free-gpu.sh
generate_wav.sh
make_fbank.sh
make_stft.sh
pack_model.sh
recog_wav.sh
reduce_data_dir.sh
remove_longshortdata.sh
score_bleu.sh
score_sclite_case.sh
score_sclite_wo_dict.sh
score_sclite.sh
show_result.sh
speed_perturb.sh
synth_wav.sh
translate_wav.sh
trim_silence.sh
update_json.sh
Prev
Spm
Next
Utils Py