Train
Catalog
espnet2.train.abs_espnet_model.AbsESPnetModel
espnet2.train.abs_gan_espnet_model.AbsGANESPnetModel
espnet2.train.class_choices.ClassChoices
espnet2.train.collate_fn.common_collate_fn
espnet2.train.collate_fn.CommonCollateFn
espnet2.train.collate_fn.HuBERTCollateFn
espnet2.train.dataset.AbsDataset
espnet2.train.dataset.AdapterForLabelScpReader
espnet2.train.dataset.AdapterForSingingScoreScpReader
espnet2.train.dataset.AdapterForSoundScpReader
espnet2.train.dataset.ESPnetDataset
espnet2.train.dataset.ESPnetMultiTaskDataset
espnet2.train.dataset.ESPnetSpeechLMDataset
espnet2.train.dataset.H5FileWrapper
espnet2.train.dataset.kaldi_loader
espnet2.train.dataset.label_loader
espnet2.train.dataset.multi_columns_sound_loader
espnet2.train.dataset.rand_int_loader
espnet2.train.dataset.score_loader
espnet2.train.dataset.sound_loader
espnet2.train.dataset.variable_columns_sound_loader
espnet2.train.deepspeed_trainer.DeepSpeedTrainer
espnet2.train.deepspeed_trainer.DeepSpeedTrainerOptions
espnet2.train.distributed_utils.DistributedOption
espnet2.train.distributed_utils.free_port
espnet2.train.distributed_utils.get_local_rank
espnet2.train.distributed_utils.get_master_addr
espnet2.train.distributed_utils.get_master_port
espnet2.train.distributed_utils.get_node_rank
espnet2.train.distributed_utils.get_num_nodes
espnet2.train.distributed_utils.get_rank
espnet2.train.distributed_utils.get_world_size
espnet2.train.distributed_utils.is_in_slurm_job
espnet2.train.distributed_utils.is_in_slurm_step
espnet2.train.distributed_utils.resolve_distributed_mode
espnet2.train.gan_trainer.GANTrainer
espnet2.train.gan_trainer.GANTrainerOptions
espnet2.train.iterable_dataset.IterableESPnetDataset
espnet2.train.iterable_dataset.load_kaldi
espnet2.train.iterable_dataset.SplicedIterableESPnetDataset
espnet2.train.lightning_callbacks.AverageCheckpointsCallback
espnet2.train.lightning_espnet_model.LitESPnetModel
espnet2.train.preprocessor.AbsPreprocessor
espnet2.train.preprocessor.any_allzero
espnet2.train.preprocessor.CommonPreprocessor
espnet2.train.preprocessor.CommonPreprocessor_multi
espnet2.train.preprocessor.detect_non_silence
espnet2.train.preprocessor.DynamicMixingPreprocessor
espnet2.train.preprocessor.EnhPreprocessor
espnet2.train.preprocessor.framing
espnet2.train.preprocessor.LIDPreprocessor
espnet2.train.preprocessor.MutliTokenizerCommonPreprocessor
espnet2.train.preprocessor.S2TCTCPreprocessor
espnet2.train.preprocessor.S2TPreprocessor
espnet2.train.preprocessor.SLUPreprocessor
espnet2.train.preprocessor.SpeechLMPreprocessor
espnet2.train.preprocessor.SpkPreprocessor
espnet2.train.preprocessor.SVSPreprocessor
espnet2.train.preprocessor.TSEPreprocessor
espnet2.train.reporter.aggregate
espnet2.train.reporter.Average
espnet2.train.reporter.ReportedValue
espnet2.train.reporter.Reporter
espnet2.train.reporter.SubReporter
espnet2.train.reporter.to_reported_value
espnet2.train.reporter.wandb_get_prefix
espnet2.train.reporter.WeightedAverage
espnet2.train.spk_trainer.SpkTrainer
espnet2.train.trainer.Trainer
espnet2.train.trainer.TrainerOptions
espnet2.train.uasr_trainer.UASRTrainer
espnet2.train.uasr_trainer.UASRTrainerOptions