Utils
Catalog
espnet.utils.check_kwargs.check_kwargs
espnet.utils.cli_readers.file_reader_helper
espnet.utils.cli_readers.HDF5Reader
espnet.utils.cli_readers.KaldiReader
espnet.utils.cli_readers.SoundHDF5Reader
espnet.utils.cli_readers.SoundReader
espnet.utils.cli_utils.assert_scipy_wav_style
espnet.utils.cli_utils.get_commandline_args
espnet.utils.cli_utils.is_scipy_wav_style
espnet.utils.cli_utils.strtobool
espnet.utils.cli_writers.BaseWriter
espnet.utils.cli_writers.file_writer_helper
espnet.utils.cli_writers.get_num_frames_writer
espnet.utils.cli_writers.HDF5Writer
espnet.utils.cli_writers.KaldiWriter
espnet.utils.cli_writers.parse_wspecifier
espnet.utils.cli_writers.SoundHDF5Writer
espnet.utils.cli_writers.SoundWriter
espnet.utils.dataset.ChainerDataLoader
espnet.utils.dataset.Transform
espnet.utils.dataset.TransformDataset
espnet.utils.deterministic_utils.set_deterministic_chainer
espnet.utils.deterministic_utils.set_deterministic_pytorch
espnet.utils.dynamic_import.dynamic_import
espnet.utils.fill_missing_args.fill_missing_args
espnet.utils.io_utils.LoadInputsAndTargets
espnet.utils.io_utils.SoundHDF5File
espnet.utils.spec_augment.apply_interpolation
espnet.utils.spec_augment.create_dense_flows
espnet.utils.spec_augment.cross_squared_distance_matrix
espnet.utils.spec_augment.dense_image_warp
espnet.utils.spec_augment.flatten_grid_locations
espnet.utils.spec_augment.freq_mask
espnet.utils.spec_augment.get_flat_grid_locations
espnet.utils.spec_augment.get_grid_locations
espnet.utils.spec_augment.interpolate_bilinear
espnet.utils.spec_augment.interpolate_spline
espnet.utils.spec_augment.phi
espnet.utils.spec_augment.solve_interpolation
espnet.utils.spec_augment.sparse_image_warp
espnet.utils.spec_augment.specaug
espnet.utils.spec_augment.time_mask
espnet.utils.spec_augment.time_warp
espnet.utils.training.batchfy.batchfy_by_bin
espnet.utils.training.batchfy.batchfy_by_frame
espnet.utils.training.batchfy.batchfy_by_seq
espnet.utils.training.batchfy.batchfy_shuffle
espnet.utils.training.batchfy.make_batchset
espnet.utils.training.evaluator.BaseEvaluator
espnet.utils.training.iterators.ShufflingEnabler
espnet.utils.training.iterators.ToggleableShufflingMultiprocessIterator
espnet.utils.training.iterators.ToggleableShufflingSerialIterator
espnet.utils.training.tensorboard_logger.TensorboardLogger
espnet.utils.training.train_utils.check_early_stop
espnet.utils.training.train_utils.set_early_stop