Sds
Catalog
espnet2.sds.asr.abs_asr.AbsASR
espnet2.sds.asr.espnet_asr.ESPnetASRModel
espnet2.sds.asr.owsm_asr.OWSMModel
espnet2.sds.asr.owsm_ctc_asr.OWSMCTCModel
espnet2.sds.asr.whisper_asr.WhisperASRModel
espnet2.sds.end_to_end.abs_e2e.AbsE2E
espnet2.sds.end_to_end.mini_omni_e2e.MiniOmniE2EModel
espnet2.sds.end_to_end.mini_omni.inference.A1_A2
espnet2.sds.end_to_end.mini_omni.inference.A1_A2_batch
espnet2.sds.end_to_end.mini_omni.inference.A1_T1
espnet2.sds.end_to_end.mini_omni.inference.A1_T2
espnet2.sds.end_to_end.mini_omni.inference.download_model
espnet2.sds.end_to_end.mini_omni.inference.get_input_ids_TA
espnet2.sds.end_to_end.mini_omni.inference.get_input_ids_TT
espnet2.sds.end_to_end.mini_omni.inference.get_input_ids_whisper
espnet2.sds.end_to_end.mini_omni.inference.get_input_ids_whisper_ATBatch
espnet2.sds.end_to_end.mini_omni.inference.load_audio
espnet2.sds.end_to_end.mini_omni.inference.load_model
espnet2.sds.end_to_end.mini_omni.inference.OmniInference
espnet2.sds.end_to_end.mini_omni.inference.T1_A2
espnet2.sds.end_to_end.mini_omni.inference.T1_T2
espnet2.sds.end_to_end.mini_omni.inference.test_infer
espnet2.sds.end_to_end.mini_omni.litgpt.config.Config
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate_AA
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate_ASR
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate_AT
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate_TA
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate_TA_BATCH
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate_TT
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.multinomial_num_samples_1
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.next_token
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.next_token_A1T1
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.next_token_A1T2
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.next_token_asr
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.next_token_batch
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.sample
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.sample_top_p
espnet2.sds.end_to_end.mini_omni.litgpt.model.apply_rope
espnet2.sds.end_to_end.mini_omni.litgpt.model.Block
espnet2.sds.end_to_end.mini_omni.litgpt.model.build_mask_cache
espnet2.sds.end_to_end.mini_omni.litgpt.model.build_rope_cache
espnet2.sds.end_to_end.mini_omni.litgpt.model.CausalSelfAttention
espnet2.sds.end_to_end.mini_omni.litgpt.model.GemmaMLP
espnet2.sds.end_to_end.mini_omni.litgpt.model.GPT
espnet2.sds.end_to_end.mini_omni.litgpt.model.GptNeoxMLP
espnet2.sds.end_to_end.mini_omni.litgpt.model.KVCache
espnet2.sds.end_to_end.mini_omni.litgpt.model.LLaMAMLP
espnet2.sds.end_to_end.mini_omni.litgpt.model.LLaMAMoE
espnet2.sds.end_to_end.mini_omni.litgpt.model.RMSNorm
espnet2.sds.end_to_end.mini_omni.litgpt.model.whisperMLP
espnet2.sds.end_to_end.mini_omni.litgpt.tokenizer.Tokenizer
espnet2.sds.end_to_end.mini_omni.litgpt.utils.capture_hparams
espnet2.sds.end_to_end.mini_omni.litgpt.utils.check_valid_checkpoint_dir
espnet2.sds.end_to_end.mini_omni.litgpt.utils.choose_logger
espnet2.sds.end_to_end.mini_omni.litgpt.utils.chunked_cross_entropy
espnet2.sds.end_to_end.mini_omni.litgpt.utils.CLI
espnet2.sds.end_to_end.mini_omni.litgpt.utils.copy_config_files
espnet2.sds.end_to_end.mini_omni.litgpt.utils.CycleIterator
espnet2.sds.end_to_end.mini_omni.litgpt.utils.estimate_flops
espnet2.sds.end_to_end.mini_omni.litgpt.utils.extend_checkpoint_dir
espnet2.sds.end_to_end.mini_omni.litgpt.utils.find_multiple
espnet2.sds.end_to_end.mini_omni.litgpt.utils.find_resume_path
espnet2.sds.end_to_end.mini_omni.litgpt.utils.flops_per_param
espnet2.sds.end_to_end.mini_omni.litgpt.utils.get_argument_names
espnet2.sds.end_to_end.mini_omni.litgpt.utils.get_default_supported_precision
espnet2.sds.end_to_end.mini_omni.litgpt.utils.incremental_save
espnet2.sds.end_to_end.mini_omni.litgpt.utils.IncrementalPyTorchPickler
espnet2.sds.end_to_end.mini_omni.litgpt.utils.init_out_dir
espnet2.sds.end_to_end.mini_omni.litgpt.utils.instantiate_bnb_optimizer
espnet2.sds.end_to_end.mini_omni.litgpt.utils.instantiate_torch_optimizer
espnet2.sds.end_to_end.mini_omni.litgpt.utils.load_checkpoint
espnet2.sds.end_to_end.mini_omni.litgpt.utils.map_old_state_dict_weights
espnet2.sds.end_to_end.mini_omni.litgpt.utils.num_parameters
espnet2.sds.end_to_end.mini_omni.litgpt.utils.parse_devices
espnet2.sds.end_to_end.mini_omni.litgpt.utils.reset_parameters
espnet2.sds.end_to_end.mini_omni.litgpt.utils.save_config
espnet2.sds.end_to_end.mini_omni.litgpt.utils.save_hyperparameters
espnet2.sds.end_to_end.mini_omni.litgpt.utils.SavingProxyForStorage
espnet2.sds.end_to_end.mini_omni.litgpt.utils.SavingProxyForTensor
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.generate_audio_data
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.get_snac
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.get_time_str
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.layershift
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.reconscruct_snac
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.reconstruct_tensors
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.SnacConfig
espnet2.sds.espnet_model.ESPnetSDSModelInterface
espnet2.sds.llm.abs_llm.AbsLLM
espnet2.sds.llm.hugging_face_llm.HuggingFaceLLM
espnet2.sds.tts.abs_tts.AbsTTS
espnet2.sds.tts.chat_tts.ChatTTSModel
espnet2.sds.tts.espnet_tts.ESPnetTTSModel
espnet2.sds.utils.chat.Chat
espnet2.sds.utils.utils.int2float
espnet2.sds.vad.abs_vad.AbsVAD
espnet2.sds.vad.webrtc_vad.WebrtcVADModel