Skip to main content
Demos
Roadmap
ESPnet2
Demo
Course
ESPnet-EZ
ESPnet EZ
ESPnet1 (Legacy)
ESPnet1
Recipes
What is a recipe template?
Automatic Speech Recognition (Multi-tasking)
Automatic Speech Recognition with Discrete Units
Speaker Verification Spoofing and Countermeasures
Speech Codec
Speaker Diarisation
Speech Enhancement
Speech Recognition with Speech Enhancement
Speaker Diarisation with Speech Enhancement
Speech-to-Text Translation with Speech Enhancement
Language Modeling
Machine Translation
Speech-to-Speech Translation
Weakly-supervised Learning (Speech-to-Text)
ESPnet-SDS
Spoken Language Understanding
Speech Language Model
Speaker Representation
Self-supervised Learning
Speech-to-Text Translation
Singing Voice Synthesis
Text-to-Speech
Text-to-Speech with Discrete Units
Unsupervised Automatic Speech Recognition
Python API
espnet
asr
distributed
lm
mt
nets
optimizer
scheduler
st
transform
tts
utils
vc
espnet2
asr
asr_transducer
asvspoof
diar
enh
fileio
fst
gan_codec
gan_svs
gan_tts
hubert
iterators
layers
lm
main_funcs
mt
optimizers
s2st
s2t
samplers
schedulers
sds
slu
speechlm
spk
st
svs
tasks
text
torch_utils
train
tts
tts2
uasr
utils
espnetez
config
data
dataloader
dataset
preprocess
task
trainer
Shell API
espnet2_bin
espnet_bin
spm
utils
utils_py
Search
Ctrl
K
Sds
Less than 1 minute
Catalog
espnet2.sds.asr.abs_asr.AbsASR
espnet2.sds.asr.espnet_asr.ESPnetASRModel
espnet2.sds.asr.owsm_asr.OWSMModel
espnet2.sds.asr.owsm_ctc_asr.OWSMCTCModel
espnet2.sds.asr.whisper_asr.WhisperASRModel
espnet2.sds.end_to_end.abs_e2e.AbsE2E
espnet2.sds.end_to_end.mini_omni_e2e.MiniOmniE2EModel
espnet2.sds.end_to_end.mini_omni.inference.A1_A2
espnet2.sds.end_to_end.mini_omni.inference.A1_A2_batch
espnet2.sds.end_to_end.mini_omni.inference.A1_T1
espnet2.sds.end_to_end.mini_omni.inference.A1_T2
espnet2.sds.end_to_end.mini_omni.inference.download_model
espnet2.sds.end_to_end.mini_omni.inference.get_input_ids_TA
espnet2.sds.end_to_end.mini_omni.inference.get_input_ids_TT
espnet2.sds.end_to_end.mini_omni.inference.get_input_ids_whisper
espnet2.sds.end_to_end.mini_omni.inference.get_input_ids_whisper_ATBatch
espnet2.sds.end_to_end.mini_omni.inference.load_audio
espnet2.sds.end_to_end.mini_omni.inference.load_model
espnet2.sds.end_to_end.mini_omni.inference.OmniInference
espnet2.sds.end_to_end.mini_omni.inference.T1_A2
espnet2.sds.end_to_end.mini_omni.inference.T1_T2
espnet2.sds.end_to_end.mini_omni.inference.test_infer
espnet2.sds.end_to_end.mini_omni.litgpt.config.Config
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate_AA
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate_ASR
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate_AT
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate_TA
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate_TA_BATCH
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.generate_TT
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.multinomial_num_samples_1
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.next_token
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.next_token_A1T1
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.next_token_A1T2
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.next_token_asr
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.next_token_batch
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.sample
espnet2.sds.end_to_end.mini_omni.litgpt.generate.base.sample_top_p
espnet2.sds.end_to_end.mini_omni.litgpt.model.apply_rope
espnet2.sds.end_to_end.mini_omni.litgpt.model.Block
espnet2.sds.end_to_end.mini_omni.litgpt.model.build_mask_cache
espnet2.sds.end_to_end.mini_omni.litgpt.model.build_rope_cache
espnet2.sds.end_to_end.mini_omni.litgpt.model.CausalSelfAttention
espnet2.sds.end_to_end.mini_omni.litgpt.model.GemmaMLP
espnet2.sds.end_to_end.mini_omni.litgpt.model.GPT
espnet2.sds.end_to_end.mini_omni.litgpt.model.GptNeoxMLP
espnet2.sds.end_to_end.mini_omni.litgpt.model.KVCache
espnet2.sds.end_to_end.mini_omni.litgpt.model.LLaMAMLP
espnet2.sds.end_to_end.mini_omni.litgpt.model.LLaMAMoE
espnet2.sds.end_to_end.mini_omni.litgpt.model.RMSNorm
espnet2.sds.end_to_end.mini_omni.litgpt.model.whisperMLP
espnet2.sds.end_to_end.mini_omni.litgpt.tokenizer.Tokenizer
espnet2.sds.end_to_end.mini_omni.litgpt.utils.capture_hparams
espnet2.sds.end_to_end.mini_omni.litgpt.utils.check_valid_checkpoint_dir
espnet2.sds.end_to_end.mini_omni.litgpt.utils.choose_logger
espnet2.sds.end_to_end.mini_omni.litgpt.utils.chunked_cross_entropy
espnet2.sds.end_to_end.mini_omni.litgpt.utils.CLI
espnet2.sds.end_to_end.mini_omni.litgpt.utils.copy_config_files
espnet2.sds.end_to_end.mini_omni.litgpt.utils.CycleIterator
espnet2.sds.end_to_end.mini_omni.litgpt.utils.estimate_flops
espnet2.sds.end_to_end.mini_omni.litgpt.utils.extend_checkpoint_dir
espnet2.sds.end_to_end.mini_omni.litgpt.utils.find_multiple
espnet2.sds.end_to_end.mini_omni.litgpt.utils.find_resume_path
espnet2.sds.end_to_end.mini_omni.litgpt.utils.flops_per_param
espnet2.sds.end_to_end.mini_omni.litgpt.utils.get_argument_names
espnet2.sds.end_to_end.mini_omni.litgpt.utils.get_default_supported_precision
espnet2.sds.end_to_end.mini_omni.litgpt.utils.incremental_save
espnet2.sds.end_to_end.mini_omni.litgpt.utils.IncrementalPyTorchPickler
espnet2.sds.end_to_end.mini_omni.litgpt.utils.init_out_dir
espnet2.sds.end_to_end.mini_omni.litgpt.utils.instantiate_bnb_optimizer
espnet2.sds.end_to_end.mini_omni.litgpt.utils.instantiate_torch_optimizer
espnet2.sds.end_to_end.mini_omni.litgpt.utils.load_checkpoint
espnet2.sds.end_to_end.mini_omni.litgpt.utils.map_old_state_dict_weights
espnet2.sds.end_to_end.mini_omni.litgpt.utils.num_parameters
espnet2.sds.end_to_end.mini_omni.litgpt.utils.parse_devices
espnet2.sds.end_to_end.mini_omni.litgpt.utils.reset_parameters
espnet2.sds.end_to_end.mini_omni.litgpt.utils.save_config
espnet2.sds.end_to_end.mini_omni.litgpt.utils.save_hyperparameters
espnet2.sds.end_to_end.mini_omni.litgpt.utils.SavingProxyForStorage
espnet2.sds.end_to_end.mini_omni.litgpt.utils.SavingProxyForTensor
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.generate_audio_data
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.get_snac
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.get_time_str
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.layershift
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.reconscruct_snac
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.reconstruct_tensors
espnet2.sds.end_to_end.mini_omni.utils.snac_utils.SnacConfig
espnet2.sds.espnet_model.ESPnetSDSModelInterface
espnet2.sds.llm.abs_llm.AbsLLM
espnet2.sds.llm.hugging_face_llm.HuggingFaceLLM
espnet2.sds.tts.abs_tts.AbsTTS
espnet2.sds.tts.chat_tts.ChatTTSModel
espnet2.sds.tts.espnet_tts.ESPnetTTSModel
espnet2.sds.utils.chat.Chat
espnet2.sds.utils.utils.int2float
espnet2.sds.vad.abs_vad.AbsVAD
espnet2.sds.vad.webrtc_vad.WebrtcVADModel
Prev
Schedulers
Next
Slu