espnet2.asr.encoder.avhubert_encoder.time_masking
Less than 1 minute
espnet2.asr.encoder.avhubert_encoder.time_masking
espnet2.asr.encoder.avhubert_encoder.time_masking(xs_pad, min_T=5, max_T=20)
Masking Contiguous Frames with random length of [min_T, max_T]